Multi-Response Linear Regression (MRLR)
The MRLR model is an effective method for the ensemble of heterogeneous base classifiers. The main advantage of MRLR is its interpretability: it provides a method of combining the results generated by the level-0 classifiers into a final decision. The weights generated by MRLR indicate the different contributions that each base classifier makes to class prediction, which can be described as follows. Suppose the training sample set $\Phi = \{(x_i, y_i)\}_{i=1}^{N}$ contains $N$ observations, where $x_i = (x_{i1}, x_{i2}, \ldots, x_{ip})^T$ is a $p$-dimensional feature vector, $y_i$ is a class label, and $y_i \in \Gamma = \{w_1, w_2, \ldots, w_m\}$. We use the training set $\Phi$ to train $L$ different classification algorithms and obtain the ensemble $\zeta = \{C_1, C_2, \ldots, C_L\}$ of the $L$ base classifiers. We assume that each base classifier $C_i$ ($i = 1, 2, \ldots, L$) predicts an observed value as a posterior probability distribution vector:
$$P_{C_i}(x) = \big(P_{C_i}(w_1|x),\, P_{C_i}(w_2|x),\, \ldots,\, P_{C_i}(w_m|x)\big)^T = \big(P_i^1(x),\, P_i^2(x),\, \ldots,\, P_i^m(x)\big)^T, \quad i = 1, 2, \ldots, L \tag{1}$$
where $P_i^j(x)$ is the probability that pixel $x$ belongs to the $w_j$ class, obtained by the $i$-th base classifier. We can therefore describe the input data of the meta-classifier as an $m \times L$ matrix $P(x)$:
$$P(x) = \big(P_{C_1}(x),\, P_{C_2}(x),\, \ldots,\, P_{C_L}(x)\big) = \begin{pmatrix} P_1^1(x) & P_2^1(x) & \cdots & P_L^1(x) \\ \vdots & \vdots & & \vdots \\ P_1^m(x) & P_2^m(x) & \cdots & P_L^m(x) \end{pmatrix} \tag{2}$$

where column $i$ holds the posterior vector of classifier $C_i$.
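As a concrete illustration, the level-1 input of Eq. (2) is simply the column-wise stacking of each base classifier's posterior vector. A minimal NumPy sketch (the helper name `stacked_input` and the toy posterior values are illustrative, not from the paper):

```python
import numpy as np

def stacked_input(prob_vectors):
    """Build the m x L meta-classifier input P(x) of Eq. (2) by stacking
    each base classifier's m-dimensional posterior vector as a column."""
    # prob_vectors: list of L arrays, each of shape (m,)
    return np.column_stack(prob_vectors)

# Toy example: m = 3 classes, L = 2 base classifiers.
p_c1 = np.array([0.7, 0.2, 0.1])   # posterior vector from classifier C1
p_c2 = np.array([0.6, 0.3, 0.1])   # posterior vector from classifier C2
P = stacked_input([p_c1, p_c2])
print(P.shape)  # (3, 2): rows index classes w_j, columns index classifiers C_i
```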
MRLR transforms the $m$-class classification problem into $m$ regression problems. For example, for class $w_j$, if a sample has the class label $w_j$, its output value is 1; otherwise, the output value is 0. For each class $w_j$, MRLR takes each base classifier's predicted probability that $x$ belongs to class $w_j$ to establish a linear model, which is defined as:
$$LR_j(x) = \sum_{i=1}^{L} a_i^j P_i^j(x), \quad a_i^j \ge 0, \quad j = 1, 2, \ldots, m \tag{3}$$
and the estimation of the parameters $\{a_i^j\}$ usually utilizes the NNLS algorithm.
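As a sketch of this estimation step, the weights $a_i^j$ of Eq. (3) can be fitted with SciPy's `nnls` routine, which solves $\min_a \|Aa - y\|_2$ subject to $a \ge 0$; the design matrix and 0/1 targets below are simulated purely for illustration:

```python
import numpy as np
from scipy.optimize import nnls

rng = np.random.default_rng(0)
N, L = 200, 3                      # N samples, L base classifiers

# Simulated class-j probabilities from the L base classifiers (columns)
# and the 0/1 regression target of Eq. (3).
true_w = np.array([0.5, 0.3, 0.2])
A = rng.random((N, L))
y = (A @ true_w + 0.05 * rng.standard_normal(N) > 0.5).astype(float)

# Non-negative least squares: min ||A a - y||_2 subject to a >= 0.
a_j, residual = nnls(A, y)
print(a_j)          # estimated non-negative weights a_i^j for class w_j
```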
For each sample,
MRLR
utilizes the predicted values of
the
L
base classifiers to construct the input feature data but
ignores the association with neighboring pixels. In this paper,
considering the spatial information, the weighted average of
the sample’s eight neighbors is also taken into account when
constructing the feature data of the meta-classifier. The specific input data can be represented as:
$$Q(x) = \begin{pmatrix} P_1^1(x) & Q_1^1(x) & \cdots & P_L^1(x) & Q_L^1(x) \\ \vdots & \vdots & & \vdots & \vdots \\ P_1^m(x) & Q_1^m(x) & \cdots & P_L^m(x) & Q_L^m(x) \end{pmatrix} \tag{4}$$

with a column pair $(P_i^j(x), Q_i^j(x))$ for each base classifier $C_i$,
where $Q_i^j(x)$ is the weighted average of the probabilities of the eight neighboring pixels in the $w_j$ class obtained by the $i$-th base classifier.
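The $Q_i^j(x)$ term can be computed as a small sliding-window operation over each class-probability map. A minimal NumPy sketch, assuming equal weights over the eight neighbors and reflect-padding at image borders (both are illustrative choices, not specified by the text):

```python
import numpy as np

def neighbor_average(prob_map, weights=None):
    """Weighted average of the 8-neighbor probabilities for every pixel,
    i.e. the Q_i^j(x) term of Eq. (4). prob_map is an (H, W) probability
    image for one class from one base classifier."""
    if weights is None:
        # Equal weights over the 8 neighbors; the center pixel is excluded.
        weights = np.ones((3, 3)) / 8.0
        weights[1, 1] = 0.0
    padded = np.pad(prob_map, 1, mode="reflect")
    out = np.zeros_like(prob_map, dtype=float)
    for di in range(3):            # shift-and-accumulate convolution
        for dj in range(3):
            out += weights[di, dj] * padded[di:di + prob_map.shape[0],
                                            dj:dj + prob_map.shape[1]]
    return out

prob = np.full((4, 4), 0.5)        # constant toy probability map
Q = neighbor_average(prob)
print(Q[1, 1])                     # 0.5: a constant map stays constant
```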
To estimate the model parameters in MRLR, we propose using the FOA and compare it with the NNLS algorithm (Li and Ngom, 2013). The NNLS algorithm is the most commonly used method for parameter estimation of the MRLR model. The FOA is one of the recently developed swarm optimization algorithms and has global optimization ability (Iscan and Gunduz, 2015). Moreover, FOA is a stable algorithm that solves problems quickly.
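To make the idea concrete, the sketch below shows a heavily simplified, multidimensional FOA-style random search for the non-negative weights of Eq. (3); the swarm size, step scheme, and all parameter names are illustrative assumptions rather than the exact formulation of the cited algorithm:

```python
import numpy as np

def foa_nnls(A, y, n_flies=30, n_iter=200, step=0.1, seed=0):
    """Simplified fruit-fly-style search for non-negative weights:
    flies take random steps around the best-known position (smell phase)
    and the swarm moves to the fly with the lowest residual norm
    (vision phase). Clipping at zero enforces a >= 0."""
    rng = np.random.default_rng(seed)
    L = A.shape[1]
    best = np.abs(rng.random(L))                 # initial swarm position
    best_cost = np.linalg.norm(A @ best - y)
    for _ in range(n_iter):
        # Each fly searches randomly around the current best position.
        flies = np.clip(best + step * rng.standard_normal((n_flies, L)),
                        0, None)
        costs = np.linalg.norm(flies @ A.T - y, axis=1)
        i = np.argmin(costs)
        if costs[i] < best_cost:                 # vision phase: move swarm
            best, best_cost = flies[i], costs[i]
    return best, best_cost

A = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
y = np.array([0.4, 0.6, 1.0])
w, cost = foa_nnls(A, y)
print(w, cost)   # weights near [0.4, 0.6], residual near 0
```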
Construction of Multi-Source Feature Dataset
and Automatic Selection of Training Samples
As aforementioned, many studies have demonstrated the effectiveness of combining texture, morphological, and spectral features. The gray level co-occurrence matrix (GLCM) is a conventional method of extracting statistical texture features. In this paper, five second-moment descriptors, i.e., mean, variance, homogeneity, contrast, and dissimilarity, are applied. For the window size, according to the size and distribution of the various features in the image, we choose a 5 × 5 window and the 0° direction to extract the features. The morphological features are also a type of texture feature, called structure texture. Two commonly used morphological operators are opening and closing. The mathematical morphology framework defines a series of operators to emphasize homogeneous spatial structures in a gray-level image. The strategy of opening by reconstruction is to dilate an eroded image in order to recover as much as possible of the eroded image. In contrast, closing by reconstruction erodes a dilated image in order to recover the initial shape of image structures that have been dilated. The opening-and-closing reconstruction integrates the advantages of both operations regarding their capacity to preserve the original shapes of spatial structures. Therefore, these three morphological reconstruction filters are used to construct the input dataset. According to the distribution of features in the images, a circular structuring element with a radius of 5 is chosen.
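As an illustration of the texture step, the five GLCM descriptors can be computed for a single window from the normalized co-occurrence matrix. A pure-NumPy sketch at the 0° direction (the quantization to 8 gray levels and the toy window are illustrative):

```python
import numpy as np

def glcm_features(window, levels=8, dx=1, dy=0):
    """GLCM texture descriptors for one (e.g. 5 x 5) window at the
    0-degree direction (dx=1, dy=0). Returns the five descriptors used
    in the paper: mean, variance, homogeneity, contrast, dissimilarity."""
    g = np.zeros((levels, levels))
    h, w = window.shape
    for r in range(h - dy):                 # count co-occurring pairs
        for c in range(w - dx):
            g[window[r, c], window[r + dy, c + dx]] += 1
    g /= g.sum()                            # normalize to a joint pmf
    i, j = np.indices((levels, levels))
    mean = (i * g).sum()
    var = ((i - mean) ** 2 * g).sum()
    homog = (g / (1.0 + (i - j) ** 2)).sum()
    contrast = ((i - j) ** 2 * g).sum()
    dissim = (np.abs(i - j) * g).sum()
    return mean, var, homog, contrast, dissim

win = np.array([[0, 1, 2, 3, 0],
                [1, 2, 3, 0, 1],
                [2, 3, 0, 1, 2],
                [3, 0, 1, 2, 3],
                [0, 1, 2, 3, 0]])
feats = glcm_features(win)
print(feats)
```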
Despite the advantages of supervised classifiers in classification, they require training samples to be labeled beforehand. Manual selection of training samples can lead to incompleteness of the selected categories and is time-consuming. Therefore, in this paper the training samples are selected by Change Vector Analysis (CVA), an unsupervised change detection method. CVA is very effective in combining different types of change features. The training samples are selected from the change map by using two thresholds, defined as:
$$\begin{aligned} t_1 &= \big[T + k \cdot \delta_c,\; T + (k+1) \cdot \delta_c\big] \\ t_2 &= \big[T - (l+1) \cdot \delta_{nc},\; T - l \cdot \delta_{nc}\big] \end{aligned} \tag{5}$$
where $T$ is determined by the expectation maximization (EM) algorithm, $\delta_c$ and $\delta_{nc}$ are the standard deviations of the changed and unchanged pixels, respectively, and $k$ and $l$ are adjustment coefficients with $k = 1, 2, \ldots, a$ and $l = 1, 2, \ldots, b$. Here, $a = (x_{\max} - T)/\delta_c$ and $b = (T - x_{\min})/\delta_{nc}$, with $x_{\max}$ and $x_{\min}$ being the maximum and minimum values of the CVA change map, respectively.
Pixel-wise Change Detection Based on the
Stacked Generalization Hybrid Ensemble System
As mentioned earlier, ELM, SVM, and KNN are chosen to construct the base classifiers at level-0. The MRLR is utilized as the meta-classifier at level-1. In order to improve computational efficiency and ensure high accuracy, the ELM homogeneous integration algorithm based on the random subspace method (RSM) is adopted to label a large part of the pixels. The remaining unlabeled pixels are then classified by the proposed SG hybrid ensemble system. The specific change detection processes are as follows.
1. Generation of the level-0 base classifier
As described in the previous section, we randomly divide the automatically acquired training samples into three sub-training sets, and then utilize two of them to train ELM, SVM, and KNN to generate the base classifiers at level-0. When training the ELM, the two sub-training sets use the RSM ensemble strategy to classify all the pixels. According to the label determination rules, a large number of pixels are labeled, and the remaining pixels are reclassified by the trained SVM and KNN. The outputs of ELM, SVM, and KNN based on the RSM homogeneous integration and the
PHOTOGRAMMETRIC ENGINEERING & REMOTE SENSING, November 2018