
$$x_{12\times 1} = [x_1 \;\; x_2 \;\; x_3 \;\; \cdots \;\; x_{12}]^T \;\longrightarrow\; A_{3\times 4} = \begin{bmatrix} x_1 & x_2 & x_3 & x_4 \\ x_5 & x_6 & x_7 & x_8 \\ x_9 & x_{10} & x_{11} & x_{12} \end{bmatrix}$$
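As an illustration of this rearrangement, a minimal NumPy sketch is given below; the row-major ordering and the $3 \times 4$ shape are assumptions made for the example only, since any fixed rearrangement of the $d$ bands into an $m \times n$ matrix can be used.

```python
import numpy as np

# Illustrative sketch of the vector-to-matrix rearrangement (assumed row-major;
# d = 12, m = 3, n = 4 are example values only).
d, m, n = 12, 3, 4
x = np.arange(1, d + 1, dtype=float)   # feature vector x_1, ..., x_12 of one pixel
A = x.reshape(m, n)                    # feature matrix A of size m x n
print(A)
```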
After this transformation, the between-class and within-class scatter matrices are calculated using the matrix form of the training samples. Then, the eigenvector corresponding to the largest eigenvalue of $S_w^{-1} S_b$ can be selected as the projection vector ($p$). By transforming the original feature vector of each pixel of the image into a feature matrix ($x_{d\times 1} \rightarrow A_{m\times n}$), the $m$-dimensional projected vector ($y$), which is the extracted feature vector of matrix $A$, is obtained by:
$$y_{m\times 1} = A_{m\times n}\, p_{n\times 1}.$$ (4)
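A short NumPy sketch of Equation 4 is shown below; the array contents are placeholders, since only the shapes matter here.

```python
import numpy as np

# Sketch of Equation 4: the m x 1 extracted feature vector y is the product of
# the m x n feature matrix A of a pixel and the n x 1 projection vector p.
m, n = 3, 4
A = np.random.rand(m, n)   # feature matrix of one pixel (placeholder values)
p = np.random.rand(n)      # projection vector obtained by 2DLDA
y = A @ p                  # m extracted features
print(y.shape)             # (3,)
```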
We multiply the matrix form of each data sample ($A$) by $p$ to extract an $m$-dimensional feature vector ($y$) from that sample. The scatter matrices in 2DLDA can be calculated as follows:
$$S_b = \sum_{i=1}^{n_c} n_{ti}\, (\bar{A}_i - \bar{A})^T (\bar{A}_i - \bar{A})$$ (5)
$$S_w = \sum_{i=1}^{n_c} \sum_{j=1}^{n_{ti}} (A_{ji} - \bar{A}_i)^T (A_{ji} - \bar{A}_i)$$ (6)
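A minimal NumPy sketch of Equations 5 and 6 follows; the function name and the (N, m, n) array layout of the training samples are assumptions made for illustration.

```python
import numpy as np

def scatter_matrices_2dlda(samples, labels):
    """Sketch of Equations 5 and 6: n x n between-class (S_b) and
    within-class (S_w) scatter matrices from training samples in matrix form.

    samples : array of shape (N, m, n), one m x n feature matrix per sample
    labels  : array of shape (N,), class index of each sample
    """
    _, _, n = samples.shape
    overall_mean = samples.mean(axis=0)              # A-bar (m x n)
    Sb = np.zeros((n, n))
    Sw = np.zeros((n, n))
    for c in np.unique(labels):
        class_samples = samples[labels == c]         # the n_ti samples of class i
        class_mean = class_samples.mean(axis=0)      # A-bar_i (m x n)
        diff = class_mean - overall_mean
        Sb += len(class_samples) * diff.T @ diff     # term of Eq. 5
        for A in class_samples:
            dev = A - class_mean
            Sw += dev.T @ dev                        # term of Eq. 6
    return Sb, Sw
```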
In Equations 5 and 6, $A_{ji}$ is the $j$th sample of class $i$, $\bar{A}_i$ is the mean of class $i$, and $\bar{A}$ denotes the mean of the entire training set. Note that $A_{ji}$, $\bar{A}_i$, and $\bar{A}$ are $m \times n$ matrices. The projection vector for feature extraction using 2DLDA is obtained by maximizing the Fisher criterion as follows:
$$p = \arg\max_{p} \frac{p^T S_b\, p}{p^T S_w\, p}.$$ (7)
The above optimization problem is solved to obtain the optimal $p$. To do so, we solve the following generalized eigenvalue problem:
$$S_b\, p = \lambda\, S_w\, p$$ (8)
where $\lambda$ is the maximal eigenvalue of $S_w^{-1} S_b$ and $p$ is the eigenvector associated with $\lambda$. Note that in traditional LDA (1DLDA), the scatter matrices are $d \times d$, while in 2DLDA, $S_b$ and $S_w$ are $n \times n$ matrices with $n < d$. In addition, 1DLDA uses a projection matrix ($W_{d\times m}$) for feature extraction, whereas 2DLDA uses a projection vector ($p_{n\times 1}$).
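A sketch of solving Equation 8 numerically is given below; it uses SciPy's generalized symmetric eigensolver, which is one of several equivalent ways to obtain $p$.

```python
import numpy as np
from scipy.linalg import eigh

def projection_vector(Sb, Sw):
    """Sketch of Equation 8: solve S_b p = lambda S_w p and return the
    eigenvector associated with the largest eigenvalue."""
    eigvals, eigvecs = eigh(Sb, Sw)   # generalized problem, eigenvalues in ascending order
    return eigvecs[:, -1]             # p: column for the largest eigenvalue
```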
The use of the 2DLDA approach for feature extraction of hyperspectral data has two main advantages:
1. For high-dimensional matrices, inversion is a sensitive operation that can be done reliably only if the matrix is estimated accurately. In LDA, however, it is difficult to obtain a precise estimate of $S_w$ from a limited number of training samples. As a result, $S_w$ becomes nearly singular, which causes overfitting in the LDA method. In 2DLDA, by transforming the feature vector of each sample into a feature matrix, we overcome the small sample size (SSS) problem. The within-class scatter matrix in 2DLDA is usually nonsingular. Li and Yuan (2005) show that in 2DLDA, the within-class scatter matrix $S_w$ is nonsingular when $N \geq n_c + n/\min(m, n)$ ($m$ and $n$ are the numbers of rows and columns of $A_{m\times n}$, respectively, and $N$ is the total number of training samples; see the sketch after this list). This inequality is usually satisfied, and therefore the SSS problem does not exist in 2DLDA.
2. LDA can extract at most $n_c - 1$ features, while 2DLDA can extract any number of features without limitation. In 2DLDA, the rank of $S_b$ is not limited by the number of classes. Moreover, regardless of the rank of $S_b$, only the single eigenvector associated with the largest eigenvalue of $S_w^{-1} S_b$ needs to be taken as the projection vector ($p$). However, we show later that using all of the eigenvectors in the calculation of $p$ improves the classification accuracy.
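The nonsingularity condition quoted in the first advantage can be checked directly; in the sketch below, the sample and class counts are hypothetical placeholders, and the inequality is written as reconstructed above.

```python
# Hypothetical check of the nonsingularity condition from Li and Yuan (2005),
# as quoted above: N >= n_c + n / min(m, n).
N, n_c = 500, 9    # total training samples and number of classes (placeholders)
m, n = 6, 34       # rows and columns of the feature matrices A
if N >= n_c + n / min(m, n):
    print("S_w is expected to be nonsingular")
```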
Two problems may arise when we use the 2DLDA method for feature extraction of hyperspectral images:
1. Because the number of spectral bands ($d$) must be written as a product of two integers ($d = m \times n$), $d$ must be a composite number. In other words, we must be able to transform the feature vector ($x_{d\times 1}$) of each pixel of the image into a feature matrix ($A_{m\times n}$). Thus, if $d$ is a prime number, the use of 2DLDA is not possible.
2. The number of extracted features is equal to the number of rows ($m$) of $A$, because $y_{m\times 1} = A_{m\times n}\, p_{n\times 1}$. Thus, if $d$ is not divisible by $m$, extraction of $m$ features is not possible.
We now present the solutions to the aforementioned problems:
1. If the number of spectral bands ($d$) is a prime number, we add $\varepsilon$ new features to the $d$ original features so that $d + \varepsilon$ becomes a composite number and can therefore be written as a product of two integers.
2. For extraction of $m$ features, if $d$ is not divisible by $m$, we add $\varepsilon$ to $d$ in such a way that $d + \varepsilon$ becomes divisible by $m$.
In general, for extraction of $m$ features, we add the smallest possible value of $\varepsilon$ to $d$ so that $d + \varepsilon$ becomes composite and divisible by $m$ ($\varepsilon$ is a nonnegative integer). In other words, by adding $\varepsilon$ to $d$, we add $\varepsilon$ new features to the $d \times 1$ original feature vector ($x_{d\times 1} \rightarrow x_{(d+\varepsilon)\times 1}$), where $d + \varepsilon = m \times n$. We use the central moments of order two or more ($k \geq 2$) as the added features, because they are simple and fast to compute and are also effective from a classification-accuracy point of view. The $k$th central moment is defined as:
$$\mu_k = E[(x - m_x)^k], \quad k = 2, 3, \ldots$$ (9)
where $m_x$ is the mean of the feature vector ($x_{d\times 1}$). For a better understanding of the proposed process, consider the following example. Suppose that the number of spectral bands (features) in a hyperspectral image is $d = 200$. For extraction of $m = 6$ features, we first add $\varepsilon = 4$ new features to the original feature vector of each pixel so that $d + \varepsilon = 204$ becomes divisible by 6. Accordingly, we add the central moments of orders 2, 3, 4, and 5 to each feature vector. The feature vector of each sample, which is now a $204 \times 1$ vector, is then transformed into a $6 \times 34$ matrix, and the scatter matrices are estimated from the transformed training samples.
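The padding and reshaping step can be sketched as follows; the helper name is hypothetical, and the moment orders follow the worked example above (orders 2 through $\varepsilon + 1$).

```python
import numpy as np

def pad_with_central_moments(x, m):
    """Sketch of the padding step: append the smallest number of central
    moments (Eq. 9, orders 2, 3, ...) to x so that its length becomes
    divisible by m, then reshape it into an m x n feature matrix."""
    d = len(x)
    eps = (-d) % m                       # smallest eps with (d + eps) % m == 0
    mean = x.mean()
    moments = [np.mean((x - mean) ** k) for k in range(2, 2 + eps)]
    x_padded = np.concatenate([x, moments])
    return x_padded.reshape(m, -1)       # A of size m x ((d + eps) / m)

# Example from the text: d = 200 bands and m = 6 features give eps = 4,
# so A has size 6 x 34.
A = pad_with_central_moments(np.random.rand(200), 6)
print(A.shape)   # (6, 34)
```

Note that when $d$ is much larger than $m$ (so that $n \geq 2$), divisibility by $m$ already makes $d + \varepsilon$ composite, so the single divisibility check above covers both requirements.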
$S_w^{-1} S_b$ is an $n \times n$ matrix with $n$ eigenvalues. We can use just the one eigenvector associated with the largest eigenvalue of $S_w^{-1} S_b$ as the projection vector $p$. However, it is better to exploit all of the eigenvectors in the calculation of $p$, which improves the classification accuracy. In other words, each eigenvector can contribute to $p$ in proportion to the magnitude of its eigenvalue. Thus, the projection vector is calculated in a weighted manner as follows: