
training data set $(C, y)$ is constructed to train the discriminative GP model. Our task is to label a newly arriving video clip $c_*$ with the traffic state of highest probability $P(y_* \mid C, y, c_*)$. For simplicity of illustration, binary classification with two traffic states $y_t \in \{-1, +1\}$ is considered here; it is easily extended to multi-class classification by using the one-against-all or one-against-one strategy.
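As a minimal, hypothetical sketch (not from the paper), the following Python snippet shows how the binary predictive probabilities of per-state classifiers could be combined under a one-against-all strategy; the state names and the `binary_probs` interface are placeholders.

```python
import numpy as np

def one_vs_all_label(binary_probs):
    """Pick the state whose 'state vs. rest' GP classifier is most confident.

    binary_probs: dict mapping state name -> P(y* = +1 | C, y, c*) from the
    binary classifier trained for that state (placeholder interface).
    """
    states = list(binary_probs)
    scores = np.array([binary_probs[s] for s in states])
    return states[int(np.argmax(scores))]

# Example with three hypothetical traffic states:
print(one_vs_all_label({"state_a": 0.7, "state_b": 0.2, "state_c": 0.1}))
```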
The general formulation of the probability prediction for a new test sample, given the training data $(C, y)$, under a GP model is:

$p(y_* = +1 \mid C, y, c_*) = \int p(y_* \mid f_*)\, p(f_* \mid C, y, c_*)\, df_*,$  (24)
where $p(f_* \mid C, y, c_*)$ is the distribution of the latent variable $f_*$ corresponding to sample $c_*$. It is obtained by integrating over the latent variables $f = (f_1, \ldots, f_T)$:
$p(f_* \mid C, y, c_*) = \int p(f_* \mid C, y, c_*, f)\, p(f \mid C, y)\, df,$  (25)
where $p(f \mid C, y) = p(y \mid f)\, p(f \mid C)\, /\, p(y \mid C)$ is the posterior over the latent variables, $p(y \mid C)$ is the marginal likelihood (evidence), and $p(f \mid C)$ is the GP prior over the latent function, which in the GP model is a jointly zero-mean Gaussian distribution with covariance given by the kernel $K$.
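To make the zero-mean GP prior concrete, the following minimal NumPy sketch draws a latent vector $f \sim \mathcal{N}(0, K)$, with $K$ built from a Gaussian RBF kernel (the paper's kernel choice, Equation 29); the clip feature vectors and hyper-parameter values below are placeholders.

```python
import numpy as np

def rbf_kernel(C, sigma=1.0, length=1.0):
    """K_ij = sigma^2 * exp(-||c_i - c_j||^2 / (2 * length^2)), cf. Equation 29."""
    sq_dists = ((C[:, None, :] - C[None, :, :]) ** 2).sum(-1)
    return sigma ** 2 * np.exp(-sq_dists / (2.0 * length ** 2))

rng = np.random.default_rng(0)
C = rng.normal(size=(8, 5))        # 8 placeholder clip feature vectors
K = rbf_kernel(C)                  # covariance of the GP prior p(f | C)
f = rng.multivariate_normal(np.zeros(len(C)), K)   # one draw f ~ N(0, K)
print(np.round(f, 3))
```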
The non-Gaussian likelihood in Equation 25 makes the integral analytically intractable. We therefore have to resort either to analytical approximations of the integrals or to Monte Carlo methods. Two well-known analytical approximation methods are well suited to this task, namely the Laplace approximation [williams1998bayesian] and Expectation Propagation (EP) [minka2001family]. Both approximate the non-Gaussian joint posterior by a Gaussian. In this paper we adopt the Laplace method since its computational cost is lower than that of EP, with comparable accuracy. As introduced in [26], the mean and variance of $f_*$ are obtained as follows:
$p(f_* \mid C, y, c_*) = \mathcal{N}(\mu_*, \sigma_*^2),$  (26)

with

$\mu_* = k(C, c_*)^T K^{-1} \hat{f},$  (27)

$\sigma_*^2 = k(c_*, c_*) - k(C, c_*)^T (K + W^{-1})^{-1} k(C, c_*),$  (28)
where $W \triangleq -\nabla\nabla \log p(y \mid f)$ is diagonal, $K$ denotes the $T \times T$ covariance matrix between the $T$ training points, $k(C, c_*)$ is the covariance vector between the $T$ training video clips $C$ and the test clip $c_*$, $k(c_*, c_*)$ is the covariance of the test clip $c_*$, and $\hat{f} = \arg\max_f p(f \mid C, y)$. Given the mean and variance of the latent variable $f_*$ for the test clip $c_*$, we compute the prediction probability using Equation 24.
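The following is a minimal NumPy sketch of Equations 26 through 28 and of the predictive probability of Equation 24, assuming a logistic (sigmoid) likelihood $p(y = +1 \mid f) = \sigma(f)$ and assuming the Laplace mode $\hat{f}$ has already been found; all array names (`K`, `k_star`, `k_ss`, `f_hat`) are placeholders, and the one-dimensional integral of Equation 24 is evaluated by Gauss-Hermite quadrature rather than in closed form.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def laplace_predict(K, k_star, k_ss, f_hat):
    """Predictive mean/variance (Eqs. 26-28) and class probability (Eq. 24).

    K      : (T, T) covariance matrix of the training clips
    k_star : (T,)   covariance vector k(C, c*)
    k_ss   : scalar covariance k(c*, c*)
    f_hat  : (T,)   Laplace mode argmax_f p(f | C, y)
    """
    pi = sigmoid(f_hat)
    W_inv = np.diag(1.0 / (pi * (1.0 - pi)))   # W = -grad grad log p(y|f) is diagonal
    mu_star = k_star @ np.linalg.solve(K, f_hat)                    # Eq. 27
    var_star = k_ss - k_star @ np.linalg.solve(K + W_inv, k_star)   # Eq. 28
    # Eq. 24: integrate the likelihood over N(mu_star, var_star) numerically.
    nodes, weights = np.polynomial.hermite_e.hermegauss(32)
    p_plus = np.sum(weights * sigmoid(mu_star + np.sqrt(max(var_star, 0.0)) * nodes))
    p_plus /= np.sqrt(2.0 * np.pi)
    return mu_star, var_star, p_plus
```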
The covariance function and its hyper-parameters $\Theta$ crucially affect the performance of GP models. The Gaussian radial basis function (RBF) is one of the most widely used kernels due to its robustness for different types of data and is given as:

$K_{RBF}(c_i, c_j) = \sigma^2 \exp\!\left( -\dfrac{\| c_i - c_j \|^2}{2 l^2} \right),$  (29)

where $\Theta = [\sigma, l]$ is the hyper-parameter set of the RBF kernel. We optimize the hyper-parameters using the Conjugate Gradient method [27].
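For reference, scikit-learn's `GaussianProcessClassifier` implements this same pipeline (a binary GP classifier with a Laplace approximation and an RBF kernel); the sketch below uses placeholder clip features, and note that scikit-learn tunes $\Theta$ by maximizing the marginal likelihood with L-BFGS-B by default rather than with the conjugate-gradient optimizer used in the paper.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessClassifier
from sklearn.gaussian_process.kernels import ConstantKernel, RBF

rng = np.random.default_rng(0)
X = rng.normal(size=(40, 5))             # placeholder clip feature vectors C
y = (X[:, 0] > 0).astype(int)            # placeholder binary traffic-state labels

# sigma^2 * exp(-||c_i - c_j||^2 / (2 l^2)), as in Equation 29;
# the hyper-parameters Theta = [sigma, l] are refined during fitting.
kernel = ConstantKernel(1.0) * RBF(length_scale=1.0)
gpc = GaussianProcessClassifier(kernel=kernel)   # uses the Laplace approximation
gpc.fit(X, y)

c_star = rng.normal(size=(1, 5))         # a new, unseen clip c*
print(gpc.predict_proba(c_star))         # P(y* | C, y, c*), cf. Equation 24
```

For more than two traffic states, the same estimator accepts `multi_class='one_vs_rest'` or `'one_vs_one'`, matching the strategies mentioned above.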
Integration of Transition Information into GP Classifier
The input video is segmented into clips along the time axis. It cannot be guaranteed that each clip falls precisely within a single traffic-state interval. In practice, the transition between two states sometimes occurs within a clip, as shown in Figure 5a. In other cases the scene is nearly still in a clip, with very little motion, as shown in Figure 5b. In these two cases it is hard for the GP classifier to classify the states exactly. Fortunately, a crowded traffic scene is normally regulated by traffic lights, so the transition between two traffic states is rule-based; e.g., the transition from the state of Figure 7a to the state of Figure 7c is impossible. The transition information obtained in the Learning States Using the HDP-HMM Section is therefore of significant value here.
We define a state energy for clip $t$ as follows:

$E(y_t = s_i \mid y_{t-1} = s_j) = -\log\{ p(y_t \mid c_t) \} + \beta \log\{ m_{s_i, s_j} \}\, (1 - \delta(y_t, y_{t-1})),$  (30)

$\hat{y}_t = \arg\min_{y_t = s_i} E(y_t \mid y_{t-1}),$  (31)
where $p(y_t \mid c_t)$ is the likelihood of the $t$-th clip being labeled as state $s_i$, given by Equation 24; $m_{s_i, s_j}$ is the transition probability from state $s_j$ (the state of the last clip) to state $s_i$; and $\delta(y_t, y_{t-1}) = 1$ if $y_t = y_{t-1}$, else 0. $\beta$ is the weight of the transition energy and is set experimentally to 0.1. This means that, if the state does not change, we do not need to consider the transition term; if a transition between states occurs, we take the transition information into account and choose the state with the minimal state energy.
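The following is a minimal sketch of the greedy decision rule of Equations 30 and 31, implementing the energy exactly as printed; `gp_probs` (per-clip GP likelihoods from Equation 24), `trans` (the HDP-HMM transition probabilities $m_{s_i, s_j}$), and the state names are placeholder inputs.

```python
import numpy as np

EPS = 1e-12   # guards log(0)
BETA = 0.1    # transition-energy weight, set experimentally in the paper

def state_energy(p_state, m_trans, changed):
    """Equation 30: data term plus the transition term when the state changes."""
    energy = -np.log(p_state + EPS)
    if changed:                              # (1 - delta(y_t, y_{t-1})) = 1
        energy += BETA * np.log(m_trans + EPS)
    return energy

def decode_states(gp_probs, trans, states):
    """Equation 31: label each clip with its minimal-energy state, left to right.

    gp_probs : list of dicts, gp_probs[t][s] = p(y_t = s | c_t) from the GP classifier
    trans    : dict of dicts, trans[s_j][s_i] = transition probability from s_j to s_i
    states   : list of possible traffic states
    """
    labels, prev = [], None
    for probs in gp_probs:
        if prev is None:                     # first clip: GP likelihood only
            best = min(states, key=lambda s: -np.log(probs[s] + EPS))
        else:
            best = min(states,
                       key=lambda s: state_energy(probs[s], trans[prev][s], s != prev))
        labels.append(best)
        prev = best
    return labels
```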
Abnormal Events Detection
Abnormal event identification is one of the most interesting and desired capabilities of automated video behavior analysis. However, dangerous or illegal activities are often subtle and offer few examples to learn from. In other words, identifying abnormal events from their motion patterns is a challenging problem for a supervised classifier. To tackle this problem, abnormal events must first be defined; they are roughly categorized into three groups.
Figure 5. Examples of confused traffic states: (a) an imperfectly segmented clip may contain motion information belonging to different states, and (b) a silent clip contains too little useful motion information. Both cases make it hard for the system to determine the right state.