$$\beta \sim \mathrm{GEM}(\gamma), \quad (16)$$
$$\tau_k \sim \mathrm{DP}(\alpha, \beta), \quad (17)$$
$$\phi_k \sim H, \quad (18)$$
$$y_t \mid y_{t-1} \sim \mathrm{Multi}(\tau_{y_{t-1}}), \quad (19)$$
$$x_t \mid y_t = s_i \sim \mathrm{Multi}(\phi_{s_i}), \quad (20)$$
where $y_t \in S = \{s_1, \ldots, s_{N_S}\}$ is the state of the $t$-th clip, $S$ is the set of possible states, and $N_S$ is the total number of states. $x_t$ is the observation set (visual words). In this case, each vector $\tau_k = \{\tau_{kl}\}_{l=1}^{L}$ is one row of the Markov chain's transition matrix from state $k$ to the other states, and $L$ is the number of states. For a better illustration, we denote this transition matrix as $M = \{m_{i,j}\}_{i,j=1}^{L}$ throughout this paper.
Given the state $y_t$, the observation $x_t$ is drawn from the mixture component $\phi_{s_i}$ indexed by $y_t$. Gibbs sampling schemes are applied to perform inference under this HDP-HMM. Figure 7 shows the typical traffic states learned by the HDP-HMM for the QMUL Junction Dataset [8].
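For intuition, a minimal sketch of the generative process in Equations 16 through 20 is given below. It uses a truncated stick-breaking approximation of the GEM and DP draws and takes the base measure $H$ to be a symmetric Dirichlet over the codebook; the truncation level, concentration values, codebook size, and the simplification of one visual word per clip are illustrative assumptions, not values or choices from the paper.

import numpy as np

rng = np.random.default_rng(0)
L, N_x, T = 10, 50, 100          # truncation level, codebook size, number of clips (assumed)
gamma, alpha = 1.0, 1.0          # concentration parameters (illustrative)

# Eq. 16: beta ~ GEM(gamma), approximated by truncated stick-breaking
v = rng.beta(1.0, gamma, size=L)
beta = v * np.concatenate(([1.0], np.cumprod(1.0 - v)[:-1]))
beta = beta / beta.sum()         # renormalize after truncation

# Eq. 17: tau_k ~ DP(alpha, beta); the rows form the transition matrix M
M = np.stack([rng.dirichlet(alpha * beta + 1e-3) for _ in range(L)])  # small floor for stability

# Eq. 18: phi_k ~ H, with H taken here as a symmetric Dirichlet over the codebook
phi = rng.dirichlet(np.full(N_x, 0.5), size=L)

# Eqs. 19-20: sample the state chain and one visual word per clip (simplified)
y = np.zeros(T, dtype=int)
x = np.zeros(T, dtype=int)
y[0] = rng.choice(L, p=beta)
x[0] = rng.choice(N_x, p=phi[y[0]])
for t in range(1, T):
    y[t] = rng.choice(L, p=M[y[t - 1]])
    x[t] = rng.choice(N_x, p=phi[y[t]])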
As with the activity learning using the HDP model, the traffic states learned by the HDP-HMM also include some unexpected results. The typical traffic states are selected in a similar way to that previously described in this section.
Representation of Activities and Video Clips
Activity Representation
Each activity $\theta_k$ is characterized by a multinomial distribution $\{\phi_k\}$ over the words in the codebook. The probability of the $i$-th word in activity $\theta_k$ is denoted as $p_{kx_i}$, with $p_k = \{p_{kx_i}\}_{i=1}^{N_x}$ and $\sum_{i=1}^{N_x} p_{kx_i} = 1$, where $N_x$ is the size of the codebook. Similar to the operation previously described which selects the representative activity, we also select the representative visual words to represent each activity in the same way: $p_k$ is sorted in descending order, $p'_k = \{p'_{kx_1} \geq \ldots \geq p'_{kx_{N_x}}\}$, and then the accumulated sum of probability is calculated as:
$$P'_{kj} = \sum_{i=1}^{j} p'_{kx_i} \quad (21)$$
Those visual words which satisfy
$$w_{\theta_k} = \{x_j \mid P'_{kj} \leq 0.9\} \quad (22)$$
are chosen to represent activity $\theta_k$. This is the set of the most frequently co-occurring words within the same activity. The words falling into the remaining 10 percent are viewed as noise or rare motion. Figure 4 shows a comparison between all possible co-occurring visual words and the selected representative words in the activity of vehicles driving downward.
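As a rough illustration of Equations 21 and 22, the selection of representative words for a single activity could be sketched as follows; p_k is that activity's word distribution and the 0.9 threshold is the one given above, while the toy probability values are made up purely for illustration.

import numpy as np

def representative_words(p_k, threshold=0.9):
    """Return the indices of the visual words kept for one activity (Eqs. 21-22)."""
    order = np.argsort(p_k)[::-1]      # sort word probabilities in descending order
    cum = np.cumsum(p_k[order])        # accumulated sum P'_kj (Eq. 21)
    keep = order[cum <= threshold]     # keep words within the first 90% of the mass (Eq. 22)
    return set(keep.tolist())

# toy activity distribution over a six-word codebook (illustrative values only)
p_k = np.array([0.35, 0.25, 0.15, 0.10, 0.09, 0.06])
w_theta_k = representative_words(p_k)  # -> {0, 1, 2, 3}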
Video Clip Representation
Feature vectors of activities from the last step vary in length, because the number of representative words differs from activity to activity. They are therefore not suitable for describing a video clip directly. We construct a feature vector that explains a clip using the learned activities in a new way, as follows.
$x_t = \{x_{ti}\}_{i=1}^{N_t}$ denotes the $N_t$ words present in clip $t$ in total.
$x_t$ is compared with each activity word set $w_{\theta_k}$ and the percentage of intersection is calculated as:
$$p_{tk} = \frac{|x_t \cap w_{\theta_k}|}{N_t} \quad (23)$$
It explains the proportion of activity $\theta_k$ in this clip. The feature vector which explains what happens in this clip is represented as $c_t = \{p_{t1}, \ldots, p_{tk}\}$, as shown in Figure 7 (e) to (h).
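A minimal sketch of the clip descriptor in Equation 23 is given below; the function and variable names are hypothetical, with the activity word sets coming from the selection step above and the clip's words from the detected visual words.

import numpy as np

def clip_feature(clip_words, activity_word_sets):
    """Build c_t = {p_t1, ..., p_tk} from the words present in clip t (Eq. 23)."""
    x_t = set(clip_words)
    N_t = len(x_t)
    # proportion of clip words that fall into each activity's representative word set
    return np.array([len(x_t & w_k) / N_t for w_k in activity_word_sets])

# toy example: a clip with five distinct visual words and two learned activities
c_t = clip_feature([3, 7, 8, 12, 20], [{3, 7, 8, 25}, {12, 20, 31}])  # -> [0.6, 0.4]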
Figure 4 is a comparison between the activity pattern before and after filtering the unnecessary words. The visual words in the left part of image (a) seem chaotic and are filtered out. In Figure 4b, the activity is represented better by the selected visual words. The color of the arrow denotes the quantified motion direction, as illustrated in Figure 4c.
Traffic States Classification
In this section, we first discuss how to use GP models to classify traffic states in a newly screened video. Then, we integrate the transition information learned by the HDP-HMM with the GP model to enhance the classification accuracy.
Gaussian Process for Classification
The HDP-HMM has mined the main traffic states $S$ from the training video sequence, and each training video clip is labeled with a state label $y_t \in S$, where the subscript $t$ is the clip index. $c_t$ is the feature vector of clip $t$ given by Equation 23. Now the
Figure 3. A graphical representation of the HDP-HMM model.
Figure 4. A comparison between the activity pattern before and after filtering the unnecessary words. The visual words in the left part of image (a) seem chaotic and are filtered out. In (b), the activity is represented better by the selected visual words. The color of the arrow denotes the quantified motion direction, as illustrated in (c).
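As a hedged sketch of this classification step, a Gaussian process classifier can be trained on the clip descriptors c_t with their state labels y_t. The example below uses scikit-learn's GaussianProcessClassifier with an RBF kernel as an assumed stand-in; the kernel choice, the placeholder data, and the later fusion with the transition matrix M are illustrative assumptions rather than the exact formulation used in the paper.

import numpy as np
from sklearn.gaussian_process import GaussianProcessClassifier
from sklearn.gaussian_process.kernels import RBF

# placeholder training data: clip descriptors c_t (one row per clip) and state labels in S
C_train = np.random.default_rng(0).random((40, 4))
y_train = np.arange(40) % 3          # three dummy traffic states (illustrative)

gpc = GaussianProcessClassifier(kernel=1.0 * RBF(length_scale=1.0))
gpc.fit(C_train, y_train)

# posterior state probabilities for new clips; these can later be fused with the
# HDP-HMM transition matrix M to refine the predicted state sequence
C_new = np.random.default_rng(1).random((5, 4))
state_probs = gpc.predict_proba(C_new)
predicted_states = gpc.predict(C_new)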