Velocity Moments Explained

In the field of computer vision, velocity moments are weighted averages of the intensities of pixels in a sequence of images, similar to image moments but in addition to describing an object's shape also describe its motion through the sequence of images. Velocity moments can be used to aid automated identification of a shape in an image when information about the motion is significant in its description. There are currently two established versions of velocity moments: Cartesian[1] and Zernike.[2]

Cartesian velocity moments

Cartesian moments for single images

A Cartesian moment of a single image is calculated by

mpq=

M
\sum
x=1
N
\sum
y=1

xpyqPxy

where

M

and

N

are the dimensions of the image,

Pxy

is the intensity of the pixel at the point

(x,y)

in the image, and

xpyq

is the basis function.

Cartesian velocity moments for sequences of images

Cartesian velocity moments are based on these Cartesian moments. A Cartesian velocity moment

vmpq\mu\gamma

is defined by

vmpq\mu\gamma=

images
\sum
i=2
M
\sum
x=1
N
\sum
y=1

U(i,\mu,\gamma)C(i,p,g)

P
ixy

where

M

and

N

are again the dimensions of the image,

images

is the number of images in the sequence, and
P
ixy
is the intensity of the pixel at the point

(x,y)

in image

i

.

C(i,p,q)

is taken from Central moments, added so the equation is translation invariant, defined as

C(i,p,q)=

p
(x-\overline{x
i})
q
(y-\overline{y
i})

where

\overline{xi}

is the

x

coordinate of the centre of mass for image

i

, and similarly for

y

.

U(i,\mu,\gamma)

introduces velocity into the equation as

U(i,\mu,\gamma)=(\overline{xi}-\overline{xi-1

})^\mu (\overline-\overline)^\gamma

where

\overline{xi-1

} is the

x

coordinate of the centre of mass for the previous image,

i-1

, and again similarly for

y

.

After the Cartesian velocity moment is calculated, it can be normalised by

\overline{vmpq\mu\gamma

} = \frac

where

A

is the average area of the object, in pixels, and

I

is the number of images. Now the value is not affected by the number of images in the sequence or the size of the object.

As Cartesian moments are non-orthogonal, so are Cartesian velocity moments, so different moments can be closely correlated. These velocity moments do however provide translation and scale invariance (unless the scale changes within the sequence of images).

Zernike velocity moments

Zernike moments for single images

A Zernike moment of a single image is calculated by

Amn=

m+1
\pi

\sumx\sumy[Vmn(r,\theta)]*Pxy

where

*

denotes the complex conjugate,

m

is an integer between

0

and

infty

, and

n

is an integer such that

m-|n|

is even and

|n|<m

. For calculating Zernike moments, the image, or section of the image which is of interest is mapped to the unit disc, then

Pxy

is the intensity of the pixel at the point

(x,y)

on the disc and

x2+y2\le1

is a restriction on values of

x

and

y

. The coordinates are then mapped to polar coordinates, and

r

and

\theta

are the polar coordinates of the point

(x,y)

on the unit disc map.

Vmn(r,\theta)

is derived from Zernike polynomials and is defined by

Vmn(r,\theta)=Rmn(r)ejn\theta

Rmn(r)=

m-|n|
2
\sum
s=0

(-1)sF(m,n,s,r)

F(m,n,s,r)=

(m-s)!
s!
(m+|n|
2-s)!
(m-|n|
2-s)!

rm-2s

Zernike velocity moments for sequences of images

Zernike velocity moments are based on these Zernike moments. A Zernike velocity moment

Amn\mu\gamma

is defined by

Amn\mu\gamma=

m+1
\pi
images
\sum
i=2

\sumx=1\sumy=1U(i,\mu,\gamma)[Vmn(r,\theta)]*

P
ixy

where

images

is again the number of images in the sequence, and
P
ixy
is the intensity of the pixel at the point

(x,y)

on the unit disc mapped from image

i

.

U(i,\mu,\gamma)

introduces velocity into the equation in the same way as in the Cartesian velocity moments and

[Vmn(r,\theta)]*

is from the Zernike moments equation above.

Like the Cartesian velocity moments, Zernike velocity moments can be normalised by

\overline{Amn\mu\gamma

} = \frac

where

A

is the average area of the object, in pixels, and

I

is the number of images.

As Zernike velocity moments are based on the orthogonal Zernike moments, they produce less correlated and more compact descriptions than Cartesian velocity moments. Zernike velocity moments also provide translation and scale invariance (even when the scale changes within the sequence).

Comparison of methods

Velocity moment typeTranslation invarianceScale invarianceOrthogonal
CartesianYesYes (if the object does not change scale within the sequence)No
ZernikeYesYesYes

References

  1. J. D. Shutler, M. S. Nixon, C. J. Harris, "Statistical Gait Description via Temporal Moments", Proc. SSIAI 2000 - Austin, Texas, :pp. 291-295, 2000
  2. J. D. Shutler and M. S. Nixon, "Zernike Velocity Moments for Description and Recognition of Moving Shapes", Proc. BMVC 2001, Manchester, UK, 2:pp. 705-714, 2001

External links