Direct linear transformation

Direct linear transformation (DLT) is an algorithm which solves a set of variables from a set of similarity relations:

x_k \propto A y_k

for

k=1,\ldots,N

where

x_k

and

y_k

are known vectors,

\propto

denotes equality up to an unknown scalar multiplication, and

A

is a matrix (or linear transformation) which contains the unknowns to be solved.

This type of relation appears frequently in projective geometry. Practical examples include the relation between 3D points in a scene and their projection onto the image plane of a pinhole camera,^[1] and homographies.

Introduction

An ordinary system of linear equations

x_k=Ay_k

for

k=1,\ldots,N

can be solved, for example, by rewriting it as a matrix equation

X=AY

where matrices

X

and

Y

contain the vectors

x_k

and

y_k

in their respective columns. Given that there exists a unique solution, it is given by

A=XY^T(YY^T)^{-1}.

Solutions can also be described in the case that the equations are overdetermined or underdetermined.
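As a minimal sketch of the ordinary (non-projective) case, the closed form above can be checked numerically. The data below are hypothetical, NumPy is assumed to be available, and the least-squares call is shown only as a commonly preferred alternative to forming the inverse explicitly.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: a known 2x3 matrix and N = 10 random vectors y_k.
A_true = np.array([[1.0, 2.0, 3.0],
                   [4.0, 5.0, 6.0]])
Y = rng.standard_normal((3, 10))   # y_k in the columns
X = A_true @ Y                     # x_k = A y_k in the columns

# Closed form A = X Y^T (Y Y^T)^{-1}; requires Y Y^T to be invertible.
A_closed = X @ Y.T @ np.linalg.inv(Y @ Y.T)

# Least-squares alternative: solve Y^T A^T = X^T for A^T.
A_lstsq = np.linalg.lstsq(Y.T, X.T, rcond=None)[0].T
```

Both estimates recover the matrix here because the equations hold exactly; once each equation carries its own unknown scale factor, neither approach applies directly.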

What makes the direct linear transformation problem distinct from the above standard case is the fact that the left and right sides of the defining equation can differ by an unknown multiplicative factor which is dependent on k. As a consequence,

A

cannot be computed as in the standard case. Instead, the similarity relations are rewritten as proper linear homogeneous equations which then can be solved by a standard method. The combination of rewriting the similarity equations as homogeneous linear equations and solving them by standard methods is referred to as a direct linear transformation algorithm or DLT algorithm. DLT is attributed to Ivan Sutherland.

Example

Suppose that

k\in\{1,\ldots,N\}

. Let

x_k=(x_{1k},x_{2k})\in\mathbb{R}^2

and

y_k=(y_{1k},y_{2k},y_{3k})\in\mathbb{R}^3

be two known vectors, and suppose we want to find the

2\times 3

matrix

A

such that

\alpha_k x_k=A y_k

where

\alpha_k\neq 0

is the unknown scalar factor related to equation k.

To get rid of the unknown scalars and obtain homogeneous equations, define the anti-symmetric matrix

H=\begin{pmatrix}0&-1\\ 1&0\end{pmatrix}

and multiply both sides of the equation with

x_k^T H

from the left

\begin{align} (x_k^T H)\,\alpha_k x_k &= (x_k^T H)\,A y_k \\ \alpha_k\, x_k^T H x_k &= x_k^T H A y_k \end{align}

Since

x_k^T H x_k=0,

the following homogeneous equations, which no longer contain the unknown scalars, are at hand

x_k^T H A y_k=0
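The elimination step can be illustrated numerically. The sketch below uses hypothetical values for the matrix, the vector y_k and the scale factor, and checks that both x_k^T H x_k and x_k^T H A y_k vanish whatever the scale factor is.

```python
import numpy as np

H = np.array([[0.0, -1.0],
              [1.0,  0.0]])

# Hypothetical matrix, vector and scale factor, chosen only for illustration.
A = np.array([[1.0, 0.5, -2.0],
              [0.0, 3.0,  1.0]])
y = np.array([2.0, -1.0, 1.0])
alpha = 2.5
x = (A @ y) / alpha          # constructed so that alpha * x = A y

quad = x @ H @ x             # x^T H x, zero for any x because H is anti-symmetric
residual = x @ H @ A @ y     # x^T H A y, zero without knowing alpha
```

Changing `alpha` to any other non-zero value leaves both quantities at zero (up to rounding), which is why the scale factor drops out of the equations.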

In order to solve

A

from this set of equations, consider the elements of the vectors

x_k

and

y_k

and matrix

A

:

x_k=\begin{pmatrix}x_{1k}\\ x_{2k}\end{pmatrix}

,

y_k=\begin{pmatrix}y_{1k}\\ y_{2k}\\ y_{3k}\end{pmatrix}

, and

A=\begin{pmatrix}a_{11}&a_{12}&a_{13}\\ a_{21}&a_{22}&a_{23}\end{pmatrix}

and the above homogeneous equation becomes

0=a_{11}x_{2k}y_{1k}-a_{21}x_{1k}y_{1k}+a_{12}x_{2k}y_{2k}-a_{22}x_{1k}y_{2k}+a_{13}x_{2k}y_{3k}-a_{23}x_{1k}y_{3k}

for

k=1,\ldots,N.

This can also be written in the matrix form:

0=b_k^T a

for

k=1,\ldots,N

where

b_k

and

a

are both 6-dimensional vectors defined as

b_k=\begin{pmatrix}x_{2k}y_{1k}\\ -x_{1k}y_{1k}\\ x_{2k}y_{2k}\\ -x_{1k}y_{2k}\\ x_{2k}y_{3k}\\ -x_{1k}y_{3k}\end{pmatrix}

and

a=\begin{pmatrix}a_{11}\\ a_{21}\\ a_{12}\\ a_{22}\\ a_{13}\\ a_{23}\end{pmatrix}.
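The following sketch checks that the vector form agrees with the bilinear form x_k^T H A y_k. The helper name `dlt_row` and the numerical values are hypothetical. The vector a stacks the matrix column by column, which corresponds to `order="F"` in NumPy.

```python
import numpy as np

def dlt_row(x, y):
    """Return b_k for x in R^2 and y in R^3, ordered to match
    a = (a11, a21, a12, a22, a13, a23)."""
    x1, x2 = x
    y1, y2, y3 = y
    return np.array([x2 * y1, -x1 * y1,
                     x2 * y2, -x1 * y2,
                     x2 * y3, -x1 * y3])

H = np.array([[0.0, -1.0],
              [1.0,  0.0]])

# Arbitrary illustrative values; the identity holds for any x, y and A.
A = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])
x = np.array([0.3, -1.2])
y = np.array([2.0, 0.5, -1.0])

a = A.flatten(order="F")        # column-wise stacking: a11, a21, a12, ...
lhs = dlt_row(x, y) @ a         # b_k^T a
rhs = x @ H @ A @ y             # x_k^T H A y_k
```

The same row can be written as a Kronecker product, `np.kron(y, [x[1], -x[0]])`, which is convenient when the dimensions grow.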

So far, each k contributes one equation in the 6 unknowns. The full set of N homogeneous equations can be written in the matrix form

0=Ba

where

B

is an

N\times 6

matrix which holds the known vectors

b_k

in its rows. The unknown

a

can be determined, for example, by a singular value decomposition of

B

;

a

is a right singular vector of

B

corresponding to a singular value that equals zero. Once

a

has been determined, the elements of matrix

A

can be rearranged from vector

a

. Notice that the scaling of

a

is not important (except that it must be non-zero) since the defining equations already allow for unknown scaling. For the solution to be unique up to scale, B must have rank 5, which requires at least five equations.
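A minimal sketch of the whole procedure for the 2 x 3 example, assuming NumPy and noise-free synthetic data. The function name `dlt_2x3` and the test values are hypothetical; the result is compared with the true matrix only up to scale and sign.

```python
import numpy as np

def dlt_2x3(xs, ys):
    """Estimate a 2x3 matrix A, up to scale, with alpha_k x_k = A y_k.

    xs has shape (N, 2) and ys has shape (N, 3).
    Returns (A, singular values of B)."""
    xs = np.asarray(xs, dtype=float)
    ys = np.asarray(ys, dtype=float)
    x1, x2 = xs[:, 0], xs[:, 1]
    # Row k of B is b_k; columns follow a = (a11, a21, a12, a22, a13, a23).
    B = np.column_stack([x2 * ys[:, 0], -x1 * ys[:, 0],
                         x2 * ys[:, 1], -x1 * ys[:, 1],
                         x2 * ys[:, 2], -x1 * ys[:, 2]])
    _, s, Vt = np.linalg.svd(B)
    a = Vt[-1]  # right singular vector for the smallest singular value
    return a.reshape((2, 3), order="F"), s  # undo the column-wise stacking

rng = np.random.default_rng(1)
A_true = np.array([[1.0, -2.0, 0.5],
                   [3.0,  1.0, 2.0]])
ys = rng.standard_normal((8, 3))
alphas = rng.uniform(0.5, 2.0, size=8)        # one unknown scale per equation
xs = (ys @ A_true.T) / alphas[:, None]        # alpha_k x_k = A y_k

A_est, s = dlt_2x3(xs, ys)

# |cosine| of the angle between the flattened matrices: 1 means equal up to scale.
alignment = abs(np.sum(A_est * A_true)) / (
    np.linalg.norm(A_est) * np.linalg.norm(A_true))
```

The singular vector returned by the SVD has unit norm, so the estimate is one arbitrary representative of the family of matrices that differ only by scale.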

In practice the vectors

x_k

and

y_k

may contain noise, which means that the similarity equations are only approximately valid. As a consequence, there may not be a vector

a

which solves the homogeneous equation

0=Ba

exactly. In these cases, a total least squares solution can be used by choosing

a

as a right singular vector corresponding to the smallest singular value of

B.
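The noisy case can be sketched in the same way. The example below assumes Gaussian noise of a hypothetical standard deviation added to the x_k, builds B with one row per equation, and takes the right singular vector belonging to the smallest singular value, which is now small but no longer zero.

```python
import numpy as np

def build_B(xs, ys):
    """Stack one row b_k = kron(y_k, (x_2k, -x_1k)) per equation."""
    return np.array([np.kron(y, [x[1], -x[0]]) for x, y in zip(xs, ys)])

rng = np.random.default_rng(3)
A_true = np.array([[1.0, -2.0, 0.5],
                   [3.0,  1.0, 2.0]])
N = 50
ys = rng.standard_normal((N, 3))
alphas = rng.uniform(0.5, 2.0, size=N)
xs = (ys @ A_true.T) / alphas[:, None]

sigma = 1e-3                                   # assumed noise level
xs_noisy = xs + sigma * rng.standard_normal(xs.shape)

_, s, Vt = np.linalg.svd(build_B(xs_noisy, ys))
a = Vt[-1]                                     # minimizes |B a| subject to |a| = 1
A_est = a.reshape((2, 3), order="F")

alignment = abs(np.sum(A_est * A_true)) / (
    np.linalg.norm(A_est) * np.linalg.norm(A_true))
```

Using many more equations than the minimum is what lets the smallest singular vector average out the noise in this sketch.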

More general cases

The above example has

x_k\in\mathbb{R}^2

and

y_k\in\mathbb{R}^3

, but the general strategy for rewriting the similarity relations into homogeneous linear equations can be generalized to arbitrary dimensions for both

x_k

and

y_k.

If

x_k\in\mathbb{R}^2

and

y_k\in\mathbb{R}^q

the previous expressions can still lead to an equation

0=x_k^T H A y_k

for

k=1,\ldots,N

where

A

now is

2\times q.

Each k provides one equation in the

2q

unknown elements of

A

and together these equations can be written

Ba=0

for the known

N\times 2q

matrix

B

and unknown 2q-dimensional vector

a.

This vector can be found in a similar way as before.

In the most general case

x_k\in\mathbb{R}^p

and

y_k\in\mathbb{R}^q

. The main difference compared to previously is that the matrix

H

now is

p\times p

and anti-symmetric. When

p>2

the space of such matrices is no longer one-dimensional; it is of dimension

M=\frac{p(p-1)}{2}.

This means that each value of k provides M homogeneous equations of the type

0=x_k^T H_m A y_k

for

m=1,\ldots,M

and for

k=1,\ldots,N

where the matrices

H_m

form a basis of the M-dimensional space of

p\times p

anti-symmetric matrices.
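The general construction can be sketched as follows. The function names are hypothetical, and the basis chosen here (one matrix per index pair i < j) is only one of many possible bases. Each equation x_k^T H_m A y_k = 0 becomes one row of B through the Kronecker product of y_k with H_m^T x_k, assuming the unknown vector stacks the matrix column by column.

```python
import numpy as np
from itertools import combinations

def antisymmetric_basis(p):
    """One possible basis of the p x p anti-symmetric matrices:
    for each pair i < j, set H[i, j] = -1 and H[j, i] = 1."""
    basis = []
    for i, j in combinations(range(p), 2):
        H = np.zeros((p, p))
        H[i, j], H[j, i] = -1.0, 1.0
        basis.append(H)
    return basis

def dlt_general(xs, ys):
    """Estimate a p x q matrix A, up to scale, with x_k proportional to A y_k.

    xs has shape (N, p) and ys has shape (N, q)."""
    xs = np.asarray(xs, dtype=float)
    ys = np.asarray(ys, dtype=float)
    p, q = xs.shape[1], ys.shape[1]
    basis = antisymmetric_basis(p)
    # x^T H A y = kron(y, H^T x) . vec(A), with vec stacking columns.
    rows = [np.kron(y, H.T @ x) for x, y in zip(xs, ys) for H in basis]
    _, _, Vt = np.linalg.svd(np.array(rows))
    return Vt[-1].reshape((p, q), order="F")

rng = np.random.default_rng(2)
A_true = rng.standard_normal((3, 4))           # hypothetical 3x4 matrix
ys = rng.standard_normal((10, 4))
alphas = rng.uniform(0.5, 2.0, size=10)
xs = (ys @ A_true.T) / alphas[:, None]

A_est = dlt_general(xs, ys)
alignment = abs(np.sum(A_est * A_true)) / (
    np.linalg.norm(A_est) * np.linalg.norm(A_true))
```

For p = 2 the basis contains the single matrix H used in the earlier example, so the function reduces to that case.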

Example p = 3

In the case that p = 3 the following three matrices

H_m

can be chosen

H_1=\begin{pmatrix}0&0&0\\ 0&0&-1\\ 0&1&0\end{pmatrix}

H_2=\begin{pmatrix}0&0&1\\ 0&0&0\\ -1&0&0\end{pmatrix}

H_3=\begin{pmatrix}0&-1&0\\ 1&0&0\\ 0&0&0\end{pmatrix}.

In this particular case, the homogeneous linear equations can be written as

0=[x_k]_\times A y_k

for

k=1,\ldots,N

where

[x_k]_\times

is the matrix representation of the vector cross product. Notice that this last equation is vector valued; the left hand side is the zero element in

\mathbb{R}^3.

Each value of k provides three homogeneous linear equations in the unknown elements of

A

. However, since

[x_k]_\times

has rank 2, at most two equations are linearly independent. In practice, therefore, it is common to use only two of the three matrices

H_m

, for example, m=1, 2. However, the linear dependency between the equations depends on

x_k

, which means that in unlucky cases it would have been better to choose, for example, m=2, 3. As a consequence, if the number of equations is not a concern, it may be better to use all three equations when the matrix

B

is constructed.
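The rank argument can be checked with a short sketch. It assumes the three basis matrices given above, builds the cross-product matrix as their weighted sum, and uses a hypothetical vector with a zero third component as the unlucky case in which the equations for m = 1, 2 become dependent.

```python
import numpy as np

H1 = np.array([[0.0, 0.0, 0.0], [0.0, 0.0, -1.0], [0.0, 1.0, 0.0]])
H2 = np.array([[0.0, 0.0, 1.0], [0.0, 0.0, 0.0], [-1.0, 0.0, 0.0]])
H3 = np.array([[0.0, -1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 0.0]])

def cross_matrix(x):
    """Matrix [x]_x such that cross_matrix(x) @ v equals np.cross(x, v)."""
    return x[0] * H1 + x[1] * H2 + x[2] * H3

x = np.array([1.0, 2.0, 3.0])       # illustrative values
v = np.array([-0.5, 4.0, 1.5])
same_as_cross = np.allclose(cross_matrix(x) @ v, np.cross(x, v))
full_rank = np.linalg.matrix_rank(cross_matrix(x))        # 2, never 3

# Unlucky case: third component zero, so rows 1 and 2 are parallel.
x_bad = np.array([1.0, 2.0, 0.0])
rank_m12 = np.linalg.matrix_rank(cross_matrix(x_bad)[:2])  # rows for m = 1, 2
rank_m23 = np.linalg.matrix_rank(cross_matrix(x_bad)[1:])  # rows for m = 2, 3
```

In this sketch the pair m = 1, 2 yields only one independent equation for `x_bad`, while m = 2, 3 yields two, which is the reason for keeping all three rows when the size of B is not a concern.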

The linear dependence between the resulting homogeneous linear equations is a general concern for the case p > 2 and has to be dealt with either by reducing the set of anti-symmetric matrices

H_m

or by allowing

B

to become larger than necessary for determining

a.

References

Richard Hartley and Andrew Zisserman (2003). Multiple View Geometry in Computer Vision. Cambridge University Press. ISBN 978-0-521-54051-3.

External links

Homography Estimation by Elan Dubrofsky (§2.1 sketches the "Basic DLT Algorithm")
A DLT Solver based on MATLAB by Hsiang-Jen (Johnny) Chien

Notes and References

Abdel-Aziz, Y.I.; Karara, H.M. (2015-02-01). "Direct Linear Transformation from Comparator Coordinates into Object Space Coordinates in Close-Range Photogrammetry". Photogrammetric Engineering & Remote Sensing. American Society for Photogrammetry and Remote Sensing. 81 (2): 103–107. ISSN 0099-1112. doi:10.14358/pers.81.2.103.