Tut Homography 04

(c) 2004 F.J. Estrada & A.D. Jepson & D.
Fleet
Planar Homographies Nov. 15, ’04
References:
• Your lecture notes on coordinate frames

• Your lecture notes on epipolar geometry
• http://www.robots.ox.ac.uk/˜vgg/projects/SingleView/
Planar Homographies 1
(c) 2004 F.J. Estrada & A.D. Jepson & D. Fleet
A quick review of projection
We need to determine the parameters that govern the projection

from points in the world to points in an image, a quick review of
coordinate frames and transformations is in order.
Homogeneous Coordinates:
x = (x1, x2, x3)T be a 3-D point in some reference

Let ~
frame, we can express ~ x in Homogeneous coordinates as
xH = (x1, x2, x3, 1)T .
~
A general 3D rigid transformation can be written as a 4 by 4

matrix multiplication using Homogeneous coordinates:
d~3,1

R3,3
M3D−trans = (1)
01,3 1
Where R is a 3 by 3 rotation matrix, and d = (dx, dy , dz )T ,

specifies translation components for the x, y , and z coordinates.
The rotation matrix in turn can be decomposed into three
matrices, each specifying a rotation around each of the coordinate
axes.
A transformation of the type described above is used to convert

coordinates between different coordinate frames. The first set
A quick review of projection 2

of parameters that governs the mapping of points in the world

to points in the image is given by a transformation from world
coordinates to camera coordinates:

Mext = R3,3 −R3,3 d~3,1 (2)
Where R is the rotation from world to camera coordinates, and

d~ contains the world coordinates of the camera’s nodal point.
Notice that Mext is a 3 by 4 matrix with 12 D.O.F., R and d~ are
known as the extrinsic camera parameters.

A quick review of projection
The second set of parameters that govern the mapping

from world points to image points is given by the specific
characteristics of the projection of light rays onto the image.
Figure 1 illustrates the projection process in a pinhole camera.
In this figure, C is the center of projection, X , Y , and Z are

the coordinate axes in camera coordinates, u and v are the
image coordinate axes, f is the focal length of the camera, and
a point ~xcam in camera coordinates projects to point ~ xim in
image coordinates.

The transformation that maps from camera coordinates to

image coordinates has 2 components, the first component
converts camera coordinates to image coordinates. If ~
xcam =
T
(x, y, z) , then ~
xim = (u, v) = (f x/z, f y/z). This
operation can be written as the 3 by 3 perspective projection
matrix:
 
f 0 0
Mproj = 0 f 0 (3)
0 0 1
The second component accounts for the size and shape of pixels,
and for the image coordinates. These effects are specified with
a 3 by 3 matrix:
 
1/l1 0 pc,1
Mim = 0 1/l2 pc,2  (4)
0 0 1
Where l1 is the width of the pixels, l2 is the height of the

pixels, and pc = (pc,1 , pc,2 ) is the point (in image coordinates)
where the optical axis of the camera intersects the image plane.
The parameters in the previous 2 matrices are known as the
camera’s intrinsic parameters. The complete mapping from a
point W in (homogeneous) world coordinates to point P im in

image coordinates is:
Pim = Mim · Mproj · Mext · W~T (5)

2-D Homographies
2-D homographies are projective transformations that map

points from one plane to another plane (for example the
transformation mapping points in a planar surface in the world
to the image plane). Figure 2 illustrates the geometry involved
in this process.
2-D Homographies 7
2-D Homographies
Consider the problem of determining the homography that

maps points in one image to the corresponding points in a
second image.
Assuming that we can identify corresponding points in both

images (let’s say, by detecting and matching interest points),
such a homography exists and can be computed, consider the
homography Hl mapping points on a plane to image points on
the left side image:
~
xl,i = αHl · q
~i, α>0 (6)
2-D Homographies 8
Where q ~i = (u, v, 1)T is a point on some plane π , and ~

xl,i =
0 0
(ul , vl , 1)T its projection onto the left image. In a similar
fashion, consider the homography Hr that maps points on π
to image points on the right side image:
~
xr,i = βHr · q
~i, β>0 (7)
Solving for q
~i in Eq. and substituting in Eq. we have:
α −1
~
xl,i = Hl · H r · ~
xr,i = γH~
xr,i (8)
β
Where H = Hl · Hr−1 is the homography that maps points on

the right side image to points on the left side image. Notice that
we can do this without ever knowing the location of the points
q
~i. Figure 5 shows the result of applying the homography to
the left side image, and a composite of both images showing
the correspondence.
2-D Homographies 9
Range: [20.6, 254] Range: [−173, 215]

Dims: [384, 256] Dims: [384, 256]
2-D Homographies 10
2-D Homographies
A 2-D homography is defined as a 3 by 3 homogeneous matrix

such that for any point ~ xi = (γu, γv, γ) on π , and its
corresponding point ~
xi0 on π0:
~
xi 0 = H · ~
xi (9)
2-D homographies have 8 D.O.F. (9 entries in the H matrix, but

the common scale factor is not relevant), hence, to determine
the homography we require 4 pairs of corresponding points.
Notice though that 3 collinear points in either plane result in
a configuration with no unique solution.
Homographies can be applied to many problems in computer

vision including stereo reconstruction, image mosaics, and
applications using perspective geometry. Figure 3 shows an
example of using 2-D planar homographies to rectify an image
region (remove distortion due to perspective projection).
2-D Homographies 11
In the above example, the 4 corners of the window are

mapped to the corners of a rectangular polygon, however, any
rectangular polygon will offer 4 correspondence pairs. The
aspect ratio for the rectangular polygon can not be determined
from one view of the 4 corners of the window.
2-D Homographies 12
Perspective Structure
Finally, let’s take a look at an image taken under perspective

projection and observe some of its characteristics.
One important property of perspective projection is that

parallel lines converge to some point in the image (though
the point of convergence may be at infinity as is the case for
horizontal and vertical lines in the image above).
The image location of these intersection points (usually called

vanishing points) determine the orientation in 3D of the set of
parallel lines that converge to it. It is also possible to determine
Perspective Structure 13
from pairs of vanishing points the 3D orientation of planar

surfaces bounded by parallel lines converging to either of the
vanishing points. This enables us to do take the above image
and turn it into a 3D model as shown below.
..
This type of reconstruction relies heavily on planar

homographies to determine the correct mapping between image
pixels and reconstructed 3D surfaces. For more information on
this type of reconstruction, visit the link shown in the first
slide.
Perspective Structure 14

Tut Homography 04

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Tut Homography 04

Uploaded by

Copyright:

Available Formats

(c) 2004 F.J. Estrada & A.D. Jepson & D.

Planar Homographies Nov. 15, ’04

• Your lecture notes on coordinate frames

A quick review of projection

We need to determine the parameters that govern the projection

x = (x1, x2, x3)T be a 3-D point in some reference

A general 3D rigid transformation can be written as a 4 by 4

Where R is a 3 by 3 rotation matrix, and d = (dx, dy , dz )T ,

A transformation of the type described above is used to convert

A quick review of projection 2

of parameters that governs the mapping of points in the world

Where R is the rotation from world to camera coordinates, and

A quick review of projection 3

A quick review of projection

The second set of parameters that govern the mapping

In this figure, C is the center of projection, X , Y , and Z are

A quick review of projection 4

The transformation that maps from camera coordinates to

Where l1 is the width of the pixels, l2 is the height of the

A quick review of projection 5

image coordinates is:

Pim = Mim · Mproj · Mext · W~T (5)

A quick review of projection 6

2-D homographies are projective transformations that map

Consider the problem of determining the homography that

Assuming that we can identify corresponding points in both

Where q ~i = (u, v, 1)T is a point on some plane π , and ~

Where H = Hl · Hr−1 is the homography that maps points on

Range: [20.6, 254] Range: [−173, 215]

A 2-D homography is defined as a 3 by 3 homogeneous matrix

2-D homographies have 8 D.O.F. (9 entries in the H matrix, but

Homographies can be applied to many problems in computer

In the above example, the 4 corners of the window are

Finally, let’s take a look at an image taken under perspective

One important property of perspective projection is that

The image location of these intersection points (usually called

from pairs of vanishing points the 3D orientation of planar

This type of reconstruction relies heavily on planar

You might also like