Professional Documents
Culture Documents
*Also at Manila Observatory, Upper Atmosphere Division, Ateneo de Manila University Campus
e-mail: qsugon@observatory.ph
December 3, 2008
1
Here, the action of the right-acting matrix is defined variables, we shall show that we may recast Lagrange’s
by the action of its left-acting transpose, as given in theorem in Eq. (16) in terms of partial derivatives:
Symon[7]:
x′ ∂(n′ α′ )
MT · r = r · M. (9) = −1; x, x′ = constants. (17)
x ∂(nα)
Because the ray operators are 2 × 2 matrices, we
may decompose them as a linear combination of single- The quantity |xδ(nα)| is the Lagrange invariant.
element, unit matrices, as done by Campbell[8] and d. Outline. We shall divide the paper into five sec-
Harris[9]: tions. The first section is Introduction. In the second
section, we discuss the algebra of right-acting matrices
M = M11 e11 + M12 e12 + M21 e21 + M22 e22 , (10) and their actions on column vectors. In the third section,
we shall introduce the complex ray vector and its right-
where acting propagation and refraction matrices. We shall use
the properties of Fermion creation-annihilation matrices
1 0 0 1
e11 = , e12 = , (11) to compute the system matrices of thin and thick lenses.
0 0 0 0
In the fourth section, we shall revisit the classification of
0 0 0 0 optical systems: telescopic, Fourier transforming, inverse
e21 = , e22 = (12)
1 0 0 1 Fourier transforming, and imaging. We shall derive their
Lagrange theorems and see if we can define their corre-
are the Fermion matrices in Sakurai[10] and Le
sponding Lagrange invariants. We shall also rederive the
Bellac[11]. The dyadics e12 and e21 may represent either
Moebius transform and the Newton’s equation for the
the creation operator ↠or the annihilation operator â,
imaging system. The fifth section is Conclusions.
depending on the column matrix representations of e1
and e2 .
For the ray vector r = xe1 + nαe2 , the left-acting 2 Matrix Algebra
propagation and refraction matrices are given in Klein
and Furtak (transposed matrices)[12]: Let e1 and e2 be two orthonormal vectors represented as
column matrices,
T = 1 + e12 D/n, (13)
1 0
R = 1 − e21 P, (14) e1 = , e2 = , (18)
0 1
where 1 is the unit matrix. Thus, a lens system becomes and let e11 , e12 , e21 , and e22 be the four Fermion matrices
a product of propagation and refraction matrices: in Eqs. (11) and (12). The left and right action of the
dyadic operators on e1 and e2 are defined by the following
M = Tn Rn · · · T2 R2 T1 R1
n
relations[16]:
Y
= (1 + e12 Dk /nk )(1 + e21 Pk ), (15) eλ · eµν = δλµ eν , (19)
k=1
eµν · eλ = δνλ eµ , (20)
Later, we shall recast these equations for our com-
plex ray vectors and right-acting matrices. We shall also and
present new methods for computing the system matrix · eµ′ ν ′ · eµν = δν ′ µ (· eµ′ ν ), (21)
M. These methods are based on the Fermion identies
eµ′ ν ′ · eµν · = δν ′ µ eµ′ ν ·, (22)
satisfied by the four dyadic operators e11 , e12 , e21 , and
e22 . In particular, we shall study the allowed combina- where λ, µ, ν ∈ {1, 2}. Notice that we can rederive these
tions of e12 Dk /nk and e12 Pk′ and determine the matrix relations if we adopt the definitions
component basis ekk′ of the product chain.
c. Lagrange Invariants. The Lagrange theorem · eµν = · eµ eν , (23)
or the Smith-Helmholtz relationship[13, 14] is stated by eµν · = eµ eν ·, (24)
Welford[15] as
nuη = n′ u′ η ′ , (16) with the understanding that the dot product takes prece-
dence over the juxtaposition (geometric) product.
where n is refractive index of the input medium, the ob- Let ·M be a right-acting 2 × 2 matrix and let MT · be
ject height, u is the angle subtended from the object, and its left-acting transpose:
η is the height of the object; their primed counterparts
correspond to those of the image. The quantity nuη is ·M = M11 e11 + M12 e12 + M21 e21 + M22 e22 ,(25)
T
called the Lagrange invariant. Later, using our x and nα M · = M11 e11 + M12 e21 + M21 e12 + M22 e22 .(26)
2
The action of these two matrices on the vector α2
r = x1 e1 + x2 e2 (27)
are related by
x3
′ T α1
r = M · r = r · M, (28) x2
3
where where M is a 2 × 2 matrix. That is,
η · e3
cηz = (46) ′
|η · e3 | x x A −iC
= , (56)
n′ α′ i nαi −iB D
is the sign function describing the relative direction of
the outward normal vector η to the spherical interface of so that
radius R, with respect to the optical axis e3 . If cηz = +1,
η is approximately along e3 ; if cηz = −1, η is opposite. x′ = Ax + Bnα, (57)
′ ′
Using the definition of the ray cliffor r̂ in Eq. (32), nα = −Cx + Dnα. (58)
Eqs. (42) and (43) may be combined into Notice that these equations are similar to those in the
′
r̂ = r̂ · MP = r̂ · (1 + P), (47) literature, save for the sign of C.
In general, the system matrix M is a product of prop-
where agation and refraction operators (c.f. [23]):
P = −iP e12 . (48) N N N
Y Y Y
Sk Pk
That is, M= MSk MPk = e e = (1 + Sk )(1 + Pk ),
x′
x 1 −iP k=1 k=1 k=1
′ ′ = . (49) (59)
nαi nαi 0 1
where
Again, except for the −i, the refraction matrix is similar sk
to that given by Hecht[20]. Sk = e21 ,
−iSk e21 = −i (60)
nk
We may also rewrite the propagation and refraction nk − nk+1
matrices by using the definition of the exponential of the Pk = −iPk e12 = −icηk e12 . (61)
R
matrix A as
Note that the determinant of M is unity[24],
A2 A3
eA = 1 + A + + + ..., (50) N
2! 3!
Y
|M| = |eSk ||ePk | = 1, (62)
2 k=1
so that if A = 0, then
because its factors have a determinant of unity,
eA = 1 + A. (51)
|eSk | = |ePk | = 1. (63)
Thus, the exponential of a null-square matrix is the sum
Let us take some special cases. If Pk = 0 for all k,
of the matrix itself and the unit matrix.
then
Because e12 and e21 are null-square matrices, N
Y N
Y
M= MSk = eSk = eS = 1 + S, (64)
e212 = e221 = 0, (52) k=1 k=1
4
3.4 Fermion Combinatorics That is,
In general, we cannot simply add the arguments of the 1 −iP1
M= . (77)
exponentials in Eq. (59) because −iS1 1 − S1 P1
Sk Pℓ 6= Pℓ Sk . (68)
Case k = 2. The matrix M is
for all subscripts k and ℓ. So instead of the product-
of-exponentials form, we shall use the binomial product
M = (1 + S1 )(1 + P1 )(1 + S2 )(1 + P2 )
form and study how to expedite its expansion.
To expand the binomial product form in Eq. (59), we = 1 + (S1 + P1 + S2 + P2 )
note several things, which we shall label as rules: + (S1 P1 + S1 P2 + S2 P2 + P1 S2 )
Rule 1. The only allowed products are those with + (S1 P1 S2 + P1 S2 P2 ) + S1 P1 S2 P2 . (78)
alternating S and P factors, because
In Fermion basis, this is
Sk Sℓ = Pk Pℓ = 0. (69)
M = e11 (1 + (−i)2 P1 S2 )
Rule 2. Since matrix multiplication is not generally + e12 (−iP1 − iP2 + (−i)3 P1 S2 P2 )
commutative, then the order of the factors must be pre-
+ e21 (−iS1 − iS2 + (−i)3 S1 P1 S2 )
served. This means that the values of the subscript k
must be increasing from left to right; if the subscripts + e22 (1 + (−i)2 S1 P1 + (−i)2 S1 P2 + (−i)2 S2 P2
are the same, Sk should come before Pk . + (−i)4 S1 P1 S2 P2 ). (79)
Rule 3. The matrix Sk is an −ie21 quantity; Pk is a
−ie12 quantity. Because the allowed products are those That is,
with alternating S and P factors, then the Fermion basis
of the product depends only on the first and last factors: (1 − P1 S2 ) −i(P1 + P2 − P1 S2 P2 )
Sk · · · Sℓ , (−i)L e21 , (70) M=
S1 + S2 1 − S1 P1 − S1 P2 − S2 P2
−i
Sk · · · Pℓ , (−i)L e22 , (71) −S1 P1 S2 + S1 P1 S2 P2
Pk · · · Sℓ , (−i)L e11 , (72) (80)
L Check. To check our computations, we set S1 = 0 in
Pk · · · Pℓ , (−i) e12 , (73)
Eq. (80) to get
where L is the number of factors in the product chain.
(1 − P1 S2 ) −i(P1 + P2 − P1 S2 P2 )
Rule 4. The unit number 1 is a sum of e11 and e22 ,
Mthick = , (81)
1 = e11 + e22 . (74) −iS2 (1 − S2 P2 )
These four rules let us compute the system matrix in a Notice that except for the −i, Eq. (81) is similar in form
systematic way, by simply listing down the allowed com- to the system matrix for a thick lens as given in Klein and
binations of S and P. Furtak (they use left-acting matrices and angle-height
vectors)[25].
3.5 Thin and Thick Lenses Furthermore, if we set S2 = 0 in Eq. (81), then we
obtain the corresponding thin lens matrix[26]:
To illustrate our four combinatorial rules, let us compute
the expansions of M in Eq. (59) for k = 1 and k = 2. 1 −i(P1 + P2 )
Mthin = (82)
0 1
Case k = 1. The matrix M is
Comparison of Eq. (82) with the refraction matrix oper-
M = (1 + S1 )(1 + P1 ) ator MP in Eq. (49) yields the thin lens power
= 1 + (S1 + P1 ) + S1 P1 . (75)
P = P1 + P2 , (83)
In Fermion basis, this is
which is what we expect. That is, the total power of a
M = e11 (1) + e12 (−i)P1 thin lens is equal to the sum of the powers of its refracting
+ e21 (−i)S1 + e22 (1 + (−i)2 S1 P1 ). (76) surfaces.
5
4 Optical Systems Second, the elements of the system matrix are given by
So far, we have considered the input and output rays to A = M11 − M12 S ′ , (88)
be very close to the optical black box. We shall now relax B = M21 + M22 S ′ + M11 S − M12 SS ′ , (89)
this restriction.
C = M12 , (90)
Let the matrix Mbox describe a black box:
D = M22 − M12 S. (91)
M11 −iM22
Mbox = , (84) Now, the input and output rays are related to the sys-
−iM21 M22
tem matrix M by Eqs. (57) and (58). Our aim is to
which satisfies Eq. (62), use these equations to give a geometric interpretation to
the Mbox parameters M11 , M12 , M21 , and M22 . To do
|Mbox | = M11 M22 + M12 M21 = 1. (85) this, we shall choose from the set {x, x′ , α, α′ } a pair of
variables and set the other variables constant. We shall
Notice that unlike the determinant of standard system consider four cases:
matrices with real coefficients, the determinant of Mbox
in Eq. (85) is not a difference but a sum. x′ = x′ (x); α, α′ = constants (92)
x′ = x′ (α); x, α′ = constants (93)
α′ = α′ (x); α, x′ = constants (94)
′ ′ ′
α = α (α); x, x = constants. (95)
n Black Box n′
4.1 Telescopic System
s s′
6
Thus, in a telescopic system, the ratio of the outgoing and and (88) to (91), we arrive at
ingoing rays’s index-dilated angles is the angular magni-
fication M22 [32]; the ratio of the change in the output M22
S = ≡ f, (104)
and input heights is M11 . M12
If we multiply Eqs. (98) and (99), we get n′ α′
= −M12 , (105)
x
n′ α′ ∂x′ ∂x′ |Mbox | 1
= 1 = |Mbox |, (100) = = . (106)
nα ∂x ∂(nα) M12 M12
Thus, in an inverse Fourier transforming system, a point
because M12 = 0. Equation (100) is the Lagrange’s the- light source at the input focal distance f = M22 /M12
orem for a telescopic system. An alternative formulation to the left of the black box transforms to a bundle of
of this theorem is parallel rays at the output[33]; the ratio of the optical
inclination angle n′ α′ of the output beam to the height
n′ α′ δx′ = nα δx, (101) x of the point source is −M12 ; the change in the height
x′ of the output ray with respect to the change in the
where nα δx is the corresponding Lagrange invariant. dilated angle nα of the input ray is 1/M12 .
That is, if the angles of the input and output rays are If we multiply Eqs. (105) and (106), we get
positive constants, then the change in height x of the
input ray is proportional to the change in height of the n′ α′ ∂x′
= −1. (107)
output ray x′ . x ∂(nα)
7
transforming system. This means that we may differenti- Using these equations, together with Eqs. (57) to (58)
ate Eqs. (57) and (58) with respect to x to obtain[30, 31] and (88) to (91), we arrive at[34, 35]
8
Second, we may rewrite Eq. (119) as 5 Conclusions
(M12 S − M22 )(M12 S ′ − M11 ) = |Mbox | = 1. (122) We used the orthonormality axiom in geometric alge-
bra to show that the height-angle vector of a light ray
In terms of the focal lengths f in Eq. (104) and f ′ in must be complex. We proposed that the matrix oper-
Eq. (111), Eq. (122) becomes ators to this vector should be right-acting, in order to
follow the sequence of surfaces traversed by a light ray
1 as it moves from left-to-right close to the optical axis.
(S − f )(S ′ − f ′ ) = ZZ ′ = 2 . (123)
M12 We showed that the propagation and refraction matrix
operators may be expressed as a sum of a unit matrix
This is similar to the Newton’s lens equation[38] and a imaginary non-diagonal Fermion matrix. We de-
veloped combinatorial rules for finding the product of a
ZZ ′ = F F ′ . (124) succession of propagation and refraction matrices, with-
out doing explicit matrix multiplication. This product is
Note that the focal lengths f and f ′ are measured from the right-acting system matrix with real and imaginary
the left-most and right-most side of the optical black box, coefficients.
while F and F ′ are measured from the left and right We factored out the system matrix as a product of the
gaussian planes. input propagation matrix, the black box matrix, and the
output propagation matrix. Based on the coefficients of
4.5 Summary the system matrix, we classified the optical systems into
four: telescopic, inverse Fourier transforming, Fourier
The system matrix M is defined as the ABCD matrix transforming, and imaging. We showed that all four sys-
operating on input ray vector r̂ to yield the output ray tems have a corresponding Lagrange theorem expressed
vector r̂′ in partial derivatives, and that these theorems may be
′ combined into one. We transformed these theorems in
x x A −iC
= , (125) terms of Lagrange differentials, which allows us to ge-
in′ α′ inα −iB D ometrically interpret the effect of the change in input
variable to the change in the output variable. We showed
where
that these differential relations result to the Lagrange in-
∂x′ ∂(n′ α′ ) variants only for the telescopic and imaging systems.
A= , C=− , (126) In a future work, we shall revisit paraxial skew ray
∂x ∂x
∂x′ ∂(n′ α′ ) tracing using complex vectors and right-acting matrices.
B= , D= . (127)
∂(nα) ∂(nα)
Acknowledgments
Because the system matrix has a unit determinant, then
This research was supported by the Manila Observatory
∂x′ ∂(n′ α′ ) ∂(n′ α′ ) ∂x′ and by the Physics Department of Ateneo de Manila Uni-
1= − , (128)
∂x ∂(nα) ∂x ∂(nα) versity.
as given by Goodman.[37]
The four Lagrange theorems for the four optical sys- References
tem types may be combined into one:
[1] Allen Nussbaum and Richard A. Phillips, Contempo-
n′ α′ ∂x′ rary Optics for Scientists and Engineers (Prentice-Hall,
1 = ; α, α′ = const.
nα ∂x Englewood Cliffs, NJ, 1976), p. 10.
n′ α′ ∂x′
= − ; x, α′ = const. [2] William E. Baylis, J. Huschilt, and Jiansu Wei, “Why
x ∂(nα)
i?” Am. J. Phys. 60(9), 788–797 (1992). See p. 789.
x′ ∂(n′ α′ ) Baylis et al’s form is ej ek + ek ej = 2δjk .
= − ; α, x′ = const.
nα ∂x
x′ ∂(n′ α′ ) [3] Amnon Yariv, Quantum Electronics (John Wiley, New
= − ; x, x′ = const. (129) York, 1989), pp. 106–107.
x ∂(nα)
[4] Bernard Jancewicz, Multivectors and Clifford Algebra
We shall refer to Eq. (129) as the unified Lagrange the- in Electrodynamics (World Scientific, Singapore, 1988),
orem for optical systems. p. 32.
9
[5] David Hestenes, “Oersted medal lecture 2002: Reform- [19] Quirino M. Sugon Jr. and Daniel J. McNamara, “Parax-
ing the mathematical language of physics,” Am. J. Phys. ial meridional ray tracing equations from the uni-
71(2), 104–121 (2003). See p. 110. fied reflection-refraction law via geometric algebra,”
arXiv:0810.5224v1 [physics.optics] (29 Oct 2008). See
[6] Quirino M. Sugon Jr., Carlo B. Fernandez, and Daniel J.
p. 3.
McNamara, “A geometric algebra reformulation of 2 × 2
matrices: the dihedral group D4 in bra-ket notation,” [20] Eugene Hecht, Optics, 2nd ed., with contributions by
arXiv:0811.3680v1 [math-ph] (24 Nov 2008). See p. 9. Alfred Zajac (Addison Wesley, Reading, MA, 1987), p.
217.
[7] Keith R. Symon, Mechanics (Addison-Wesley, Reading,
MA, 1971), 3rd ed., p. 408. [21] See Ref. [19], p. 4.
[8] Charles E. Campbell, “Ray vector fields,” J. Opt. Soc. [22] R. Simon and Kurt Bernardo Wolf, “Structure of the
Am. A 11(2), 618–622 (1994). set of paraxial optical systems,” J. Opt. Soc. Am. 17(2),
342-354 (2000). See pp. 345-347.
[9] W. F. Harris, “Dioptric power: its nature and its rep-
resentation in three- and four-dimensional space,” Opt. [23] See Ref. [20], p. 219.
Vis. Sci., 74(6), 349–366 (1997). See p. 357.
[24] J. Warren Blaker and William M. Rosenblum, Optics:
[10] Jun John Sakurai, Advanced Quantum Mechanics An Introduction for Students of Engineering (MacMil-
(Addison-Wesley, Reading, MA, 1967), p. 28. lan, New York, 1993), p. 37.
[11] Michel Le Bellac, Quantum and Statistical Field Theory [25] See Ref. [12], p. 156.
(Clarendon, Oxford, 1991), p. 404.
[26] See Ref. [12], p. 157.
[12] Miles V. Klein and Thomas E. Furtak, Optics, (John
[27] Joseph W. Simmons and Mark J. Guttmann, States,
Wiley, New York, 1986), 2nd ed., p. 152.
Waves and Photons: A Modern Introduction to Light
[13] Jurgen R. Meyer-Arendt, Introduction to Classical and (Addison-Wesley, Reading, MA, 1970), p. 8.
Modern Optics, 2nd ed. (Prentice-Hall, Englewood
[28] Ajoy Ghatak, Optics (Tata, New Delhi, 1977), p. 52.
Cliffs, NJ, 1984), p. 55.
[29] See Ref. [1], p. 52.
[14] Max Born and Emil Wolf, Principles of Optics: Elec-
tromagnetic Theory of Propagation, Interference and [30] Douglas S. Goodman, “General Principles of Geometric
Diffraction of Light (Pergamon, Oxford, 1964), p. 165. Optics,” in Handbook of Optics, chapter 1 of vol. 1,
Fundamentals, Techniques, and Design (McGraw-Hill,
[15] W. T. Welford, Aberrations of the Optical System (Aca-
New York, 1995), p. 71.
demic, London, 1974), pp. 22–23.
[31] Moshe Nazarathy and Joseph Shamir, “First-order
[16] See Ref. [6], p. 8.
optics—a canonical operator representation: lossless
[17] Quirino M. Sugon Jr. and Daniel J. McNamara, “A ge- systems,” J. Opt. Soc. Am. 72(3), 356–364 (1982). See
ometric algebra reformulation of geometric optics,” Am. p. 362.
J. Phys. 72(1), 92-97. See p. 95. The concavity sign func-
[32] See Ref. [27], p. 11.
tion cση is defined as the normalized dot product of the
incident ray σ and the normal vector η. [33] See Ref. [27], p. 10.
[18] Quirino M. Sugon Jr. and Daniel J. McNamara, ”Ray [34] See Ref. [1], p. 16.
tracing in spherical interfaces using geometric algebra,”
in Advances in Imaging and Electron Physics 139, 179– [35] See Ref. [28], p. 56.
224. See pp. 206. The concavity function cση is general- [36] See Ref. [27], p. 12.
ized to the relative direction sign function cvw for two
vectors v and w. [37] See Ref. [30], p. 45.
[38] See Ref. [13], p. 48.
10