Hamiltonian optics[1] and Lagrangian optics[2] are two formulations of geometrical optics which share much of the mathematical formalism with Hamiltonian mechanics and Lagrangian mechanics.
See main article: Hamilton's principle.
In physics, Hamilton's principle states that the evolution of a system
\left(q1{\left(\sigma\right)},...,qN{\left(\sigma\right)}\right)
N
q |
k=dqk/d\sigma
L
\deltaS=0
k=1,...,N
The momentum is defined asand the Euler–Lagrange equations can then be rewritten aswhere
p |
k=dpk/d\sigma
A different approach to solving this problem consists in defining a Hamiltonian (taking a Legendre transform of the Lagrangian) asfor which a new set of differential equations can be derived by looking at how the total differential of the Lagrangian depends on parameter σ, positions
qi
q |
i
k=1,...,N
The general results presented above for Hamilton's principle can be applied to optics.[3] [4] In 3D euclidean space the generalized coordinates are now the coordinates of euclidean space.
See main article: Fermat's principle.
Fermat's principle states that the optical length of the path followed by light between two fixed points, A and B, is a stationary point. It may be a maximum, a minimum, constant or an inflection point. In general, as light travels, it moves in a medium of variable refractive index which is a scalar field of position in space, that is,
n=n\left(x1,x2,x3\right)
s=\left(x1\left(x3\right),x2\left(x3\right),x3\right)
A=\left(x1\left(x3A\right),x2\left(x3A\right),x3A\right)
B=\left(x1\left(x3B\right),x2\left(x3B\right),x3B\right)
x1
x2
qk
x3
\sigma
In the context of calculus of variations this can be written as[2] where is an infinitesimal displacement along the ray given by andis the optical Lagrangian and
x |
k=dxk/dx3
The optical path length (OPL) is defined aswhere n is the local refractive index as a function of position along the path between points A and B.
The general results presented above for Hamilton's principle can be applied to optics using the Lagrangian defined in Fermat's principle. The Euler-Lagrange equations with parameter σ =x3 and N=2 applied to Fermat's principle result inwith and where L is the optical Lagrangian and
x |
k=dxk/dx3
The optical momentum is defined asand from the definition of the optical Lagrangian this expression can be rewritten as
or in vector formwhere
\hat{e
The expression for the optical path length can also be written as a function of the optical momentum. Having in consideration that
x |
3=dx3/dx3=1
Similarly to what happens in Hamiltonian mechanics, also in optics the Hamiltonian is defined by the expression given above for corresponding to functions
x1{\left(x3\right)}
x2{\left(x3\right)}
Comparing this expression with
L=x |
1
p | |||
|
2p2+p3
And the corresponding Hamilton's equations with parameter σ =x3 and k=1,2 applied to optics are[5] [6] with
x |
k=dxk/dx3
p |
k=dpk/dx3
It is assumed that light travels along the x3 axis, in Hamilton's principle above, coordinates
x1
x2
qk
x3
\sigma
See main article: Snell's law and Specular reflection. If plane x1x2 separates two media of refractive index nA below and nB above it, the refractive index is given by a step functionand from Hamilton's equationsand therefore
p |
k=0
pk=Constant
An incoming light ray has momentum pA before refraction (below plane x1x2) and momentum pB after refraction (above plane x1x2). The light ray makes an angle θA with axis x3 (the normal to the refractive surface) before refraction and an angle θB with axis x3 after refraction. Since the p1 and p2 components of the momentum are constant, only p3 changes from p3A to p3B.
Figure "refraction" shows the geometry of this refraction from which
d=\|pA\|\sin\thetaA=\|pB\|\sin\thetaB
\|pA\|=nA
\|pB\|=nB
In figure "refraction", the normal to the refractive surface points in the direction of axis x3, and also of vector
v=pA-pB
n=v/\|v\|
pB
pA
n
A similar argument can be used for reflection in deriving the law of specular reflection, only now with nA=nB, resulting in θA=θB. Also, if i and r are unit vectors in the directions of the incident and refracted ray respectively, the corresponding normal to the surface is given by the same expression as for refraction, only with nA=nB
In vector form, if i is a unit vector pointing in the direction of the incident ray and n is the unit normal to the surface, the direction r of the refracted ray is given by:[3] with
If i⋅n<0 then −n should be used in the calculations. When
\Delta<0
From the definition of optical path length
\partialL/\partialxk=dpk/dx3
\partialH/\partialx3=-\partialL/\partialx3
H=-p3
Since p is a vector tangent to the light rays, surfaces S=Constant must be perpendicular to those light rays. These surfaces are called wavefronts. Figure "rays and wavefronts" illustrates this relationship. Also shown is optical momentum p, tangent to a light ray and perpendicular to the wavefront.
Vector field
p=\nablaS
This result may be applied to a closed path ABCDA as in figure "optical path length"
for curve segment AB the optical momentum p is perpendicular to a displacement ds along curve AB, or
p ⋅ ds=0
p ⋅ ds=nds
p ⋅ ds=-nds
p ⋅ ds=nds
See main article: Phase space.
Figure "2D phase space" shows at the top some light rays in a two-dimensional space. Here x2=0 and p2=0 so light travels on the plane x1x3 in directions of increasing x3 values. In this case
2=n | |
p | |
3 |
2
p=(p1,p3)
\|p\|=n
For example, ray rC crosses axis x1 at coordinate xB with an optical momentum pC, which has its tip on a circle of radius n centered at position xB. Coordinate xB and the horizontal coordinate p1C of momentum pC completely define ray rC as it crosses axis x1. This ray may then be defined by a point rC=(xB,p1C) in space x1p1 as shown at the bottom of the figure. Space x1p1 is called phase space and different light rays may be represented by different points in this space.
As such, ray rD shown at the top is represented by a point rD in phase space at the bottom. All rays crossing axis x1 at coordinate xB contained between rays rC and rD are represented by a vertical line connecting points rC and rD in phase space. Accordingly, all rays crossing axis x1 at coordinate xA contained between rays rA and rB are represented by a vertical line connecting points rA and rB in phase space. In general, all rays crossing axis x1 between xL and xR are represented by a volume R in phase space. The rays at the boundary ∂R of volume R are called edge rays. For example, at position xA of axis x1, rays rA and rB are the edge rays since all other rays are contained between these two. (A ray parallel to x1 would not be between the two rays, since the momentum is not in-between the two rays)
In three-dimensional geometry the optical momentum is given by
p=(p1,p2,p3)
2=n | |
p | |
3 |
2
See main article: Etendue.
Figure "volume variation" shows a volume V bound by an area A. Over time, if the boundary A moves, the volume of V may vary. In particular, an infinitesimal area dA with outward pointing unit normal n moves with a velocity v.
This leads to a volume variation
dV=dA(v ⋅ n)dt
The rightmost term is a volume integral over the volume V and the middle term is the surface integral over the boundary A of the volume V. Also, v is the velocity with which the points in V are moving.
In optics coordinate
x3
(x1,x2,p1,p2)
v=(x |
1,
x |
2,
p |
1,
p |
2)
x3
dx1
x1
dx2
x2
dp1
p1
dp2
p2
dV=dx1dx2dp1dp2
V
dV/dx3=0
dV=dx1dx2dp1dp2=Constant
The volume occupied by a set of rays in phase space is called etendue, which is conserved as light rays progress in the optical system along direction x3. This corresponds to Liouville's theorem, which also applies to Hamiltonian mechanics.
See main article: Nonimaging optics.
Figure "conservation of etendue" shows on the left a diagrammatic two-dimensional optical system in which x2=0 and p2=0 so light travels on the plane x1x3 in directions of increasing x3 values.
Light rays crossing the input aperture of the optic at point x1=xI are contained between edge rays rA and rB represented by a vertical line between points rA and rB at the phase space of the input aperture (right, bottom corner of the figure). All rays crossing the input aperture are represented in phase space by a region RI.
Also, light rays crossing the output aperture of the optic at point x1=xO are contained between edge rays rA and rB represented by a vertical line between points rA and rB at the phase space of the output aperture (right, top corner of the figure). All rays crossing the output aperture are represented in phase space by a region RO.
Conservation of etendue in the optical system means that the volume (or area in this two-dimensional case) in phase space occupied by RI at the input aperture must be the same as the volume in phase space occupied by RO at the output aperture.
In imaging optics, all light rays crossing the input aperture at x1=xI are redirected by it towards the output aperture at x1=xO where xI=m xO. This ensures that an image of the input is formed at the output with a magnification m. In phase space, this means that vertical lines in the phase space at the input are transformed into vertical lines at the output. That would be the case of vertical line rA rB in RI transformed to vertical line rA rB in RO.
In nonimaging optics, the goal is not to form an image but simply to transfer all light from the input aperture to the output aperture. This is accomplished by transforming the edge rays ∂RI of RI to edge rays ∂RO of RO. This is known as the edge ray principle.
Above it was assumed that light travels along the x3 axis, in Hamilton's principle above, coordinates
x1
x2
qk
x3
\sigma
A more general situation can be considered in which the path of a light ray is parametrized as
s=\left(x1{\left(\sigma\right)},x2{\left(\sigma\right)},x3{\left(\sigma\right)}\right)
x1
x2
x3
qk
L=nds/d\sigma
x |
k=dxk/d\sigma
x1{\left(\sigma\right)}
x2{\left(\sigma\right)}
x3{\left(\sigma\right)}
And the corresponding Hamilton's equations with k=1,2,3 applied optics arewith
x |
k=dxk/d\sigma
p |
k=dpk/d\sigma
The optical Lagrangian is given byand does not explicitly depend on parameter σ. For that reason not all solutions of the Euler-Lagrange equations will be possible light rays, since their derivation assumed an explicit dependence of L on σ which does not happen in optics.
The optical momentum components can be obtained fromwhere
x |
k=dxk/d\sigma
Comparing this expression for L with that for the Hamiltonian P it can be concluded that
From the expressions for the components
pk
The optical Hamiltonian is chosen as
although other choices could be made.[3] [4] The Hamilton's equations with k = 1, 2, 3 defined above together with
P=0
\left(q1\left(\sigma\right),q2\left(\sigma\right),q3\left(\sigma\right)\right)
\left(u1\left(\sigma\right),u2\left(\sigma\right),u3\left(\sigma\right)\right)
where the optical momentum is given byand
\hat{e
\hat{e
\hat{e
ukak/n
p
\hat{e