Matrix representation of Maxwell's equations explained

In electromagnetism, a branch of fundamental physics, the matrix representations of the Maxwell's equations are a formulation of Maxwell's equations using matrices, complex numbers, and vector calculus. These representations are for a homogeneous medium, an approximation in an inhomogeneous medium. A matrix representation for an inhomogeneous medium was presented using a pair of matrix equations.^[1] A single equation using 4 × 4 matrices is necessary and sufficient for any homogeneous medium. For an inhomogeneous medium it necessarily requires 8 × 8 matrices.^[2]

Introduction

Maxwell's equations in the standard vector calculus formalism, in an inhomogeneous medium with sources, are:^[3]

\begin{align} &{\nabla} ⋅ {D}\left({r},t\right)= \rho\\ &{\nabla} x {H}\left({r},t\right) -

	\partial
	\partialt

{D}\left({r},t\right)= {J}\\ &{\nabla} x {E}\left({r},t\right) +

	\partial
	\partialt

{B}\left({r},t\right)=0\\ &{\nabla} ⋅ {B}\left({r},t\right)=0. \end{align}

The media is assumed to be linear, that is

{D}=\varepsilonE, B=\muH

where scalar

\varepsilon=\varepsilon(r,t)

is the permittivity of the medium and scalar

\mu=\mu(r,t)

the permeability of the medium (see constitutive equation). For a homogeneous medium

\varepsilon

and

\mu

are constants.The speed of light in the medium is given by

v({r},t)=

	1
	\sqrt{\varepsilon({r

,t)\mu({r},t)}}

In vacuum,

\varepsilon₀=

8.85 × 10⁻¹² C²·N⁻¹·m⁻² and

\mu₀=4\pi

× 10⁻⁷ H·m⁻¹

One possible way to obtain the required matrix representation isto use the Riemann–Silberstein vector^[4] given by

\begin{align} {F}⁺\left({r},t\right) &=

	1
	\sqrt{2

}\left[\sqrt{\varepsilon ({\mathbf r}, t)\,}\,{\mathbf E} \left({\mathbf r}, t \right) + {\rm i}\,\frac{1}{\sqrt{\mu ({\mathbf r}, t)\,}} {\mathbf B} \left({\mathbf r}, t \right) \right] \\^ \left(t \right)& =\frac\left[\sqrt{\varepsilon ({\mathbf r}, t)\,}\,{\mathbf E} \left({\mathbf r}, t \right) - {\rm i}\,\frac{1}{\sqrt{\mu ({\mathbf r}, t)\,}} {\mathbf B} \left({\mathbf r}, t \right) \right]\,.\end

If for a certain medium

\varepsilon=\varepsilon(r,t)

and

\mu=\mu(r,t)

are scalar constants (or can be treated as local scalar constants under certain approximations), then the vectors

{F}^\pm(r,t)

satisfy

\begin{align} {\rmi}

	\partial
	\partialt

{F}^\pm\left({r},t\right) &= \pmv{\nabla} x {F}^\pm\left({r},t\right) -

	1
	\sqrt{2\epsilon

} (\,) \\ \cdot ^\left(t \right) & =\frac (\rho)\,.\end

Thus by using the Riemann–Silberstein vector, it is possible to reexpress the Maxwell's equations for a medium with constant

\varepsilon=\varepsilon(r,t)

and

\mu=\mu(r,t)

as a pair of constitutive equations.

Homogeneous medium

In order to obtain a single matrix equation instead of a pair, the following new functions are constructed using the components of the Riemann–Silberstein vector^[5]

\begin{align} \Psi⁺({r},t) &= \left[ \begin{array}{c} -

	+
F
	x

+{\rmi}

	+
F
	y

	+
\\ F
	z

	+
\\ F
	z

	+
\\ F
	x

+{\rmi}

	+
F
	y

\end{array} \right] \Psi^-({r},t)= \left[ \begin{array}{c} -

	-
F
	x

-{\rmi}

	-
F
	y

	-
\\ F
	z

	-
\\ F
	z

	-
\\ F
	x

-{\rmi}

	-
F
	y

\end{array} \right]. \end{align}

The vectors for the sources are

\begin{align} W⁺&= \left(

	1
	\sqrt{2\epsilon

}\right)\left[\begin{array}{c} - J_x + {\rm i} J_y \\ J_z - v \rho \\ J_z + v \rho \\ J_x + {\rm i} J_y \end{array} \right]\, \quadW^=\left(\frac\right)\left[\begin{array}{c} - J_x - {\rm i} J_y \\ J_z - v \rho \\ J_z + v \rho \\ J_x - {\rm i} J_y \end{array} \right]\,.\end

Then,

\begin{align}	\partial
	\partialt

\Psi⁺&= -v \left\{{M} ⋅ {\nabla}\right\}\Psi⁺-W⁺\\

	\partial
	\partialt

\Psi^-&= -v \left\{{M}^* ⋅ {\nabla}\right\}\Psi^--W^-\end{align}

where * denotes complex conjugation and the triplet, is a vector whose component elements are abstract 4×4 matricies given by

M_x= \begin{bmatrix} 0&0&1&0\\ 0&0&0&1\\ 1&0&0&0\\ 0&1&0&0 \end{bmatrix},

M_y= {\rmi}\begin{bmatrix} 0&0&-1&0\\ 0&0&0&-1\\ +1&0&0&0\\ 0&+1&0&0 \end{bmatrix},

M_z= \begin{bmatrix} +1&0&0&0\\ 0&+1&0&0\\ 0&0&-1&0\\ 0&0&0&-1 \end{bmatrix}.

The component M-matrices may be formed using:

\Omega= \begin{bmatrix} {0}&-{I_2}\\ {I_2}&{0} \end{bmatrix} \beta= \begin{bmatrix} {I_2}&{0}\\ {0}&-{I_{2}
\end{bmatrix},}

where

I₂= \begin{bmatrix} 1&0\\ 0&1 \end{bmatrix} ,

from which, get:

M_x=-\beta\Omega, M_y={\rmi}\Omega, M_z=\beta.

Alternately, one may use the matrix Which only differ by a sign. For our purpose it is fine to use either Ω or J. However, they have a different meaning: J is contravariant and Ω is covariant. The matrix Ω corresponds to the Lagrange brackets of classical mechanics and J corresponds to the Poisson brackets.

Note the important relation

\Omega=J^-1.

Each of the four Maxwell's equations are obtained from the matrix representation. This is done by taking the sums and differences of row-I with row-IV and row-II with row-III respectively. The first three give the y, x, and z components of the curl and the last one gives the divergence conditions.

The matrices M are all non-singular and all are Hermitian. Moreover, they satisfy the usual (quaternion-like) algebra of the Dirac matrices, including,

\begin{align} M_xM_z=-M_zM_x\\ M_yM_z=-M_zM_y

	2
\\ \\ M
	x

	2
M
	y

	2
M
	z

=I\\ \\ M_xM_y=-M_yM_x={\rmi}M_z\\ M_yM_z=-M_zM_y={\rmi}M_x\\ M_zM_x=-M_xM_z={\rmi}M_{y.
\end{align}}

The (Ψ^±, M) are not unique. Different choices of Ψ^± would give rise to different M, such that the triplet M continues to satisfy the algebra of the Dirac matrices. The Ψ^± via the Riemann–Silberstein vector has certain advantages over the other possible choices. The Riemann–Silberstein vector is well known in classical electrodynamics and has certain interesting properties and uses.^[6]

In deriving the above 4×4 matrix representation of the Maxwell's equations, the spatial and temporal derivatives of ε(r, t) and μ(r, t) in the first two of the Maxwell's equations have been ignored. The ε and μ have been treated as local constants.

Inhomogeneous medium

In an inhomogeneous medium, the spatial and temporal variations of ε = ε(r, t) and μ = μ(r, t) are not zero. That is they are no longer local constant. Instead of using ε = ε(r, t) and μ = μ(r, t), it is advantageous to use the two derived laboratory functions namely the resistance function and the velocity function

\begin{align} Velocityfunction:v({r},t) &=

	1
	\sqrt{\epsilon({r

,t)\mu({r},t)}}\\ Resistancefunction:h({r},t) &=\sqrt{

	\mu({r
	,

t)}{\epsilon({r},t)}}. \end{align}

In terms of these functions:

\varepsilon=

	1
	vh

, \mu=

	h
	v

. These functions occur in the matrix representation through their logarithmic derivatives;

\begin{align} {u}({r},t) &=

	1
	2v({r

,t)}{\nabla}v({r},t)=

	1
	2

{\nabla}\left\{lnv({r},t)\right\}= -

	1
	2

{\nabla}\left\{lnn({r},t)\right\}\\ {w}({r},t)&=

	1
	2h({r

,t)}{\nabla}h({r},t)=

	1
	2

{\nabla}\left\{lnh({r},t)\right\}\end{align}

where

n({r},t)=

	c
	v({r

,t)}

is the refractive index of the medium.

The following matrices naturally arise in the exact matrix representation of the Maxwell's equation in a medium

\begin{align} {\Sigma}= \left[ \begin{array}{cc} {\sigma}&{0}\\ {0}&{\sigma} \end{array} \right] {\alpha}= \left[ \begin{array}{cc} {0}&{\sigma}\\ {\sigma}&{0} \end{array} \right] {I}= \left[ \begin{array}{cc} {1}&{0}\\ {0}&{1} \end{array} \right]\end{align}

where Σ are the Dirac spin matrices and α are the matrices used in the Dirac equation, and σ is the triplet of the Pauli matrices

{\sigma}=(\sigma_x,\sigma_y,\sigma_z)= \left[ \begin{pmatrix} 0&1\\ 1&0 \end{pmatrix} , \begin{pmatrix} 0&-{\rmi}\\ {\rmi}&0 \end{pmatrix} , \begin{pmatrix} 1&0\\ 0&-1 \end{pmatrix} \right]

Finally, the matrix representation is

\begin{align} &	\partial
	\partialt

\left[ \begin{array}{cc} {I}&{0}\\ {0}&{I} \end{array} \right] \left[ \begin{array}{cc} \Psi⁺\\ \Psi^-\end{array} \right] -

•
v	({r

t)}{2v({r},t)} \left[ \begin{array}{cc} {I}&{0}\\ {0}&{I} \end{array} \right] \left[ \begin{array}{cc} \Psi⁺\\ \Psi^-\end{array} \right] +

•
h	({r

t)}{2h({r},t)} \left[ \begin{array}{cc} {0}&{\rmi}\beta\alpha_y\\ {\rmi}\beta\alpha_y&{0} \end{array} \right] \left[ \begin{array}{cc} \Psi⁺\\ \Psi^-\end{array} \right]\\ &=-v({r},t) \left[ \begin{array}{ccc} \left\{ {M} ⋅ {\nabla} + {\Sigma} ⋅ {u} \right\} && -{\rmi}\beta \left({\Sigma} ⋅ {w}\right) \alpha_y
\\
-{\rmi}\beta \left({\Sigma}^* ⋅

	*
{w}\right) \alpha
	y & \left\{ {M}

⋅ {\nabla} + {\Sigma}^* ⋅ {u} \right\} \end{array} \right] \left[ \begin{array}{cc} \Psi⁺\\ \Psi^-\end{array} \right]-\left[ \begin{array}{cc} {I}&{0}\\ {0}&{I} \end{array} \right] \left[ \begin{array}{c} W⁺\\ W^-\end{array} \right] \end{align}

The above representation contains thirteen 8 × 8 matrices. Ten of these are Hermitian. The exceptional ones are the ones that contain the three components of w(r, t), the logarithmic gradient of the resistance function. These three matrices, for the resistance function are antihermitian.

The Maxwell's equations have been expressed in a matrix form for a medium with varying permittivity ε = ε(r, t) and permeability μ = μ(r, t), in presence of sources. This representation uses a single matrix equation, instead of a pair of matrix equations. In this representation, using 8 × 8 matrices, it has been possible to separate the dependence of the coupling between the upper components (Ψ⁺) and the lower components (Ψ⁻) through the two laboratory functions. Moreover, the exact matrix representation has an algebraic structure very similar to the Dirac equation.^[2] Maxwell's equations can be derived from the Fermat's principle of geometrical optics by the process of "wavization" analogous to the quantization of classical mechanics.^[7]

Applications

One of the early uses of the matrix forms of the Maxwell's equations was to study certain symmetries, and the similarities with the Dirac equation.

The matrix form of the Maxwell's equations is used as a candidate for the Photon Wavefunction.^[8]

Historically, the geometrical optics is based on the Fermat's principle of least time. Geometrical optics can be completely derived from the Maxwell's equations. This is traditionally done using the Helmholtz equation. The derivation of the Helmholtz equation from the Maxwell's equations is an approximation as one neglects the spatial and temporal derivatives of the permittivity and permeability of the medium. A new formalism of light beam optics has been developed, starting with the Maxwell's equations in a matrix form: a single entity containing all the four Maxwell's equations.Such a prescription is sure to provide a deeper understanding of beam-optics and polarization in a unified manner.^[9] The beam-optical Hamiltonian derived from this matrix representation has an algebraic structure very similar to the Dirac equation, making it amenable to the Foldy-Wouthuysen technique.^[10] This approach is very similar to one developed for the quantum theory of charged-particle beam optics.^[11]

References

Others

Bialynicki-Birula, I. (1994). On the wave function of the photon. Acta Physica Polonica A, 86, 97–116.
Bialynicki-Birula, I. (1996a). The Photon Wave Function. In Coherence and Quantum Optics VII. Eberly, J. H., Mandel, L. and Emil Wolf (ed.), Plenum Press, New York, 313.
Bialynicki-Birula, I. (1996b). Photon wave function. in Progress in Optics, Vol. XXXVI, Emil Wolf. (ed.), Elsevier, Amsterdam, 245–294.
Jackson, J. D. (1998). Classical Electrodynamics, Third Edition, John Wiley & Sons.
Jagannathan, R., (1990). Quantum theory of electron lenses based on the Dirac equation. Physical Review A, 42, 6674–6689.
Jagannathan, R. and Khan, S. A. (1996). Quantum theory of the optics of charged particles. In Hawkes Peter, W. (ed.), Advances in Imaging and Electron Physics, Vol. 97, Academic Press, San Diego, pp. 257–358.
Jagannathan, R., Simon, R., Sudarshan, E. C. G. and Mukunda, N. (1989). Quantum theory of magnetic electron lenses based on the Dirac equation. Physics Letters A 134, 457–464.
Khan, S. A. (1997). Quantum Theory of Charged-Particle Beam Optics, Ph.D Thesis, University of Madras, Chennai, India. (complete thesis available from Dspace of IMSc Library, The Institute of Mathematical Sciences, where the doctoral research was done).
Sameen Ahmed Khan. (2002). Maxwell Optics: I. An exact matrix representation of the Maxwell equations in a medium. E-Print: https://arxiv.org/abs/physics/0205083/.
Sameen Ahmed Khan. (2005). An Exact Matrix Representation of Maxwell's Equations. Physica Scripta, 71(5), 440–442.
Sameen Ahmed Khan. (2006a). The Foldy-Wouthuysen Transformation Technique in Optics. Optik-International Journal for Light and Electron Optics. 117(10), pp. 481–488 http://www.elsevier-deutschland.de/ijleo/.
Sameen Ahmed Khan. (2006b). Wavelength-Dependent Effects in Light Optics. in New Topics in Quantum Physics Research, Editors: Volodymyr Krasnoholovets and Frank Columbus, Nova Science Publishers, New York, pp. 163–204. (and).
Sameen Ahmed Khan. (2008). The Foldy-Wouthuysen Transformation Technique in Optics, In Hawkes Peter, W. (ed.), Advances in Imaging and Electron Physics, Vol. 152, Elsevier, Amsterdam, pp. 49–78. (and).
Sameen Ahmed Khan. (2010). Maxwell Optics of Quasiparaxial Beams, Optik-International Journal for Light and Electron Optics, 121(5), 408–416. (http://www.elsevier-deutschland.de/ijleo/).
Laporte, O., and Uhlenbeck, G. E. (1931). Applications of spinor analysis to the Maxwell and Dirac Equations. Physical Review, 37, 1380–1397.
Majorana, E. (1974). (unpublished notes), quoted after Mignani, R., Recami, E., and Baldo, M. About a Diraclike Equation for the Photon, According to Ettore Majorana. Lettere al Nuovo Cimento, 11, 568–572.
Moses, E. (1959).Solutions of Maxwell's equations in terms of a spinor notation: the direct and inverse problems. Physical Review, 113(6), 1670–1679.
Panofsky, W. K. H., and Phillips, M. (1962). Classical Electricity and Magnetics, Addison-Wesley Publishing Company, Reading, Massachusetts, USA.
Pradhan, T. (1987). Maxwell's Equations From Geometrical Optics. IP/BBSR/87-15; Physics Letters A 122(8), 397–398.
Ludwig Silberstein. (1907a). Elektromagnetische Grundgleichungen in bivektorieller Behandlung, Ann. Phys. (Leipzig), 22, 579–586.
Ludwig Silberstein. (1907b). Nachtrag zur Abhandlung ber Elektromagnetische Grundgleichungen in bivektorieller Behandlung. Ann. Phys. (Leipzig), 24, 783–784.

Notes and References

(Bialynicki-Birula, 1994, 1996a, 1996b)
(Khan, 2002, 2005)
(Jackson, 1998; Panofsky and Phillips, 1962)
Silberstein (1907a, 1907b)
Khan (2002, 2005)
Bialynicki-Birula (1996b)
(Pradhan, 1987)
(Bialynicki-Birula, 1996b)
(Khan, 2006b, 2010)
(Khan, 2006a, 2008)
(Jagannathan et al., 1989, Jagannathan, 1990, Jagannathan and Khan 1996, Khan, 1997)