Pauli matrices explained

In mathematical physics and mathematics, the Pauli matrices are a set of three complex matrices that are traceless, Hermitian, involutory and unitary. Usually indicated by the Greek letter sigma, they are occasionally denoted by tau when used in connection with isospin symmetries.\begin \sigma_1 = \sigma_x &= \begin 0&1\\ 1&0 \end, \\ \sigma_2 = \sigma_y &= \begin 0& -i \\ i&0 \end, \\ \sigma_3 = \sigma_z &= \begin 1&0\\ 0&-1 \end. \\\end

These matrices are named after the physicist Wolfgang Pauli. In quantum mechanics, they occur in the Pauli equation, which takes into account the interaction of the spin of a particle with an external electromagnetic field. They also represent the interaction states of two polarization filters for horizontal/vertical polarization, 45 degree polarization (right/left), and circular polarization (right/left).

Each Pauli matrix is Hermitian, and together with the identity matrix (sometimes considered as the zeroth Pauli matrix), the Pauli matrices form a basis for the real vector space of Hermitian matrices. This means that any Hermitian matrix can be written in a unique way as a linear combination of Pauli matrices, with all coefficients being real numbers.

R3.

The Pauli matrices (after multiplication by to make them anti-Hermitian) also generate transformations in the sense of Lie algebras: the matrices form a basis for the real Lie algebra

ak{su}(2)

, which exponentiates to the special unitary group SU(2). The algebra generated by the three matrices is isomorphic to the Clifford algebra of

R3,

[1] and the (unital) associative algebra generated by functions identically (is isomorphic) to that of quaternions (

H

).

Algebraic properties

\sigmax

! style=width:3em

\sigmay

style=width:3em

\sigmaz

\sigmax

I

i\sigmaz

-i\sigmay

\sigmay

-i\sigmaz

I

i\sigmax

\sigmaz

i\sigmay

-i\sigmax

I

All three of the Pauli matrices can be compacted into a single expression:

\sigmaj=\begin{pmatrix} \deltaj3&\deltaj1-i\deltaj2\\ \deltaj1+i\deltaj2&-\deltaj3\end{pmatrix},

where the solution to is the "imaginary unit", and is the Kronecker delta, which equals if and 0 otherwise. This expression is useful for "selecting" any one of the matrices numerically by substituting values of in turn useful when any of the matrices (but no particular one) is to be used in algebraic manipulations.

The matrices are involutory:

2
\sigma
1

=

2
\sigma
2

=

2
\sigma
3

=-i\sigma1\sigma2\sigma3=\begin{pmatrix}1&0\ 0&1\end{pmatrix}=I,

where is the identity matrix.

The determinants and traces of the Pauli matrices are

\begin{align} \det\sigmaj&=-1,\\ \operatorname{tr}\sigmaj&=0, \end{align}

from which we can deduce that each matrix has eigenvalues +1 and −1.

l{H}2

of Hermitian matrices over

R

, and the Hilbert space

l{M}2,2(C)

of all complex matrices over

C

.

Commutation and anti-commutation relations

Commutation relations

The Pauli matrices obey the following commutation relations:

[\sigmaj,\sigmak]=2i\varepsilonj\sigmal,

where the structure constant is the Levi-Civita symbol, and Einstein summation notation is used.

These commutation relations make the Pauli matrices the generators of a representation of the Lie algebra

(R3, x )\congak{su}(2)\congak{so}(3).

Anticommutation relations

They also satisfy the anticommutation relations:

\{\sigmaj,\sigmak\}=2\deltajI,

where

\{\sigmaj,\sigmak\}

is defined as

\sigmaj\sigmak+\sigmak\sigmaj,

and is the Kronecker delta. denotes the identity matrix.

These anti-commutation relations make the Pauli matrices the generators of a representation of the Clifford algebra for

R3,

denoted

Cl3(R).

The usual construction of generators

\sigmajk=\tfrac{1}{4}[\sigmaj,\sigmak]

of

ak{so}(3)

using the Clifford algebra recovers the commutation relations above, up to unimportant numerical factors.

A few explicit commutators and anti-commutators are given below as examples:

Eigenvectors and eigenvalues

Each of the (Hermitian) Pauli matrices has two eigenvalues: and . The corresponding normalized eigenvectors are

\begin{align} \psix+&=

1
\sqrt{2}

\begin{bmatrix}1\1\end{bmatrix},& \psix-&=

1
\sqrt{2}

\begin{bmatrix}1\ -1\end{bmatrix},\\ \psiy+&=

1
\sqrt{2}

\begin{bmatrix}1\i\end{bmatrix},& \psiy-&=

1
\sqrt{2}

\begin{bmatrix}1\ -i\end{bmatrix},\\ \psiz+&=\begin{bmatrix}1\0\end{bmatrix},& \psiz-&=\begin{bmatrix}0\1\end{bmatrix}. \end{align}

Pauli vectors

The Pauli vector is defined by\vec = \sigma_1 \hat_1 + \sigma_2 \hat_2 + \sigma_3 \hat_3,where

\hat{x}1

,

\hat{x}2

, and

\hat{x}3

are an equivalent notation for the more familiar

\hat{x}

,

\hat{y}

, and

\hat{z}

.

The Pauli vector provides a mapping mechanism from a vector basis to a Pauli matrix basis[2] as follows:\begin \vec \cdot \vec &= (a_k\, \hat_k) \cdot (\sigma_\ell\, \hat_\ell) = a_k\, \sigma_\ell\, \hat_k \cdot \hat_\ell \\ &= a_k\, \sigma_\ell\, \delta_ = a_k\, \sigma_k \\ &= a_1 \begin 0 & 1 \\ 1 & 0 \end + a_2 \begin 0 & -i \\ i & 0 \end + a_3 \begin 1 & 0 \\ 0 & -1 \end = \begin a_3 & a_1 - i a_2 \\ a_1 + i a_2 & -a_3 \end,\end using Einstein's summation convention.

More formally, this defines a map from

R3

to the vector space of traceless Hermitian

2 x 2

matrices. This map encodes structures of

R3

as a normed vector space and as a Lie algebra (with the cross-product as its Lie bracket) via functions of matrices, making the map an isomorphism of Lie algebras. This makes the Pauli matrices intertwiners from the point of view of representation theory.

Another way to view the Pauli vector is as a

2 x 2

Hermitian traceless matrix-valued dual vector, that is, an element of

Mat2 x (C)(R3)*

that maps

\veca\mapsto\veca\vec\sigma.

Completeness relation

Each component of

\veca

can be recovered from the matrix (see completeness relation below) \frac \operatorname \Bigl(\bigl(\vec \cdot \vec \bigr) \vec \Bigr) = \vec.This constitutes an inverse to the map

\veca\mapsto\veca\vec\sigma

, making it manifest that the map is a bijection.

Determinant

The norm is given by the determinant (up to a minus sign)\det \bigl(\vec \cdot \vec \bigr) = -\vec \cdot \vec = -|\vec|^2.Then, considering the conjugation action of an

SU(2)

matrix

U

on this space of matrices,

U*\veca\vec\sigma:=U\veca\vec\sigmaU-1,

we find

\det(U*\veca\vec\sigma)=\det(\veca\vec\sigma),

and that

U*\veca\vec\sigma

is Hermitian and traceless. It then makes sense to define

U*\veca\vec\sigma=\veca'\vec\sigma,

where

\veca'

has the same norm as

\veca,

and therefore interpret

U

as a rotation of three-dimensional space. In fact, it turns out that the special restriction on

U

implies that the rotation is orientation preserving. This allows the definition of a map

R:SU(2)\toSO(3)

given by

U*\veca\vec\sigma=\veca'\vec\sigma=:(R(U)\veca)\vec\sigma,

where

R(U)\inSO(3).

This map is the concrete realization of the double cover of

SO(3)

by

SU(2),

and therefore shows that

SU(2)\congSpin(3).

The components of

R(U)

can be recovered using the tracing process above:

R(U)ij=

1
2

\operatorname{tr}\left(\sigmaiU\sigmajU-1\right).

Cross-product

The cross-product is given by the matrix commutator (up to a factor of

2i

) [\vec a \cdot \vec \sigma, \vec b \cdot \vec \sigma] = 2i\,(\vec a \times \vec b) \cdot \vec \sigma.In fact, the existence of a norm follows from the fact that

R3

is a Lie algebra (see Killing form).

This cross-product can be used to prove the orientation-preserving property of the map above.

Eigenvalues and eigenvectors

The eigenvalues of

\veca\vec\sigma

are

\pm|\vec{a}|.

This follows immediately from tracelessness and explicitly computing the determinant.

More abstractly, without computing the determinant, which requires explicit properties of the Pauli matrices, this follows from

(\veca\vec\sigma)2-|\veca|2=0 ,

since this can be factorised into

(\veca\vec\sigma-|\veca|)(\veca\vec\sigma+|\veca|)=0.

A standard result in linear algebra (a linear map that satisfies a polynomial equation written in distinct linear factors is diagonal) means this implies

\veca\vec\sigma

is diagonal with possible eigenvalues

\pm|\veca|.

The tracelessness of

\veca\vec\sigma

means it has exactly one of each eigenvalue.

Its normalized eigenvectors are \psi_+ = \frac \begin a_3 + \left|\vec\right| \\ a_1 + ia_2 \end; \qquad \psi_- = \frac \begin ia_2 - a_1 \\ a_3 + |\vec| \end ~ .These expressions become singular for

a3\to-\left|\vec{a}\right|

. They can be rescued by letting

\vec{a}=\left|\vec{a}\right|(\epsilon,0,-(1-\epsilon2/2))

and taking the limit

\epsilon\to0

, which yields the correct eigenvectors (0,1) and (1,0) of

\sigmaz

.

Alternatively, one may use spherical coordinates

\vec{a}=a(\sin\vartheta\cos\varphi,\sin\vartheta\sin\varphi,\cos\vartheta)

to obtain the eigenvectors

\psi+=(\cos(\vartheta/2),\sin(\vartheta/2)\exp(i\varphi))

and

\psi-=(-\sin(\vartheta/2)\exp(-i\varphi),\cos(\vartheta/2))

.

Pauli 4-vector

The Pauli 4-vector, used in spinor theory, is written

\sigma\mu

with components

\sigma\mu=(I,\vec\sigma).

This defines a map from

R1,3

to the vector space of Hermitian matrices,

x\mu\mapsto

\mu,
x
\mu\sigma
which also encodes the Minkowski metric (with mostly minus convention) in its determinant:

\det

\mu)
(x
\mu\sigma

=η(x,x).

This 4-vector also has a completeness relation. It is convenient to define a second Pauli 4-vector

\bar\sigma\mu=(I,-\vec\sigma).

and allow raising and lowering using the Minkowski metric tensor. The relation can then be writtenx_\nu = \tfrac \operatorname \Bigl(\bar\sigma_\nu\bigl(x_\mu \sigma^\mu \bigr) \Bigr) .

Similarly to the Pauli 3-vector case, we can find a matrix group that acts as isometries on

R1,3;

in this case the matrix group is

SL(2,C),

and this shows

SL(2,C)\congSpin(1,3).

Similarly to above, this can be explicitly realized for

S\inSL(2,C)

with components
\mu{}
Λ(S)
\nu

=\tfrac{1}{2}\operatorname{tr}\left(\bar\sigma\nuS\sigma\muS\dagger\right).

In fact, the determinant property follows abstractly from trace properties of the

\sigma\mu.

For

 2 x 2 

matrices, the following identity holds:

\det(A+B)=\det(A)+\det(B)+\operatorname{tr}(A)\operatorname{tr}(B)-\operatorname{tr}(AB).

That is, the 'cross-terms' can be written as traces. When

A,B

are chosen to be different

\sigma\mu,

the cross-terms vanish. It then follows, now showing summation explicitly,\det\left(\sum_\mu x_\mu \sigma^\mu\right) = \sum_\mu \det\left(x_\mu\sigma^\mu\right). Since the matrices are

 2 x 2 ,

this is equal to \sum_\mu x_\mu^2 \det(\sigma^\mu) = \eta(x,x).

Relation to dot and cross product

Pauli vectors elegantly map these commutation and anticommutation relations to corresponding vector products. Adding the commutator to the anticommutator gives

\begin{align}\left[\sigmaj,\sigmak\right]+\{\sigmaj,\sigmak\}&=(\sigmaj\sigmak-\sigmak\sigmaj)+(\sigmaj\sigmak+\sigmak\sigmaj)\\ 2i\varepsilonj\sigma\ell+2\deltajI&=2\sigmaj\sigmak\end{align}

so that,

Contracting each side of the equation with components of two -vectors and (which commute with the Pauli matrices, i.e., for each matrix and vector component (and likewise with) yields

~~\begin{align}ajbk\sigmaj\sigmak&=ajbk\left(i\varepsilonjk\ell\sigma\ell+\deltajkI\right)\\ aj\sigmajbk\sigmak&=i\varepsilonjk\ellajbk\sigma\ell+ajbk\deltajkI\end{align}.~

Finally, translating the index notation for the dot product and cross product results in

If is identified with the pseudoscalar then the right hand side becomes

ab+a\wedgeb

, which is also the definition for the product of two vectors in geometric algebra.

If we define the spin operator as, then satisfies the commutation relation:\mathbf \times \mathbf = i\hbar \mathbfOr equivalently, the Pauli vector satisfies:\frac \times \frac = i\frac

Some trace relations

The following traces can be derived using the commutation and anticommutation relations.

\begin{align} \operatorname{tr}\left(\sigmaj\right)&=0\\ \operatorname{tr}\left(\sigmaj\sigmak\right)&=2\deltajk\\ \operatorname{tr}\left(\sigmaj\sigmak\sigma\ell\right)&=2i\varepsilonjk\ell\\ \operatorname{tr}\left(\sigmaj\sigmak\sigma\ell\sigmam\right)&=2\left(\deltajk\delta\ell-\deltaj\ell\deltakm+\deltajm\deltak\ell\right) \end{align}

If the matrix is also considered, these relationships become

\begin \operatorname\left(\sigma_\alpha \right) &= 2\delta_ \\ \operatorname\left(\sigma_\alpha \sigma_\beta \right) &= 2\delta_ \\ \operatorname\left(\sigma_\alpha \sigma_\beta \sigma_\gamma \right) &= 2 \sum_ \delta_ \delta_ - 4 \delta_ \delta_ \delta_ + 2i\varepsilon_ \\ \operatorname\left(\sigma_\alpha \sigma_\beta \sigma_\gamma \sigma_\mu \right) &= 2\left(\delta_\delta_ - \delta_\delta_ + \delta_\delta_\right) + 4\left(\delta_ \delta_ \delta_ + \delta_ \delta_ \delta_\right) - 8 \delta_ \delta_ \delta_ \delta_ + 2 i \sum_ \varepsilon_ \delta_\end

where Greek indices and assume values from and the notation \sum_ is used to denote the sum over the cyclic permutation of the included indices.

Exponential of a Pauli vector

For

\vec{a}=a\hat{n},|\hat{n}|=1,

one has, for even powers,

(\hat{n}\vec{\sigma})2p=I,

which can be shown first for the case using the anticommutation relations. For convenience, the case is taken to be by convention.

For odd powers,

\left(\hat{n}\vec{\sigma}\right)2q+1=\hat{n}\vec{\sigma}.

Matrix exponentiating, and using the Taylor series for sine and cosine,

\begin{align} ei\vec{\sigma}\right)} &=

infty{ik\left[a\left(\hat{n
\sum
k=0

\vec{\sigma}\right)\right]k}{k!}}\\ &=

infty{(-1)p(a\hat{n
\sum
p=0

\vec{\sigma})2p

}} + i\sum_^\infty \\ &= I\sum_^\infty + i (\hat\cdot \vec) \sum_^\infty\\\end.

In the last line, the first sum is the cosine, while the second sum is the sine; so, finally,

which is analogous to Euler's formula, extended to quaternions.

Note that

\det[ia(\hat{n}\vec{\sigma})]=a2

,

while the determinant of the exponential itself is just, which makes it the generic group element of SU(2).

A more abstract version of formula for a general matrix can be found in the article on matrix exponentials. A general version of for an analytic (at a and −a) function is provided by application of Sylvester's formula,[3]

f(a(\hat{n}\vec{\sigma}))=I

f(a)+f(-a)
2

+\hat{n}\vec{\sigma}

f(a)-f(-a)
2

.

The group composition law of

A straightforward application of formula provides a parameterization of the composition law of the group . One may directly solve for in \begin e^ e^ &= I\left(\cos a \cos b - \hat \cdot \hat \sin a \sin b\right) + i\left(\hat \sin a \cos b + \hat \sin b \cos a - \hat \times \hat ~ \sin a \sin b \right) \cdot \vec \\ &= I\cos + i \left(\hat \cdot \vec\right) \sin c \\ &= e^,\end

which specifies the generic group multiplication, where, manifestly, \cos c = \cos a \cos b - \hat \cdot \hat \sin a \sin b~,the spherical law of cosines. Given, then, \hat = \frac\left(\hat \sin a \cos b + \hat \sin b \cos a - \hat\times\hat \sin a \sin b\right).

Consequently, the composite rotation parameters in this group element (a closed form of the respective BCH expansion in this case) simply amount to[4]

e^ = \exp \left(i\frac \left(\hat \sin a \cos b + \hat \sin b \cos a - \hat\times\hat ~ \sin a \sin b\right) \cdot \vec\right).

(Of course, when

\hat{n}

is parallel to

\hat{m}

, so is

\hat{k}

, and .)

Adjoint action

It is also straightforward to likewise work out the adjoint action on the Pauli vector, namely rotation of any angle

a

along any axis

\hatn

: R_n(-a) ~ \vec ~ R_n(a) = e^ ~ \vec ~ e^ = \vec\cos (a) + \hat \times \vec ~ \sin(a) + \hat ~ \hat \cdot \vec ~ (1 - \cos(a)) ~ .

Taking the dot product of any unit vector with the above formula generates the expression of any single qubit operator under any rotation. For example, it can be shown that R_y\mathord\left(-\frac\right)\, \sigma_x\, R_y\mathord\left(\frac\right) = \hat \cdot \left(\hat \times \vec\right) = \sigma_z.

See also: Rodrigues' rotation formula.

Completeness relation

An alternative notation that is commonly used for the Pauli matrices is to write the vector index in the superscript, and the matrix indices as subscripts, so that the element in row and column of the -th Pauli matrix is

In this notation, the completeness relation for the Pauli matrices can be written

\vec{\sigma}\alpha\beta\vec{\sigma}\gamma\delta\equiv

3
\sum
k=1
k
\sigma
\alpha\beta
k
\sigma
\gamma\delta

=2\delta\alpha\delta\delta\beta\gamma-\delta\alpha\beta\delta\gamma\delta.

As noted above, it is common to denote the 2 × 2 unit matrix by so The completeness relation can alternatively be expressed as\sum_^3 \sigma^k_\,\sigma^k_ = 2\,\delta_\,\delta_ ~ .

The fact that any Hermitian complex 2 × 2 matrices can be expressed in terms of the identity matrix and the Pauli matrices also leads to the Bloch sphere representation of 2 × 2 mixed states’ density matrix, (positive semidefinite 2 × 2 matrices with unit trace. This can be seen by first expressing an arbitrary Hermitian matrix as a real linear combination of

Notes and References

  1. Gull, S. F. . Lasenby, A. N. . Doran, C. J. L. . January 1993 . Imaginary numbers are not Real – the geometric algebra of spacetime . geometry.mrao.cam.ac.uk . Found. Phys. . 23 . 9 . 1175–1201 . 10.1007/BF01883676 . 1993FoPh...23.1175G . 14670523 . 2023-05-05 . dmy-all.
  2. See the spinor map.
  3. Book: Quantum Computation and Quantum Information . Nielsen . Michael A. . Michael Nielsen . Chuang . Isaac L. . Isaac Chuang . 2000 . Cambridge University Press . Cambridge, UK . 978-0-521-63235-5 . 43641333.
  4. Book: Gibbs, J.W. . 1884 . Elements of Vector Analysis . New Haven, CT . 67. In fact, however, the formula goes back to Olinde Rodrigues (1840), replete with half-angle: Olinde . Rodrigues . Olinde Rodrigues . 1840 . Des lois géometriques qui regissent les déplacements d' un systéme solide dans l' espace, et de la variation des coordonnées provenant de ces déplacement considérées indépendant des causes qui peuvent les produire . . 5 . 380–440 .