Representation theory of SU(2) explained

In the study of the representation theory of Lie groups, the study of representations of SU(2) is fundamental to the study of representations of semisimple Lie groups. It is the first case of a Lie group that is both a compact group and a non-abelian group. The first condition implies the representation theory is discrete: representations are direct sums of a collection of basic irreducible representations (governed by the Peter–Weyl theorem). The second means that there will be irreducible representations in dimensions greater than 1.

SU(2) is the universal covering group of SO(3), and so its representation theory includes that of the latter, by dint of a surjective homomorphism to it. This underlies the significance of SU(2) for the description of non-relativistic spin in theoretical physics; see below for other physical and historical context.

As shown below, the finite-dimensional irreducible representations of SU(2) are indexed by a non-negative integer

and have dimension

m+1

. In the physics literature, the representations are labeled by the quantity

l=m/2

, where

is then either an integer or a half-integer, and the dimension is

2l+1

Lie algebra representations

The representations of the group are found by considering representations of

ak{su}(2)

, the Lie algebra of SU(2). Since the group SU(2) is simply connected, every representation of its Lie algebra can be integrated to a group representation;^[1] we will give an explicit construction of the representations at the group level below.^[2]

Real and complexified Lie algebras

The real Lie algebra

ak{su}(2)

has a basis given by

u₁=\begin{bmatrix} 0&i\\ i&0 \end{bmatrix}, u₂=\begin{bmatrix} 0&-1\\ 1&~~0 \end{bmatrix}, u₃=\begin{bmatrix} i&~~0\\ 0&-i \end{bmatrix}~,

(These basis matrices are related to the Pauli matrices by

u₁=+i \sigma₁ ,u₂=-i \sigma₂ ,

and

u₃=+i \sigma₃~.

)

The matrices are a representation of the quaternions:

u_1u₁=-I,~~ u_2u₂=-I,~~ u_3u₃=-I,

u_1u₂=+u₃, u_2u₃=+u₁, u_3u₁=+u₂,

u_2u₁=-u₃, u_3u₂=-u₁, u_1u₃=-u₂~.

where is the conventional 2×2 identity matrix:

~~I=\begin{bmatrix} 1&0\\ 0&1 \end{bmatrix}~.

Consequently, the commutator brackets of the matrices satisfy

[u_1,u_2]=2u₃, [u_2,u_3]=2u₁, [u_3,u_1]=2u₂~.

It is then convenient to pass to the complexified Lie algebra

su(2)+isu(2)=sl(2;C)~.

(Skew self-adjoint matrices with trace zero plus self-adjoint matrices with trace zero gives all matrices with trace zero.) As long as we are working with representations over

this passage from real to complexified Lie algebra is harmless.^[3] The reason for passing to the complexification is that it allows us to construct a nice basis of a type that does not exist in the real Lie algebra

ak{su}(2)

The complexified Lie algebra is spanned by three elements

, and

, given by

	1
	i

u_3, X=

	1
	2i

\left(u₁-iu_2\right), Y=

	1
	2i

(u₁+iu₂₎~;

or, explicitly,

H=\begin{bmatrix} 1&~~0\\ 0&-1 \end{bmatrix}, X=\begin{bmatrix} 0&1\\ 0&0 \end{bmatrix}, Y=\begin{bmatrix} 0&0\\ 1&0 \end{bmatrix} ~.

The non-trivial/non-identical part of the group's multiplication table is

HX~=~~~~X, HY~=-Y, XY~=~\tfrac{1}{2}\left(I+H\right),

XH~=-X, YH~=~~~~Y, YX~=~\tfrac{1}{2}\left(I-H\right),

HH~=~~~I~, XX~=~~~~O, YY~=~~O,

where is the 2×2 all-zero matrix.Hence their commutation relations are

[H,X]=2X, [H,Y]=-2Y, [X,Y]=H.

Up to a factor of 2, the elements

and

may be identified with the angular momentum operators

J_z

J₊

, and

J_-

, respectively. The factor of 2 is a discrepancy between conventions in math and physics; we will attempt to mention both conventions in the results that follow.

Weights and the structure of the representation

In this setting, the eigenvalues for

are referred to as the weights of the representation. The following elementary result^[4] is a key step in the analysis. Suppose that

is an eigenvector for

with eigenvalue

\alpha

; that is, that

Hv=\alphav.

Then

\begin{alignat}{5} H(Xv)&=(XH+[H,X])v&&=(\alpha+2)Xv,\\[3pt] H(Yv)&=(YH+[H,Y])v&&=(\alpha-2)Yv. \end{alignat}

In other words,

is either the zero vector or an eigenvector for

with eigenvalue

\alpha+2

and

is either zero or an eigenvector for

with eigenvalue

\alpha-2.

Thus, the operator

acts as a raising operator, increasing the weight by 2, while

acts as a lowering operator.

Suppose now that

is an irreducible, finite-dimensional representation of the complexified Lie algebra. Then

can have only finitely many eigenvalues. In particular, there must be some final eigenvalue

λ\inC

with the property that

λ+2

is not an eigenvalue. Let

v₀

be an eigenvector for

with that eigenvalue

λ:

Hv₀=λv_0,

then we must have

Xv₀=0,

or else the above identity would tell us that

Xv₀

is an eigenvector with eigenvalue

λ+2.

Now define a "chain" of vectors

v_0,v_1,\ldots

v_k=Y^kv₀

A simple argument by induction^[5] then shows that

Xv_k=k(λ-(k-1))v_k-1

for all

k=1,2,\ldots.

Now, if

v_k

is not the zero vector, it is an eigenvector for

with eigenvalue

λ-2k.

Since, again,

has only finitely many eigenvectors, we conclude that

v_\ell

must be zero for some

\ell

(and then

v_k=0

for all

k>\ell

Let

v_m

be the last nonzero vector in the chain; that is,

v_m ≠ 0

but

v_m+1=0.

Then of course

Xv_m+1=0

and by the above identity with

k=m+1,

we have

0=Xv_m+1=(m+1)(λ-m)v_m.

Since

m+1

is at least one and

v_m ≠ 0,

we conclude that

must be equal to the non-negative integer

We thus obtain a chain of

m+1

vectors,

v_0,v_1,\ldots,v_m,

such that

acts as

Yv_m=0, Yv_k=v_k+1 (k<m)

and

acts as

Xv₀=0, Xv_k=k(m-(k-1))v_k-1 (k\ge1)

and

acts as

Hv_k=(m-2k)v_k.

(We have replaced

with its currently known value of

in the formulas above.)

Since the vectors

v_k

are eigenvectors for

with distinct eigenvalues, they must be linearly independent. Furthermore, the span of

v_0,\ldots,v_m

is clearly invariant under the action of the complexified Lie algebra. Since

is assumed irreducible, this span must be all of

We thus obtain a complete description of what an irreducible representation must look like; that is, a basis for the space and a complete description of how the generators of the Lie algebra act. Conversely, for any

m\geq0

we can construct a representation by simply using the above formulas and checking that the commutation relations hold. This representation can then be shown to be irreducible.^[6]

Conclusion: For each non-negative integer

there is a unique irreducible representation with highest weight

Each irreducible representation is equivalent to one of these. The representation with highest weight

has dimension

m+1

with weights

m,m-2,\ldots,-(m-2),-m,

each having multiplicity one.

The Casimir element

We now introduce the (quadratic) Casimir element,

given by

	2
-\left(u
	1

	2
u
	2

	2\right)
u
	3

We can view

as an element of the universal enveloping algebra or as an operator in each irreducible representation. Viewing

as an operator on the representation with highest weight

, we may easily compute that

commutes with each

u_i.

Thus, by Schur's lemma,

acts as a scalar multiple

c_m

of the identity for each

We can write

in terms of the

\{H,X,Y\}

basis as follows:

C=(X+Y)²-(-X+Y)²+H²,

which can be reduced to

C=4YX+H²+2H.

The eigenvalue of

in the representation with highest weight

can be computed by applying

to the highest weight vector, which is annihilated by

thus, we get

c_m=m²+2m=m(m+2).

In the physics literature, the Casimir is normalized as $C' = \fracC .$ Labeling things in terms of $\ell = \fracm,$ the eigenvalue

d_\ell

is then computed as

d_\ell=

	1
	4

(2\ell)(2\ell+2)=\ell(\ell+1).

The group representations

Action on polynomials

Since SU(2) is simply connected, a general result shows that every representation of its (complexified) Lie algebra gives rise to a representation of SU(2) itself. It is desirable, however, to give an explicit realization of the representations at the group level. The group representations can be realized on spaces of polynomials in two complex variables.^[7] That is, for each non-negative integer

, we let

V_m

denote the space of homogeneous polynomials

of degree

in two complex variables. Then the dimension of

V_m

m+1

. There is a natural action of SU(2) on each

V_m

, given by

[U ⋅ p](z)=p\left(U^-1z\right), z\inC^2,U\inSU(2)

The associated Lie algebra representation is simply the one described in the previous section. (See here for an explicit formula for the action of the Lie algebra on the space of polynomials.)

The characters

The character of a representation

\Pi:G → \operatorname{GL}(V)

is the function

\Chi:G → C

given by

\Chi(g)=\operatorname{trace}(\Pi(g))

.Characters plays an important role in the representation theory of compact groups. The character is easily seen to be a class function, that is, invariant under conjugation.

consisting of the diagonal matrices in SU(2), since the elements are orthogonally diagonalizable with the spectral theorem.^[8] Since the irreducible representation with highest weight

has weights

m,m-2,\ldots,-(m-2),-m

, it is easy to see that the associated character satisfies

\Chi\left(\begin{pmatrix} e^i\theta&0\\ 0&e^-i\theta\end{pmatrix}\right)=e^im\theta+e^i(m-2)\theta+ … +e^{-i(m-2)\theta}+e^-im\theta.

This expression is a finite geometric series that can be simplified to

\Chi\left(\begin{pmatrix} e^i\theta&0\\ 0&e^-i\theta\end{pmatrix}\right)=

	\sin((m+1)\theta)
	\sin(\theta)

This last expression is just the statement of the Weyl character formula for the SU(2) case.^[9]

Actually, following Weyl's original analysis of the representation theory of compact groups, one can classify the representations entirely from the group perspective, without using Lie algebra representations at all. In this approach, the Weyl character formula plays an essential part in the classification, along with the Peter–Weyl theorem. The SU(2) case of this story is described here.

Relation to the representations of SO(3)

See also: Projective representation. Note that either all of the weights of the representation are even (if

is even) or all of the weights are odd (if

is odd). In physical terms, this distinction is important: The representations with even weights correspond to ordinary representations of the rotation group SO(3).^[10] By contrast, the representations with odd weights correspond to double-valued (spinorial) representation of SO(3), also known as projective representations.

In the physics conventions,

being even corresponds to

being an integer while

being odd corresponds to

being a half-integer. These two cases are described as integer spin and half-integer spin, respectively. The representations with odd, positive values of

are faithful representations of SU(2), while the representations of SU(2) with non-negative, even

are not faithful.^[11]

Another approach

See under the example for Borel–Weil–Bott theorem.

Most important irreducible representations and their applications

Representations of SU(2) describe non-relativistic spin, due to being a double covering of the rotation group of Euclidean 3-space. Relativistic spin is described by the representation theory of SL₂(C), a supergroup of SU(2), which in a similar way covers SO⁺(1;3), the relativistic version of the rotation group. SU(2) symmetry also supports concepts of isobaric spin and weak isospin, collectively known as isospin.

The representation with

m=1

(i.e.,

l=1/2

in the physics convention) is the 2 representation, the fundamental representation of SU(2). When an element of SU(2) is written as a complex matrix, it is simply a multiplication of column 2-vectors. It is known in physics as the spin-1/2 and, historically, as the multiplication of quaternions (more precisely, multiplication by a unit quaternion). This representation can also be viewed as a double-valued projective representation of the rotation group SO(3).

The representation with

m=2

(i.e.,

l=1

) is the 3 representation, the adjoint representation. It describes 3-d rotations, the standard representation of SO(3), so real numbers are sufficient for it. Physicists use it for the description of massive spin-1 particles, such as vector mesons, but its importance for spin theory is much higher because it anchors spin states to the geometry of the physical 3-space. This representation emerged simultaneously with the 2 when William Rowan Hamilton introduced versors, his term for elements of SU(2). Note that Hamilton did not use standard group theory terminology since his work preceded Lie group developments.

The

m=3

(i.e.

l=3/2

) representation is used in particle physics for certain baryons, such as the Δ.

References

- Gerard 't Hooft (2007), Lie groups in Physics, Chapter 5 "Ladder operators"

Notes and References

Theorem 5.6
, Section 4.6
, Section 3.6
Lemma 4.33
, Equation (4.15)
, proof of Proposition 4.11
Section 4.2
Travis Willse (https://math.stackexchange.com/users/155629/travis-willse), Conjugacy classes in $SU_2$, URL (version: 2021-01-10): https://math.stackexchange.com/q/967927
Example 12.23
Section 4.7
Book: Ma, Zhong-Qi. Group Theory for Physicists. 2007-11-28. World Scientific Publishing Company. 9789813101487. 120. en.