Adjoint representation explained

In mathematics, the adjoint representation (or adjoint action) of a Lie group G is a way of representing the elements of the group as linear transformations of the group's Lie algebra, considered as a vector space. For example, if G is

GL(n,R)

, the Lie group of real n-by-n invertible matrices, then the adjoint representation is the group homomorphism that sends an invertible n-by-n matrix

g

to an endomorphism of the vector space of all linear transformations of

Rn

defined by:

x\mapstogxg-1

.

For any Lie group, this natural representation is obtained by linearizing (i.e. taking the differential of) the action of G on itself by conjugation. The adjoint representation can be defined for linear algebraic groups over arbitrary fields.

Definition

See also: Representation theory. Let G be a Lie group, and let

\Psi:G\to\operatorname{Aut}(G)

be the mapping, with Aut(G) the automorphism group of G and given by the inner automorphism (conjugation)

\Psig(h)=ghg-1~.

This Ψ is a Lie group homomorphism.

For each g in G, define to be the derivative of at the origin:

\operatorname{Ad}g=(d\Psig)e:TeGTeG

where is the differential and

ak{g}=TeG

is the tangent space at the origin (being the identity element of the group). Since

\Psig

is a Lie group automorphism, Adg is a Lie algebra automorphism; i.e., an invertible linear transformation of

akg

to itself that preserves the Lie bracket. Moreover, since

g\mapsto\Psig

is a group homomorphism,

g\mapsto\operatorname{Ad}g

too is a group homomorphism.[1] Hence, the map

Ad\colonG\toAut(akg),g\mapstoAdg

is a group representation called the adjoint representation of G.

If G is an immersed Lie subgroup of the general linear group

GLn(C)

(called immersely linear Lie group), then the Lie algebra

ak{g}

consists of matrices and the exponential map is the matrix exponential

\operatorname{exp}(X)=eX

for matrices X with small operator norms. We will compute the derivative of

\Psig

at

e

. For g in G and small X in

ak{g}

, the curve

t\to\exp(tX)

has derivative

X

at t = 0, one then gets:

\operatorname{Ad}g(X)=(d\Psig)e(X)=(\Psig\circ\exp(tX))'(0)=(g\exp(tX)g-1)'(0)=gXg-1

where on the right we have the products of matrices. If

G\subsetGLn(C)

is a closed subgroup (that is, G is a matrix Lie group), then this formula is valid for all g in G and all X in

akg

.

Succinctly, an adjoint representation is an isotropy representation associated to the conjugation action of G around the identity element of G.

Derivative of Ad

One may always pass from a representation of a Lie group G to a representation of its Lie algebra by taking the derivative at the identity.

Taking the derivative of the adjoint map

Ad:G\toAut(akg)

at the identity element gives the adjoint representation of the Lie algebra

akg=\operatorname{Lie}(G)

of G:

\begin{align} ad:&akg\toDer(akg)\\ &x\mapsto\operatorname{ad}x=d(\operatorname{Ad})e(x) \end{align}

where

Der(akg)=\operatorname{Lie}(\operatorname{Aut}(ak{g}))

is the Lie algebra of

Aut(akg)

which may be identified with the derivation algebra of

akg

. One can show that

adx(y)=[x,y]

for all

x,y\inakg

, where the right hand side is given (induced) by the Lie bracket of vector fields. Indeed, recall that, viewing

ak{g}

as the Lie algebra of left-invariant vector fields on G, the bracket on

akg

is given as: for left-invariant vector fields X, Y,

[X,Y]=\limt{1\overt}(d\varphi-t(Y)-Y)

where

\varphit:G\toG

denotes the flow generated by X. As it turns out,

\varphit(g)=g\varphit(e)

, roughly because both sides satisfy the same ODE defining the flow. That is,

\varphit=

R
\varphit(e)
where

Rh

denotes the right multiplication by

h\inG

. On the other hand, since

\Psig=

R
g-1

\circLg

, by the chain rule,

\operatorname{Ad}g(Y)=d

(R
g-1

\circLg)(Y)=d

R
g-1

(dLg(Y))=d

R
g-1

(Y)

as Y is left-invariant. Hence,

[X,Y]=\limt{1\over

t}(\operatorname{Ad}
\varphit(e)

(Y)-Y)

,which is what was needed to show.

Thus,

adx

coincides with the same one defined in below. Ad and ad are related through the exponential map: Specifically, Adexp(x) = exp(adx) for all x in the Lie algebra.[2] It is a consequence of the general result relating Lie group and Lie algebra homomorphisms via the exponential map.[3]

If G is an immersely linear Lie group, then the above computation simplifies: indeed, as noted early,

\operatorname{Ad}g(Y)=gYg-1

and thus with

g=etX

,
\operatorname{Ad}
etX

(Y)=etXYe-tX

.Taking the derivative of this at

t=0

, we have:

\operatorname{ad}XY=XY-YX

.The general case can also be deduced from the linear case: indeed, let

G'

be an immersely linear Lie group having the same Lie algebra as that of G. Then the derivative of Ad at the identity element for G and that for G coincide; hence, without loss of generality, G can be assumed to be G.

The upper-case/lower-case notation is used extensively in the literature. Thus, for example, a vector in the algebra

ak{g}

generates a vector field in the group . Similarly, the adjoint map of vectors in

ak{g}

is homomorphic to the Lie derivative of vector fields on the group considered as a manifold.

Further see the derivative of the exponential map.

Adjoint representation of a Lie algebra

Let

ak{g}

be a Lie algebra over some field. Given an element of a Lie algebra

ak{g}

, one defines the adjoint action of on

ak{g}

as the map

\operatorname{ad}x:ak{g}\toak{g}    with    \operatorname{ad}x(y)=[x,y]

for all in

ak{g}

. It is called the adjoint endomorphism or adjoint action. (

\operatorname{ad}x

is also often denoted as

\operatorname{ad}(x)

.) Since a bracket is bilinear, this determines the linear mapping

\operatorname{ad}:ak{g}\toak{gl}(ak{g})=(\operatorname{End}(ak{g}),[,])

given by . Within End

(ak{g})

, the bracket is, by definition, given by the commutator of the two operators:

[T,S]=T\circS-S\circT

where

\circ

denotes composition of linear maps. Using the above definition of the bracket, the Jacobi identity

[x,[y,z]]+[y,[z,x]]+[z,[x,y]]=0

takes the form

\left([\operatorname{ad}x,\operatorname{ad}y]\right)(z)=\left(\operatorname{ad}[x,\right)(z)

where,, and are arbitrary elements of

ak{g}

.

This last identity says that ad is a Lie algebra homomorphism; i.e., a linear mapping that takes brackets to brackets. Hence, ad is a representation of a Lie algebra and is called the adjoint representation of the algebra

ak{g}

.

If

ak{g}

is finite-dimensional and a basis for it is chosen, then

ak{gl}(ak{g})

is the Lie algebra of square matrices and the composition corresponds to matrix multiplication.

In a more module-theoretic language, the construction says that

ak{g}

is a module over itself.

The kernel of ad is the center of

ak{g}

(that's just rephrasing the definition). On the other hand, for each element in

ak{g}

, the linear mapping

\delta=\operatorname{ad}z

obeys the Leibniz' law:

\delta([x,y])=[\delta(x),y]+[x,\delta(y)]

for all and in the algebra (the restatement of the Jacobi identity). That is to say, adz is a derivation and the image of

ak{g}

under ad is a subalgebra of Der

(ak{g})

, the space of all derivations of

ak{g}

.

When

ak{g}=\operatorname{Lie}(G)

is the Lie algebra of a Lie group G, ad is the differential of Ad at the identity element of G.

There is the following formula similar to the Leibniz formula: for scalars

\alpha,\beta

and Lie algebra elements

x,y,z

,

(\operatorname{ad}x-\alpha-\beta)n[y,z]=

n
\sum
i=0

\binom{n}{i}\left[(\operatorname{ad}x-\alpha)iy,(\operatorname{ad}x-\beta)nz\right].

Structure constants

The explicit matrix elements of the adjoint representation are given by the structure constants of the algebra. That is, let be a set of basis vectors for the algebra, with

[ei,e

ij
k{c
}_k e^k.Then the matrix elements for adeiare given by

{\left[

\operatorname{ad}
ei
j
\right]
k}

={cij

}_k ~.

Thus, for example, the adjoint representation of su(2) is the defining representation of so(3).

Examples

GL(n,\Complex)

), then its Lie algebra is an algebra of n×n matrices with the commutator for a Lie bracket (i.e. a subalgebra of

ak{gl}n(\Complex)

). In this case, the adjoint map is given by Adg(x) = gxg−1.

Properties

The following table summarizes the properties of the various maps mentioned in the definition

\Psi\colonG\to\operatorname{Aut}(G)

\Psig\colonG\toG

Lie group homomorphism:

\Psigh=\Psig\Psih

Lie group automorphism:

\Psig(ab)=\Psig(a)\Psig(b)

-1
(\Psi
g)

=

\Psi
g-1

\operatorname{Ad}\colonG\to\operatorname{Aut}(ak{g})

\operatorname{Ad}g\colonak{g}\toak{g}

Lie group homomorphism:

\operatorname{Ad}gh=\operatorname{Ad}g\operatorname{Ad}h

Lie algebra automorphism:

\operatorname{Ad}g

is linear
-1
\left(\operatorname{Ad}
g\right)

=

\operatorname{Ad}
g-1

\operatorname{Ad}g[x,y]=[\operatorname{Ad}gx,\operatorname{Ad}gy]

\operatorname{ad}\colonakg\to\operatorname{Der}(akg)

\operatorname{ad}x\colonakg\toakg

Lie algebra homomorphism:

\operatorname{ad}

is linear

\operatorname{ad}[x,y]=[\operatorname{ad}x,\operatorname{ad}y]

Lie algebra derivation:

\operatorname{ad}x

is linear

\operatorname{ad}x[y,z]=[\operatorname{ad}xy,z]+[y,\operatorname{ad}xz]

The image of G under the adjoint representation is denoted by Ad(G). If G is connected, the kernel of the adjoint representation coincides with the kernel of Ψ which is just the center of G. Therefore, the adjoint representation of a connected Lie group G is faithful if and only if G is centerless. More generally, if G is not connected, then the kernel of the adjoint map is the centralizer of the identity component G0 of G. By the first isomorphism theorem we have

Ad(G)\congG/ZG(G0).

Given a finite-dimensional real Lie algebra

ak{g}

, by Lie's third theorem, there is a connected Lie group

\operatorname{Int}(ak{g})

whose Lie algebra is the image of the adjoint representation of

ak{g}

(i.e.,

\operatorname{Lie}(\operatorname{Int}(ak{g}))=\operatorname{ad}(ak{g})

.) It is called the adjoint group of

ak{g}

.

Now, if

ak{g}

is the Lie algebra of a connected Lie group G, then

\operatorname{Int}(ak{g})

is the image of the adjoint representation of G:

\operatorname{Int}(ak{g})=\operatorname{Ad}(G)

.

Roots of a semisimple Lie group

If G is semisimple, the non-zero weights of the adjoint representation form a root system.[4] (In general, one needs to pass to the complexification of the Lie algebra before proceeding.) To see how this works, consider the case G = SL(n, R). We can take the group of diagonal matrices diag(t1, ..., tn) as our maximal torus T. Conjugation by an element of T sends

\begin{bmatrix} a11&a12&&a1n\\ a21&a22&&a2n\\ \vdots&\vdots&\ddots&\vdots\\ an1&an2&&ann\\ \end{bmatrix} \mapsto \begin{bmatrix} a11&t1t

-1
2

a12&&t1t

-1
n

a1n\\ t2t

-1
1

a21&a22&&t2t

-1
n

a2n\\ \vdots&\vdots&\ddots&\vdots\\ tnt

-1
1

an1&tnt

-1
2

an2&&ann\\ \end{bmatrix}.

Thus, T acts trivially on the diagonal part of the Lie algebra of G and with eigenvectors titj-1 on the various off-diagonal entries. The roots of G are the weights diag(t1, ..., tn) → titj-1. This accounts for the standard description of the root system of G = SLn(R) as the set of vectors of the form eiej.

Example SL(2, R)

When computing the root system for one of the simplest cases of Lie Groups, the group SL(2, R) of two dimensional matrices with determinant 1 consists of the set of matrices of the form:

\begin{bmatrix} a&b\\ c&d\\ \end{bmatrix}

with a, b, c, d real and ad - bc = 1.

A maximal compact connected abelian Lie subgroup, or maximal torus T, is given by the subset of all matrices of the form

\begin{bmatrix} t1&0\\ 0&t2\\ \end{bmatrix} = \begin{bmatrix} t1&0\\ 0&1/t1\\ \end{bmatrix} = \begin{bmatrix} \exp(\theta)&0\\ 0&\exp(-\theta)\\ \end{bmatrix}

with

t1t2=1

. The Lie algebra of the maximal torus is the Cartan subalgebra consisting of the matrices

\begin{bmatrix} \theta&0\\ 0&-\theta\\ \end{bmatrix}=\theta\begin{bmatrix} 1&0\\ 0&0\\ \end{bmatrix}-\theta\begin{bmatrix} 0&0\\ 0&1\\ \end{bmatrix} =\theta(e1-e2).

If we conjugate an element of SL(2, R) by an element of the maximal torus we obtain

\begin{bmatrix} t1&0\\ 0&1/t1\\ \end{bmatrix} \begin{bmatrix} a&b\\ c&d\\ \end{bmatrix} \begin{bmatrix} 1/t1&0\\ 0&t1\\ \end{bmatrix} = \begin{bmatrix} at1&bt1\\ c/t1&d/t1\\ \end{bmatrix} \begin{bmatrix} 1/t1&0\\ 0&t1\\ \end{bmatrix} = \begin{bmatrix} a&b

2\\ c
t
1
-2
t
1

&d\\ \end{bmatrix}

The matrices

\begin{bmatrix} 1&0\\ 0&0\\ \end{bmatrix} \begin{bmatrix} 0&0\\ 0&1\\ \end{bmatrix} \begin{bmatrix} 0&1\\ 0&0\\ \end{bmatrix} \begin{bmatrix} 0&0\\ 1&0\\ \end{bmatrix}

are then 'eigenvectors' of the conjugation operation with eigenvalues

2,
1,1,t
1
-2
t
1
. The function Λ which gives
2
t
1
is a multiplicative character, or homomorphism from the group's torus to the underlying field R. The function λ giving θ is a weight of the Lie Algebra with weight space given by the span of the matrices.

It is satisfying to show the multiplicativity of the character and the linearity of the weight. It can further be proved that the differential of Λ can be used to create a weight. It is also educational to consider the case of SL(3, R).

Variants and analogues

The adjoint representation can also be defined for algebraic groups over any field.

The co-adjoint representation is the contragredient representation of the adjoint representation. Alexandre Kirillov observed that the orbit of any vector in a co-adjoint representation is a symplectic manifold. According to the philosophy in representation theory known as the orbit method (see also the Kirillov character formula), the irreducible representations of a Lie group G should be indexed in some way by its co-adjoint orbits. This relationship is closest in the case of nilpotent Lie groups.

References

Notes and References

  1. Indeed, by the chain rule,

    \operatorname{Ad}gh=d(\Psigh)e=d(\Psig\circ\Psih)e=d(\Psig)e\circd(\Psih)e=\operatorname{Ad}g\circ\operatorname{Ad}h.

  2. Proposition 3.35
  3. Theorem 3.28
  4. Section 7.3