Commutator Explained

In mathematics, the commutator gives an indication of the extent to which a certain binary operation fails to be commutative. There are different definitions used in group theory and ring theory.

Group theory

The commutator of two elements, g and h, of a group G, is the element

[g,h]=g^{-1}h^{-1}gh.

This element is equal to the group's identity if and only if g and h commute (that is, if and only if gh = hg).
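The criterion above can be checked exhaustively in a small group. The following is a minimal sketch, assuming permutations of {0, 1, 2} stored as tuples; the helper names `compose`, `inverse` and `commutator` are illustrative and not part of any library.

```python
from itertools import permutations

# A permutation of {0, ..., n-1} is stored as a tuple p, where p[i] is the image of i.
def compose(p, q):
    """Group product pq, read left to right: apply p first, then q."""
    return tuple(q[p[i]] for i in range(len(p)))

def inverse(p):
    """Inverse permutation: sends p[i] back to i."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def commutator(g, h):
    """Group commutator [g, h] = g^-1 h^-1 g h."""
    return compose(compose(inverse(g), inverse(h)), compose(g, h))

identity = (0, 1, 2)
S3 = list(permutations(range(3)))

# [g, h] is the identity exactly when gh = hg, for every pair in S3.
criterion_holds = all(
    (commutator(g, h) == identity) == (compose(g, h) == compose(h, g))
    for g in S3 for h in S3
)
```

Reading products left to right is a choice made here for convenience; the criterion holds for either reading order, since both give a group.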

The set of all commutators of a group is not in general closed under the group operation, but the subgroup of G generated by all commutators is closed and is called the derived group or the commutator subgroup of G. Commutators are used to define nilpotent and solvable groups and the largest abelian quotient group.

The definition of the commutator above is used throughout this article, but many group theorists define the commutator as

[g,h]=ghg^{-1}h^{-1}.

Using the first definition, this can be expressed as [g^{-1},h^{-1}].

Identities (group theory)

Commutator identities are an important tool in group theory. The expression a^x denotes the conjugate of a by x, defined as x^{-1}ax.

(1) x^y=x[x,y].

(2) [y,x]=[x,y]^{-1}.

(3) [x,zy]=[x,y] ⋅ [x,z]^y and [xz,y]=[x,y]^z ⋅ [z,y].

(4) \left[x,y^{-1}\right]=[y,x]^{y^{-1}} and \left[x^{-1},y\right]=[y,x]^{x^{-1}}.

(5) \left[\left[x,y^{-1}\right],z\right]^y ⋅ \left[\left[y,z^{-1}\right],x\right]^z ⋅ \left[\left[z,x^{-1}\right],y\right]^x=1 and \left[\left[x,y\right],z^x\right] ⋅ \left[[z,x],y^z\right] ⋅ \left[[y,z],x^y\right]=1.

Identity (5) is also known as the Hall–Witt identity, after Philip Hall and Ernst Witt. It is a group-theoretic analogue of the Jacobi identity for the ring-theoretic commutator (see next section).
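As a sanity check, the Hall–Witt identity can be verified by brute force in a small non-abelian group. This is a sketch under the conventions of this section ([x, y] = x^-1 y^-1 x y and a^x = x^-1 a x), using the symmetric group on four points; all function names are illustrative.

```python
from itertools import permutations, product

def compose(p, q):
    """Group product pq on permutation tuples: apply p first, then q."""
    return tuple(q[p[i]] for i in range(len(p)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def comm(x, y):
    """[x, y] = x^-1 y^-1 x y."""
    return compose(compose(inverse(x), inverse(y)), compose(x, y))

def conj(a, x):
    """a^x = x^-1 a x."""
    return compose(compose(inverse(x), a), x)

def hall_witt(x, y, z):
    """Left-hand side [[x,y^-1],z]^y . [[y,z^-1],x]^z . [[z,x^-1],y]^x."""
    t1 = conj(comm(comm(x, inverse(y)), z), y)
    t2 = conj(comm(comm(y, inverse(z)), x), z)
    t3 = conj(comm(comm(z, inverse(x)), y), x)
    return compose(compose(t1, t2), t3)

S4 = list(permutations(range(4)))
identity = tuple(range(4))

# The product of the three conjugated commutators is trivial for every triple.
hall_witt_holds = all(
    hall_witt(x, y, z) == identity for x, y, z in product(S4, repeat=3)
)
```

A finite check of this kind does not prove the identity, but it catches sign or convention slips quickly.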

N.B., the above definition of the conjugate of a by x is used by some group theorists. Many other group theorists define the conjugate of a by x as xax^{-1}. This is often written {}^x a. Similar identities hold for these conventions.

Many identities that are true modulo certain subgroups are also used. These can be particularly useful in the study of solvable groups and nilpotent groups. For instance, in any group, second powers behave well:

(xy)²=x²y²[y,x][[y,x],y].

If the derived subgroup is central, then

(xy)ⁿ=xⁿyⁿ[y,x]^{\binom{n}{2}}.
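The power formula for a central derived subgroup can be tried out in the integer Heisenberg group, whose commutators are all central. The sketch below assumes elements are encoded as triples (a, b, c) standing for the upper unitriangular matrix with rows (1, a, c), (0, 1, b), (0, 0, 1); this encoding and the function names are illustrative choices.

```python
from math import comb

def mul(p, q):
    """Product of two Heisenberg elements encoded as triples (a, b, c)."""
    a, b, c = p
    a2, b2, c2 = q
    return (a + a2, b + b2, c + c2 + a * b2)

def inv(p):
    """Inverse of (a, b, c): the unique triple with mul(p, inv(p)) == (0, 0, 0)."""
    a, b, c = p
    return (-a, -b, a * b - c)

def power(p, n):
    """p multiplied by itself n times, for n >= 0."""
    result = (0, 0, 0)
    for _ in range(n):
        result = mul(result, p)
    return result

def comm(g, h):
    """[g, h] = g^-1 h^-1 g h."""
    return mul(mul(inv(g), inv(h)), mul(g, h))

def power_identity_holds(x, y, n):
    """Check (xy)^n == x^n y^n [y, x]^binom(n, 2)."""
    lhs = power(mul(x, y), n)
    rhs = mul(mul(power(x, n), power(y, n)), power(comm(y, x), comb(n, 2)))
    return lhs == rhs

x = (2, -1, 5)
y = (3, 4, -2)
all_hold = all(power_identity_holds(x, y, n) for n in range(8))
```

The check relies on `math.comb`, available from Python 3.8.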

Ring theory

Rings often do not support division. Thus, the commutator of two elements a and b of a ring (or any associative algebra) is defined differently by

[a,b]=ab-ba.

The commutator is zero if and only if a and b commute. In linear algebra, if two endomorphisms of a space are represented by commuting matrices in terms of one basis, then they are so represented in terms of every basis. By using the commutator as a Lie bracket, every associative algebra can be turned into a Lie algebra.

The anticommutator of two elements a and b of a ring or associative algebra is defined by

\{a,b\}=ab+ba.

Sometimes [a,b]_+ is used to denote the anticommutator, while [a,b]_- is then used for the commutator. The anticommutator is used less often, but it can be used to define Clifford algebras and Jordan algebras, and it appears in the derivation of the Dirac equation in particle physics.

The commutator of two operators acting on a Hilbert space is a central concept in quantum mechanics, since it quantifies how well the two observables described by these operators can be measured simultaneously. The uncertainty principle is ultimately a theorem about such commutators, by virtue of the Robertson–Schrödinger relation. In phase space, equivalent commutators of function star-products are called Moyal brackets and are completely isomorphic to the Hilbert space commutator structures mentioned.

Identities (ring theory)

The commutator has the following properties:

Lie-algebra identities

(1) [A+B,C]=[A,C]+[B,C]

(2) [A,A]=0

(3) [A,B]=-[B,A]

(4) [A,[B,C]]+[B,[C,A]]+[C,[A,B]]=0

Relation (3) is called anticommutativity, while (4) is the Jacobi identity.
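The Lie-algebra identities are easy to confirm on concrete matrices. The following is a small sketch using 2×2 integer matrices stored as lists of rows; the helpers `matmul`, `add`, `sub` and `comm` are written out here and are not taken from a library.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def sub(a, b):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def comm(a, b):
    """Ring commutator [a, b] = ab - ba."""
    return sub(matmul(a, b), matmul(b, a))

A = [[1, 2], [3, 4]]
B = [[0, 1], [-1, 2]]
C = [[2, 0], [5, -3]]
ZERO = [[0, 0], [0, 0]]

# [A, A] should vanish.
self_commutator = comm(A, A)
# Anticommutativity: [A, B] + [B, A] should vanish.
antisymmetry = add(comm(A, B), comm(B, A))
# Jacobi identity: the cyclic sum of nested commutators should vanish.
jacobi = add(add(comm(A, comm(B, C)), comm(B, comm(C, A))),
             comm(C, comm(A, B)))
```

All three results should equal `ZERO` for any choice of square matrices of the same size.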

Additional identities

(1) [A,BC]=[A,B]C+B[A,C]

(2) [A,BCD]=[A,B]CD+B[A,C]D+BC[A,D]

(3) [A,BCDE]=[A,B]CDE+B[A,C]DE+BC[A,D]E+BCD[A,E]

(4) [AB,C]=A[B,C]+[A,C]B

(5) [ABC,D]=AB[C,D]+A[B,D]C+[A,D]BC

(6) [ABCD,E]=ABC[D,E]+AB[C,E]D+A[B,E]CD+[A,E]BCD

(7) [A,B+C]=[A,B]+[A,C]

(8) [A+B,C+D]=[A,C]+[A,D]+[B,C]+[B,D]

(9) [AB,CD]=A[B,C]D+[A,C]BD+CA[B,D]+C[A,D]B=A[B,C]D+AC[B,D]+[A,C]DB+C[A,D]B

(10) [[A,C],[B,D]]=[[[A,B],C],D]+[[[B,C],D],A]+[[[C,D],A],B]+[[[D,A],B],C]

If A is a fixed element of a ring R, identity (1) can be interpreted as a Leibniz rule for the map ad_A : R → R given by ad_A(B) = [A,B]. In other words, the map ad_A defines a derivation on the ring R. Identities (2), (3) represent Leibniz rules for more than two factors, and are valid for any derivation. Identities (4)–(6) can also be interpreted as Leibniz rules. Identities (7), (8) express Z-bilinearity.

From identity (9), one finds that the commutator of integer powers of ring elements is:

[A^N,B^M]=\sum_{n=0}^{N-1}\sum_{m=0}^{M-1}A^nB^m[A,B]B^{M-m-1}A^{N-n-1}=\sum_{n=0}^{N-1}\sum_{m=0}^{M-1}B^mA^n[A,B]A^{N-n-1}B^{M-m-1}
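A double-sum expansion of this kind is easy to get wrong in the exponents, so a numerical check is worthwhile. The sketch below applies the Leibniz rules to each factor in turn, which gives the terms A^n B^m [A,B] B^(M-m-1) A^(N-n-1), and compares their sum with [A^N, B^M] for small integer matrices. All helper names are illustrative.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def sub(a, b):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mpow(a, k):
    """k-th power of a square matrix, with a^0 the identity matrix."""
    n = len(a)
    result = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
    for _ in range(k):
        result = matmul(result, a)
    return result

def comm(a, b):
    return sub(matmul(a, b), matmul(b, a))

def power_commutator_sum(A, B, N, M):
    """Sum of A^n B^m [A,B] B^(M-m-1) A^(N-n-1) over 0 <= n < N, 0 <= m < M."""
    size = len(A)
    total = [[0] * size for _ in range(size)]
    c = comm(A, B)
    for n in range(N):
        for m in range(M):
            left = matmul(mpow(A, n), mpow(B, m))
            right = matmul(mpow(B, M - m - 1), mpow(A, N - n - 1))
            total = add(total, matmul(matmul(left, c), right))
    return total

A = [[1, 2], [3, 4]]
B = [[0, 1], [-1, 2]]
formula_holds = all(
    comm(mpow(A, N), mpow(B, M)) == power_commutator_sum(A, B, N, M)
    for N in range(1, 4) for M in range(1, 4)
)
```

Each term has total degree N in A and M in B, which is a quick way to see that the exponents are placed consistently.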

Some of the above identities can be extended to the anticommutator using the above ± subscript notation. For example:

[AB,C]_\pm=A[B,C]_-+[A,C]_\pm B

[AB,CD]_\pm=A[B,C]_-D+AC[B,D]_-+[A,C]_-DB+C[A,D]_\pm B

[[A,B],[C,D]]=[[[B,C]_+,A]_+,D]-[[[B,D]_+,A]_+,C]+[[[A,D]_+,B]_+,C]-[[[A,C]_+,B]_+,D]

\left[A,[B,C]_\pm\right]+\left[B,[C,A]_\pm\right]+\left[C,[A,B]_\pm\right]=0

[A,BC]_\pm=[A,B]_-C+B[A,C]_\pm=[A,B]_\pm C\mp B[A,C]_-

[A,BC]=[A,B]_\pm C\mp B[A,C]_\pm

Exponential identities

Consider a ring or algebra in which the exponential

e^A=\exp(A)=1+A+\tfrac{1}{2!}A^2+\cdots

can be meaningfully defined, such as a Banach algebra or a ring of formal power series.

In such a ring, Hadamard's lemma applied to nested commutators gives: $e^A B e^{-A} \ =\ B + [A, B] + \frac{1}{2!}[A, [A, B]] + \frac{1}{3!}[A, [A, [A, B]]] + \cdots \ =\ e^{\operatorname{ad}_A}(B).$ (For the last expression, see Adjoint derivation below.) This formula underlies the Baker–Campbell–Hausdorff expansion of log(exp(A) exp(B)).
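Hadamard's lemma can be checked exactly when A is nilpotent, because both the exponential series and the nested-commutator series then terminate. The sketch below assumes a strictly upper triangular 3×3 matrix A (so A³ = 0) and uses exact rational arithmetic; the truncation at `terms` is only valid under that nilpotency assumption, and the helper names are illustrative.

```python
from fractions import Fraction

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def sub(a, b):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(a, c):
    return [[c * v for v in row] for row in a]

def comm(a, b):
    return sub(matmul(a, b), matmul(b, a))

def exp_series(a, terms=8):
    """Partial sum of 1 + a + a^2/2! + ...; exact here because a is nilpotent."""
    n = len(a)
    term = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    result = term
    for k in range(1, terms):
        term = scale(matmul(term, a), Fraction(1, k))   # a^k / k!
        result = add(result, term)
    return result

def hadamard_series(a, b, terms=8):
    """Partial sum of b + [a,b] + [a,[a,b]]/2! + ...; terminates for nilpotent a."""
    nested = b
    result = b
    for k in range(1, terms):
        nested = scale(comm(a, nested), Fraction(1, k))  # ad_a^k(b) / k!
        result = add(result, nested)
    return result

A = [[0, 1, 2], [0, 0, 3], [0, 0, 0]]   # strictly upper triangular, A^3 = 0
B = [[1, 2, 0], [4, 5, 6], [7, 0, 9]]

lhs = matmul(matmul(exp_series(A), B), exp_series(scale(A, -1)))
rhs = hadamard_series(A, B)
```

For a general matrix the two sides would only agree up to truncation error, so a floating-point comparison with a tolerance would be needed instead.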

A similar expansion expresses the group commutator of expressions e^A (analogous to elements of a Lie group) in terms of a series of nested commutators (Lie brackets),

e^A e^B e^{-A} e^{-B} =\exp\!\left([A, B] + \frac{1}{2!}[A{+}B, [A, B]] + \frac{1}{3!} \left(\frac{1}{2} [A, [B, [B, A]]] + [A{+}B, [A{+}B, [A, B]]]\right) + \cdots\right).

Graded rings and algebras

When dealing with graded algebras, the commutator is usually replaced by the graded commutator, defined in homogeneous components as

[\omega,\eta]_{gr}:=\omega\eta-(-1)^{\deg\omega\deg\eta}\eta\omega.

Adjoint derivation

Especially if one deals with multiple commutators in a ring R, another notation turns out to be useful. For an element x ∈ R, we define the adjoint mapping ad_x : R → R by:

\operatorname{ad}_x(y)=[x,y]=xy-yx.

This mapping is a derivation on the ring R:

ad_x(yz) = ad_x(y)z+yad_x(z).

By the Jacobi identity, it is also a derivation over the commutation operation:

ad_x[y,z] = [ad_x(y),z]+[y,ad_x(z)].
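Both derivation properties of the adjoint mapping can be confirmed on sample matrices. This is a small sketch with 2×2 integer matrices; `ad(x)` is an illustrative helper that returns the mapping y ↦ [x, y] as a Python function.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[p + q for p, q in zip(ra, rb)] for ra, rb in zip(a, b)]

def sub(a, b):
    return [[p - q for p, q in zip(ra, rb)] for ra, rb in zip(a, b)]

def comm(a, b):
    """[a, b] = ab - ba."""
    return sub(matmul(a, b), matmul(b, a))

def ad(x):
    """Adjoint mapping ad_x: y -> [x, y]."""
    return lambda y: comm(x, y)

X = [[1, 2], [3, 4]]
Y = [[0, 1], [-1, 2]]
Z = [[2, 0], [5, -3]]
ad_X = ad(X)

# Derivation over the ring product: ad_x(yz) = ad_x(y) z + y ad_x(z)
product_rule_holds = ad_X(matmul(Y, Z)) == add(matmul(ad_X(Y), Z),
                                               matmul(Y, ad_X(Z)))

# Derivation over the commutator: ad_x[y,z] = [ad_x(y), z] + [y, ad_x(z)]
bracket_rule_holds = ad_X(comm(Y, Z)) == add(comm(ad_X(Y), Z),
                                             comm(Y, ad_X(Z)))
```

The second property is a rearrangement of the Jacobi identity, so it holds whenever the first three Lie-algebra identities do.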

Composing such mappings, we get for example

\operatorname{ad}_x\operatorname{ad}_y(z)=[x,[y,z]]

and

\operatorname{ad}_x^2(z)=\operatorname{ad}_x(\operatorname{ad}_x(z))=[x,[x,z]].

We may consider ad itself as a mapping, ad : R → End(R), where End(R) is the ring of mappings from R to itself with composition as the multiplication operation. Then ad is a Lie algebra homomorphism, preserving the commutator:

\operatorname{ad}_{[x,y]}=\left[\operatorname{ad}_x,\operatorname{ad}_y\right].

By contrast, it is not always a ring homomorphism: usually

\operatorname{ad}_{xy} ≠ \operatorname{ad}_x\operatorname{ad}_y.
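The contrast between the two statements shows up already for 2×2 matrices. In the sketch below, the mappings are compared on the four matrix units, which span all 2×2 matrices, so agreement on them means agreement as linear maps. The matrices X and Y are the standard raising and lowering matrices, chosen here only as a convenient example.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def sub(a, b):
    return [[p - q for p, q in zip(ra, rb)] for ra, rb in zip(a, b)]

def comm(a, b):
    return sub(matmul(a, b), matmul(b, a))

def ad(x):
    """Adjoint mapping ad_x: z -> [x, z]."""
    return lambda z: comm(x, z)

X = [[0, 1], [0, 0]]
Y = [[0, 0], [1, 0]]

# The four matrix units E11, E12, E21, E22 form a basis of the 2x2 matrices.
basis = [[[int((i, j) == (r, c)) for j in range(2)] for i in range(2)]
         for r in range(2) for c in range(2)]

# ad_[x,y] agrees with ad_x ad_y - ad_y ad_x on every basis element.
lie_homomorphism = all(
    ad(comm(X, Y))(Z) == sub(ad(X)(ad(Y)(Z)), ad(Y)(ad(X)(Z)))
    for Z in basis
)

# ad_xy differs from the composition ad_x ad_y on at least one basis element.
not_ring_homomorphism = any(
    ad(matmul(X, Y))(Z) != ad(X)(ad(Y)(Z)) for Z in basis
)
```

With this choice, ad_xy sends the diagonal matrix with entries 1 and -1 to zero, while the composition ad_x ad_y sends it to twice itself.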

General Leibniz rule

The general Leibniz rule, expanding repeated derivatives of a product, can be written abstractly using the adjoint representation:

x^ny=\sum_{k=0}^{n}\binom{n}{k}\operatorname{ad}_x^k(y)\,x^{n-k}.

Replacing x by the differentiation operator \partial, and y by the multiplication operator m_f : g ↦ fg, we get \operatorname{ad}(\partial)(m_f)=m_{\partial(f)}, and applying both sides to a function g, the identity becomes the usual Leibniz rule for the nth derivative \partial^n(fg).
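The abstract form of the rule, x^n y as the sum over k of binom(n, k) ad_x^k(y) x^(n-k), can be tested on matrices. The following is a minimal sketch with 2×2 integer matrices; `ad_power` and `leibniz_rhs` are illustrative names, and `math.comb` requires Python 3.8 or later.

```python
from math import comb

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[p + q for p, q in zip(ra, rb)] for ra, rb in zip(a, b)]

def sub(a, b):
    return [[p - q for p, q in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(a, c):
    return [[c * v for v in row] for row in a]

def mpow(a, k):
    """k-th power of a square matrix, with a^0 the identity matrix."""
    n = len(a)
    result = [[int(i == j) for j in range(n)] for i in range(n)]
    for _ in range(k):
        result = matmul(result, a)
    return result

def ad_power(x, y, k):
    """ad_x applied k times to y: [x, [x, ... [x, y]]]."""
    result = y
    for _ in range(k):
        result = sub(matmul(x, result), matmul(result, x))
    return result

def leibniz_rhs(x, y, n):
    """Sum over k of binom(n, k) * ad_x^k(y) * x^(n-k)."""
    size = len(x)
    total = [[0] * size for _ in range(size)]
    for k in range(n + 1):
        term = matmul(ad_power(x, y, k), mpow(x, n - k))
        total = add(total, scale(term, comb(n, k)))
    return total

X = [[1, 2], [3, 4]]
Y = [[0, 1], [-1, 2]]
rule_holds = all(matmul(mpow(X, n), Y) == leibniz_rhs(X, Y, n) for n in range(6))
```

For n = 1 the rule reduces to xy = yx + [x, y], which is just the definition of the commutator rearranged.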