Hermitian adjoint explained

A

on an inner product space defines a Hermitian adjoint (or adjoint) operator

A*

on that space according to the rule

\langleAx,y\rangle=\langlex,A*y\rangle,

where

\langle,\rangle

is the inner product on the vector space.

The adjoint may also be called the Hermitian conjugate or simply the Hermitian[1] after Charles Hermite. It is often denoted by in fields like physics, especially when used in conjunction with bra–ket notation in quantum mechanics. In finite dimensions where operators can be represented by matrices, the Hermitian adjoint is given by the conjugate transpose (also known as the Hermitian transpose).

The above definition of an adjoint operator extends verbatim to bounded linear operators on Hilbert spaces

H

. The definition has been further extended to include unbounded densely defined operators, whose domain is topologically dense in, but not necessarily equal to,

H.

Informal definition

A:H1\toH2

between Hilbert spaces. Without taking care of any details, the adjoint operator is the (in most cases uniquely defined) linear operator

A*:H2\toH1

fulfilling

\left\langleAh1,h2

\right\rangle
H2

=\left\langleh1,A*h2

\right\rangle
H1

,

where

\langle,

\rangle
Hi
is the inner product in the Hilbert space

Hi

, which is linear in the first coordinate and conjugate linear in the second coordinate. Note the special case where both Hilbert spaces are identical and

A

is an operator on that Hilbert space.

When one trades the inner product for the dual pairing, one can define the adjoint, also called the transpose, of an operator

A:E\toF

, where

E,F

are Banach spaces with corresponding norms

\|\|E,\|\|F

. Here (again not considering any technicalities), its adjoint operator is defined as

A*:F*\toE*

with

A*f=f\circA:u\mapstof(Au),

I.e.,

\left(A*f\right)(u)=f(Au)

for

f\inF*,u\inE

.

The above definition in the Hilbert space setting is really just an application of the Banach space case when one identifies a Hilbert space with its dual. Then it is only natural that we can also obtain the adjoint of an operator

A:H\toE

, where

H

is a Hilbert space and

E

is a Banach space. The dual is then defined as

A*:E*\toH

with

A*f=hf

such that

\langlehf,h\rangleH=f(Ah).

Definition for unbounded operators between Banach spaces

Let

\left(E,\|\|E\right),\left(F,\|\|F\right)

be Banach spaces. Suppose

A:D(A)\toF

and

D(A)\subsetE

, and suppose that

A

is a (possibly unbounded) linear operator which is densely defined (i.e.,

D(A)

is dense in

E

). Then its adjoint operator

A*

is defined as follows. The domain is

D\left(A*\right):=\left\{g\inF*:~\existsc\geq0:~forallu\inD(A):~|g(Au)|\leqc\|u\|E\right\}.

Now for arbitrary but fixed

g\inD(A*)

we set

f:D(A)\to\R

with

f(u)=g(Au)

. By choice of

g

and definition of

D(A*)

, f is (uniformly) continuous on

D(A)

as

|f(u)|=|g(Au)|\leqc\|u\|E

. Then by the Hahn–Banach theorem, or alternatively through extension by continuity, this yields an extension of

f

, called

\hat{f}

, defined on all of

E

. This technicality is necessary to later obtain

A*

as an operator

D\left(A*\right)\toE*

instead of

D\left(A*\right)\to(D(A))*.

Remark also that this does not mean that

A

can be extended on all of

E

but the extension only worked for specific elements

g\inD\left(A*\right)

.

Now, we can define the adjoint of

A

as

\begin{align} A*:F*\supsetD(A*)&\toE*\\ g&\mapstoA*g=\hatf. \end{align}

The fundamental defining identity is thus

g(Au)=\left(A*g\right)(u)

for

u\inD(A).

Definition for bounded operators between Hilbert spaces

\langle,\rangle

. Consider a continuous linear operator (for linear operators, continuity is equivalent to being a bounded operator). Then the adjoint of is the continuous linear operator satisfying

\langleAx,y\rangle=\left\langlex,A*y\right\rangleforallx,y\inH.

Existence and uniqueness of this operator follows from the Riesz representation theorem.

This can be seen as a generalization of the adjoint matrix of a square matrix which has a similar property involving the standard complex inner product.

Properties

The following properties of the Hermitian adjoint of bounded operators are immediate:

  1. Involutivity:
  2. If is invertible, then so is, with \left(A^*\right)^ = \left(A^\right)^*
  3. Conjugate linearity:

If we define the operator norm of by

\|A\|op:=\sup\left\{\|Ax\|:\|x\|\le1\right\}

then

\left\|A*\right\|op=\|A\|op.

Moreover,

\left\|A*A\right\|op=

2.
\|A\|
op

One says that a norm that satisfies this condition behaves like a "largest value", extrapolating from the case of self-adjoint operators.

The set of bounded linear operators on a complex Hilbert space together with the adjoint operation and the operator norm form the prototype of a C*-algebra.

Adjoint of densely defined unbounded operators between Hilbert spaces

Definition

Let the inner product

\langle,\rangle

be linear in the first argument. A densely defined operator from a complex Hilbert space to itself is a linear operator whose domain is a dense linear subspace of and whose values lie in .[2] By definition, the domain of its adjoint is the set of all for which there is a satisfying

\langleAx,y\rangle=\langlex,z\rangleforallx\inD(A).

Owing to the density of

D(A)

and Riesz representation theorem,

z

is uniquely defined, and, by definition,

A*y=z.

Properties 1.–5. hold with appropriate clauses about domains and codomains. For instance, the last property now states that is an extension of if, and are densely defined operators.

ker A=(im A)

For every

y\in\kerA*,

the linear functional

x\mapsto\langleAx,y\rangle=\langlex,A*y\rangle

is identically zero, and hence

y\in(\operatorname{im}A)\perp.

Conversely, the assumption that

y\in(\operatorname{im}A)\perp

causes the functional

x\mapsto\langleAx,y\rangle

to be identically zero. Since the functional is obviously bounded, the definition of

A*

assures that

y\inD(A*).

The fact that, for every

x\inD(A),

\langleAx,y\rangle=\langlex,A*y\rangle=0

shows that

A*y\inD(A)\perp=\overline{D(A)}\perp=\{0\},

given that

D(A)

is dense.

This property shows that

\operatorname{ker}A*

is a topologically closed subspace even when

D(A*)

is not.

Geometric interpretation

If

H1

and

H2

are Hilbert spaces, then

H1H2

is a Hilbert space with the inner product

l\langle(a,b),(c,d)r

\rangle
H1H2

\stackrel{def

} \langle a,c \rangle_ + \langle b,d \rangle_,

where

a,c\inH1

and

b,d\inH2.

Let

J\colonHH\toHH

be the symplectic mapping, i.e.

J(\xi,η)=(,\xi).

Then the graph

G(A*)=\{(x,y)\midx\inD(A*),y=A*x\}\subseteqHH

of

A*

is the orthogonal complement of

JG(A):

G(A*)=(JG(A))\perp=\{(x,y)\inHH:l\langle(x,y),(-A\xi,\xi)r\rangleH=0  \forall\xi\inD(A)\}.

The assertion follows from the equivalences

l\langle(x,y),(-A\xi,\xi)r\rangle=0\Leftrightarrow\langleA\xi,x\rangle=\langle\xi,y\rangle,

and

l[\forall\xi\inD(A)  \langleA\xi,x\rangle=\langle\xi,y\rangler]\Leftrightarrowx\inD(A*)\&y=A*x.

Corollaries

A is closed

An operator

A

is closed if the graph

G(A)

is topologically closed in

HH.

The graph

G(A*)

of the adjoint operator

A*

is the orthogonal complement of a subspace, and therefore is closed.
A is densely defined ⇔ A is closable

An operator

A

is closable if the topological closure

Gcl(A)\subseteqHH

of the graph

G(A)

is the graph of a function. Since

Gcl(A)

is a (closed) linear subspace, the word "function" may be replaced with "linear operator". For the same reason,

A

is closable if and only if

(0,v)\notinGcl(A)

unless

v=0.

The adjoint

A*

is densely defined if and only if

A

is closable. This follows from the fact that, for every

v\inH,

v\inD(A*)\perp\Leftrightarrow(0,v)\inGcl(A),

which, in turn, is proven through the following chain of equivalencies:

\begin{align} v\inD(A*)\perp&\Longleftrightarrow(v,0)\inG(A*)\perp\Longleftrightarrow(v,0)\in(JG(A))cl=JGcl(A)\\ &\Longleftrightarrow(0,-v)=J-1(v,0)\inGcl(A)\\ &\Longleftrightarrow(0,v)\inGcl(A). \end{align}

A = A

The closure

Acl

of an operator

A

is the operator whose graph is

Gcl(A)

if this graph represents a function. As above, the word "function" may be replaced with "operator". Furthermore,

A**=Acl,

meaning that

G(A**)=Gcl(A).

To prove this, observe that

J*=-J,

i.e.

\langleJx,y\rangleH=-\langlex,Jy\rangleH,

for every

x,y\inHH.

Indeed,

\begin{align} \langleJ(x1,x2),(y1,y2)\rangleH&=\langle(-x2,x1),(y1,y2)\rangleH=\langle-x2,y1\rangleH+\langlex1,y2\rangleH\\ &=\langlex1,y2\rangleH+\langlex2,-y1\rangleH =\langle(x1,x2),-J(y1,y2)\rangleH. \end{align}

In particular, for every

y\inHH

and every subspace

V\subseteqHH,

y\in(JV)\perp

if and only if

Jy\inV\perp.

Thus,

J[(JV)\perp]=V\perp

and

[J[(JV)\perp]]\perp=Vcl.

Substituting

V=G(A),

obtain

Gcl(A)=G(A**).

A = (A)

For a closable operator

A,

A*=\left(Acl\right)*,

meaning that

G(A*)=G\left(\left(Acl\right)*\right).

Indeed,

G\left(\left(Acl\right)*\right)=\left(JGcl(A)\right)\perp=\left(\left(JG(A)\right)cl\right)\perp=(JG(A))\perp=G(A*).

Counterexample where the adjoint is not densely defined

Let

H=L2(R,l),

where

l

is the linear measure. Select a measurable, bounded, non-identically zero function

f\notinL2,

and pick

\varphi0\inL2\setminus\{0\}.

Define

A\varphi=\langlef,\varphi\rangle\varphi0.

It follows that

D(A)=\{\varphi\inL2\mid\langlef,\varphi\rangleinfty\}.

The subspace

D(A)

contains all the

L2

functions with compact support. Since

1[-n,n]\varphi\stackrel{L2}{\to}\varphi,

A

is densely defined. For every

\varphi\inD(A)

and

\psi\inD(A*),

\langle\varphi,A*\psi\rangle=\langleA\varphi,\psi\rangle=\langle\langlef,\varphi\rangle\varphi0,\psi\rangle=\langlef,\varphi\rangle\langle\varphi0,\psi\rangle=\langle\varphi,\langle\varphi0,\psi\ranglef\rangle.

Thus,

A*\psi=\langle\varphi0,\psi\ranglef.

The definition of adjoint operator requires that

ImA*\subseteqH=L2.

Since

f\notinL2,

this is only possible if

\langle\varphi0,\psi\rangle=0.

For this reason,

D(A*)=

\perp.
\{\varphi
0\}
Hence,

A*

is not densely defined and is identically zero on

D(A*).

As a result,

A

is not closable and has no second adjoint

A**.

Hermitian operators

A bounded operator is called Hermitian or self-adjoint if

A=A*

which is equivalent to

\langleAx,y\rangle=\langlex,Ay\rangleforallx,y\inH.

In some sense, these operators play the role of the real numbers (being equal to their own "complex conjugate") and form a real vector space. They serve as the model of real-valued observables in quantum mechanics. See the article on self-adjoint operators for a full treatment.

Adjoints of conjugate-linear operators

For a conjugate-linear operator the definition of adjoint needs to be adjusted in order to compensate for the complex conjugation. An adjoint operator of the conjugate-linear operator on a complex Hilbert space is an conjugate-linear operator with the property:

\langleAx,y\rangle=\overline{\left\langlex,A*y\right\rangle}forallx,y\inH.

Other adjoints

The equation

\langleAx,y\rangle=\left\langlex,A*y\right\rangle

is formally similar to the defining properties of pairs of adjoint functors in category theory, and this is where adjoint functors got their name from.

See also

References

Notes and References

  1. Book: Miller, David A. B. . Quantum Mechanics for Scientists and Engineers . Cambridge University Press . 2008 . 262, 280.
  2. See unbounded operator for details.