Operator (physics) explained

An operator is a function over a space of physical states onto another space of states. The simplest example of the utility of operators is the study of symmetry (which makes the concept of a group useful in this context). Because of this, they are useful tools in classical mechanics. Operators are even more important in quantum mechanics, where they form an intrinsic part of the formulation of the theory.

Operators in classical mechanics

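In classical mechanics, the motion of a particle (or system of particles) is determined by the Lagrangian L(q, \dot{q}, t) or, equivalently, the Hamiltonian H(q, p, t), a function of the generalized coordinates q, the generalized velocities \dot{q} = dq/dt and the conjugate momenta:

p = \frac{\partial L}{\partial \dot{q}}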

If either L or H is independent of a generalized coordinate q, meaning the L and H do not change when q is changed, which in turn means the dynamics of the particle are still the same even when q changes, the corresponding momenta conjugate to those coordinates will be conserved (this is part of Noether's theorem, and the invariance of motion with respect to the coordinate q is a symmetry). Operators in classical mechanics are related to these symmetries.

More technically, when H is invariant under the action of a certain group of transformations G:

S \in G, \quad H(S(q,p)) = H(q,p).

The elements of G are physical operators, which map physical states among themselves.
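The invariance condition can be checked numerically. The sketch below is only an illustration: it assumes a two-dimensional isotropic harmonic oscillator as the Hamiltonian and takes S to be a rotation acting on both q and p. The function names and the parameter values are hypothetical choices, not part of the text above.

```python
import math

def hamiltonian(q, p, m=1.0, k=1.0):
    """H = |p|^2 / (2m) + k |q|^2 / 2 for a 2D isotropic oscillator (assumed example)."""
    kinetic = (p[0] ** 2 + p[1] ** 2) / (2.0 * m)
    potential = 0.5 * k * (q[0] ** 2 + q[1] ** 2)
    return kinetic + potential

def rotate(v, theta):
    """Rotate a 2D vector counterclockwise by the angle theta."""
    c, s = math.cos(theta), math.sin(theta)
    return (c * v[0] - s * v[1], s * v[0] + c * v[1])

def apply_rotation(state, theta):
    """The operator S maps the phase-space point (q, p) by rotating both vectors."""
    q, p = state
    return rotate(q, theta), rotate(p, theta)

state = ((1.0, 2.0), (0.5, -0.3))
rotated = apply_rotation(state, 0.7)

h_before = hamiltonian(*state)
h_after = hamiltonian(*rotated)
print(h_before, h_after)  # the two values agree up to rounding
```

Replacing the potential with one that depends on direction (for example k_x different from k_y) would break the equality, which is the numerical signature of a lost rotational symmetry.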

Table of classical mechanics operators

Transformation	Operator	Position	Momentum
Translational symmetry	X(a)	r → r+a	p → p
Time translation symmetry	U(t₀)	r(t) → r(t+t₀)	p(t) → p(t+t₀)
Rotational invariance	R(\hat{\boldsymbol{n}},\theta)	r → R(\hat{\boldsymbol{n}},\theta) r	p → R(\hat{\boldsymbol{n}},\theta) p
Galilean transformations	G(v)	r → r+vt	p → p+mv
Parity	P	r → -r	p → -p
T-symmetry	T	r → r(-t)	p → -p(-t)

where

R(\hat{\boldsymbol{n}},\theta)

is the rotation matrix about an axis defined by the unit vector

\hat{\boldsymbol{n}}

and angle θ.

Generators

If the transformation is infinitesimal, the operator action should be of the form

I + \epsilon A,

where I is the identity operator, \epsilon is a parameter with a small value, and A depends on the transformation at hand and is called a generator of the group. As a simple example, we derive the generator of the space translations on 1D functions.

The translation operator T_a acts on a function f as

T_a f(x) = f(x-a).

If a = \epsilon is infinitesimal, then we may write

T_\epsilon f(x) = f(x-\epsilon) \approx f(x) - \epsilon f'(x).

This formula may be rewritten as

T_\epsilon f(x) = (I - \epsilon D) f(x),

where D = d/dx is the generator of the translation group, which in this case happens to be the derivative operator. Thus, it is said that the generator of translations is the derivative.
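The first-order formula can be compared with the exact translation numerically. This is a minimal sketch, assuming f = sin and a central-difference approximation for D; the helper names and the step size h are illustrative assumptions.

```python
import math

def translate(f, a):
    """Exact translation operator: (T_a f)(x) = f(x - a)."""
    return lambda x: f(x - a)

def derivative(f, h=1e-6):
    """Central-difference approximation of D = d/dx (h is an assumed step size)."""
    return lambda x: (f(x + h) - f(x - h)) / (2.0 * h)

def infinitesimal_translate(f, eps):
    """First-order form: ((I - eps D) f)(x) = f(x) - eps f'(x)."""
    df = derivative(f)
    return lambda x: f(x) - eps * df(x)

x0 = 0.4
errors = []
for eps in (1e-1, 1e-2, 1e-3):
    exact = translate(math.sin, eps)(x0)
    first_order = infinitesimal_translate(math.sin, eps)(x0)
    errors.append(abs(exact - first_order))

print(errors)  # the error shrinks roughly like eps**2
```

The quadratic decay of the error reflects the neglected second-order Taylor term, which is why the approximation becomes exact only in the infinitesimal limit.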

The exponential map

The whole group may be recovered, under normal circumstances, from the generators, via the exponential map. In the case of the translations the idea works like this.

The translation for a finite value of a may be obtained by repeated application of the infinitesimal translation:

T_a f(x) = \lim_{N\to\infty} T_{a/N} \cdots T_{a/N} f(x)

with the \cdots standing for the application N times. If N is large, each of the factors may be considered to be infinitesimal:

T_a f(x) = \lim_{N\to\infty} \left(I - \frac{a}{N} D\right)^N f(x).

But this limit may be rewritten as an exponential:

T_af(x)=\exp(-aD)f(x).

To be convinced of the validity of this formal expression, we may expand the exponential in a power series:

T_a f(x) = \left(I - aD + \frac{a^2 D^2}{2!} - \frac{a^3 D^3}{3!} + \cdots\right) f(x).

The right-hand side may be rewritten as

f(x) - a f'(x) + \frac{a^2}{2!} f''(x) - \frac{a^3}{3!} f^{(3)}(x) + \cdots

which is just the Taylor expansion of f(x-a), which was our original value for T_a f(x).
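For a polynomial the exponential series terminates, so the identity exp(-aD) f(x) = f(x-a) can be verified exactly up to rounding. The following sketch assumes a polynomial stored as a list of coefficients; the helper names and the sample values are hypothetical.

```python
import math

def poly_eval(coeffs, x):
    """Evaluate sum_k coeffs[k] * x**k."""
    return sum(c * x ** k for k, c in enumerate(coeffs))

def poly_derivative(coeffs):
    """Apply D = d/dx to a polynomial given by its coefficient list."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def exp_minus_aD(coeffs, a, x):
    """Sum the series (I - aD + a^2 D^2/2! - ...) f at the point x.

    The loop stops once the repeated derivative is the zero polynomial.
    """
    total = 0.0
    term_coeffs = list(coeffs)
    n = 0
    while term_coeffs:
        total += (-a) ** n / math.factorial(n) * poly_eval(term_coeffs, x)
        term_coeffs = poly_derivative(term_coeffs)
        n += 1
    return total

f = [1.0, -2.0, 0.5, 3.0]   # f(x) = 1 - 2x + 0.5x^2 + 3x^3
a, x = 0.75, 1.2

series_value = exp_minus_aD(f, a, x)
shifted_value = poly_eval(f, x - a)   # f(x - a)
print(series_value, shifted_value)
```

For non-polynomial functions the same loop would need a truncation order, and agreement would then hold only within the radius of convergence of the Taylor series.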

The mathematical properties of physical operators are a topic of great importance in itself. For further information, see C*-algebra and Gelfand–Naimark theorem.

Operators in quantum mechanics

The mathematical formulation of quantum mechanics (QM) is built upon the concept of an operator.

Physical pure states in quantum mechanics are represented as unit-norm vectors (probabilities are normalized to one) in a special complex Hilbert space. Time evolution in this vector space is given by the application of the evolution operator.

Any observable, i.e., any quantity which can be measured in a physical experiment, should be associated with a self-adjoint linear operator. The operators must yield real eigenvalues, since they are values which may come up as the result of the experiment. Mathematically this means the operators must be Hermitian.^[1] The probability of each eigenvalue is related to the projection of the physical state on the subspace related to that eigenvalue. See below for mathematical details about Hermitian operators.

In the wave mechanics formulation of QM, the wavefunction varies with space and time, or equivalently momentum and time (see position and momentum space for details), so observables are differential operators.

In the matrix mechanics formulation, the norm of the physical state should stay fixed, so the evolution operator should be unitary, and the operators can be represented as matrices. Any other symmetry, mapping a physical state into another, should keep this restriction.

Wavefunction

See main article: wavefunction.

The wavefunction must be square-integrable (see L^p spaces), meaning:

\iiint_{\mathbb{R}^3} |\psi(\mathbf{r})|^2 \, d^3\mathbf{r} = \iiint_{\mathbb{R}^3} \psi(\mathbf{r})^* \psi(\mathbf{r}) \, d^3\mathbf{r} < \infty

and normalizable, so that:

\iiint_{\mathbb{R}^3} |\psi(\mathbf{r})|^2 \, d^3\mathbf{r} = 1
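The normalization step can be sketched numerically in one dimension, as a simplified analogue of the three-dimensional integral. The Gaussian wave packet, the integration window and the trapezoidal rule below are all illustrative assumptions.

```python
import math

def integrate(f, lo, hi, n=20000):
    """Trapezoidal rule on [lo, hi] with n equal panels."""
    h = (hi - lo) / n
    total = 0.5 * (f(lo) + f(hi))
    for i in range(1, n):
        total += f(lo + i * h)
    return total * h

def unnormalized_psi(x):
    """A Gaussian envelope times a phase factor (assumed example)."""
    return math.exp(-x * x / 2.0) * complex(math.cos(3.0 * x), math.sin(3.0 * x))

# Square-integrability: the integral of |psi|^2 is finite.
norm_sq = integrate(lambda x: abs(unnormalized_psi(x)) ** 2, -10.0, 10.0)

def psi(x):
    """Normalized wavefunction, rescaled so the total probability is one."""
    return unnormalized_psi(x) / math.sqrt(norm_sq)

total_probability = integrate(lambda x: abs(psi(x)) ** 2, -10.0, 10.0)
print(norm_sq, total_probability)
```

The phase factor drops out of |psi|^2, so the finite value found for `norm_sq` is that of the Gaussian envelope alone.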

Two cases of eigenstates (and eigenvalues) are:

for discrete eigenstates |\phi_i\rangle forming a discrete basis, any state is a sum

|\psi\rangle = \sum_i c_i |\phi_i\rangle

where the c_i are complex numbers such that |c_i|^2 = c_i^* c_i is the probability of measuring the state |\phi_i\rangle, and the corresponding set of eigenvalues a_i is also discrete, either finite or countably infinite. In this case, the inner product of two eigenstates is given by

\langle\phi_i\vert\phi_j\rangle = \delta_{ij}

where \delta_{mn} denotes the Kronecker delta. However,

for a continuum of eigenstates |\phi\rangle forming a continuous basis, any state is an integral

|\psi\rangle = \int c(\phi) \, |\phi\rangle \, d\phi

where c(\phi) is a complex function such that |c(\phi)|^2 = c(\phi)^* c(\phi) is the probability density of measuring the state |\phi\rangle, and there is an uncountably infinite set of eigenvalues a. In this case, the inner product of two eigenstates is defined as

\langle\phi'\vert\phi\rangle = \delta(\phi - \phi')

where \delta(x-y) denotes the Dirac delta.
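For the discrete case, the link between expansion coefficients and probabilities can be sketched as follows. The three coefficients are made-up values chosen for illustration, and the helper names are hypothetical.

```python
import math

def normalize(coeffs):
    """Rescale complex coefficients c_i so that sum |c_i|^2 = 1."""
    norm = math.sqrt(sum(abs(c) ** 2 for c in coeffs))
    return [c / norm for c in coeffs]

def probabilities(coeffs):
    """Probability |c_i|^2 = c_i^* c_i of finding each eigenstate."""
    return [(c.conjugate() * c).real for c in coeffs]

# Unnormalized coefficients of |psi> in the basis |phi_1>, |phi_2>, |phi_3>.
c = normalize([1 + 1j, 2j, complex(-1.0)])
probs = probabilities(c)
print(probs, sum(probs))
```

Only the moduli of the coefficients enter the probabilities; the relative phases matter once a different observable, with a different eigenbasis, is measured.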

Linear operators in wave mechanics

See main article: Wave function and Bra–ket notation.

Let ψ be the wavefunction for a quantum system, and \hat{A} be any linear operator for some observable A (such as position, momentum, energy, angular momentum etc.). If ψ is an eigenfunction of the operator \hat{A}, then

\hat{A}\psi = a\psi,

where a is the eigenvalue of the operator, corresponding to the measured value of the observable, i.e. observable A has a measured value a.

If ψ is an eigenfunction of a given operator \hat{A}, then a definite quantity (the eigenvalue a) will be observed if a measurement of the observable A is made on the state ψ. Conversely, if ψ is not an eigenfunction of \hat{A}, then it has no eigenvalue for \hat{A}, and the observable does not have a single definite value in that case. Instead, measurements of the observable A will yield each eigenvalue with a certain probability (related to the decomposition of ψ relative to the orthonormal eigenbasis of \hat{A}).

In bra–ket notation the above can be written:

\begin{align} \hat{A}\psi &= \hat{A}\psi(\mathbf{r}) = \hat{A}\left\langle\mathbf{r}\mid\psi\right\rangle = \left\langle\mathbf{r}\left\vert\hat{A}\right\vert\psi\right\rangle \\ a\psi &= a\psi(\mathbf{r}) = a\left\langle\mathbf{r}\mid\psi\right\rangle = \left\langle\mathbf{r}\mid a\mid\psi\right\rangle \end{align}

that are equal if \left|\psi\right\rangle is an eigenvector, or eigenket, of the observable A.

Due to linearity, vector operators can be defined in any number of dimensions, as each component of the vector acts on the function separately. One mathematical example is the del operator, which is itself a vector (useful in momentum-related quantum operators, in the table below).

An operator in n-dimensional space can be written:

\hat{\mathbf{A}} = \sum_{j=1}^n \mathbf{e}_j \hat{A}_j

where e_j are basis vectors corresponding to each component operator A_j. Each component will yield a corresponding eigenvalue a_j. Acting this on the wave function ψ:

\hat{\mathbf{A}} \psi = \left(\sum_{j=1}^n \mathbf{e}_j \hat{A}_j\right)\psi = \sum_{j=1}^n \left(\mathbf{e}_j \hat{A}_j \psi\right) = \sum_{j=1}^n \left(\mathbf{e}_j a_j \psi\right)

in which we have used

\hat{A}_j\psi = a_j\psi.

In bra–ket notation:

\begin{align} \hat{\mathbf{A}}\psi = \hat{\mathbf{A}}\psi(\mathbf{r}) = \hat{\mathbf{A}}\left\langle\mathbf{r}\mid\psi\right\rangle &= \left\langle\mathbf{r}\left\vert\hat{\mathbf{A}}\right\vert\psi\right\rangle \\ \left(\sum_{j=1}^n \mathbf{e}_j\hat{A}_j\right)\psi = \left(\sum_{j=1}^n \mathbf{e}_j\hat{A}_j\right)\psi(\mathbf{r}) = \left(\sum_{j=1}^n \mathbf{e}_j\hat{A}_j\right)\left\langle\mathbf{r}\mid\psi\right\rangle &= \left\langle\mathbf{r}\left\vert\sum_{j=1}^n \mathbf{e}_j\hat{A}_j\right\vert\psi\right\rangle \end{align}

Commutation of operators on Ψ

See main article: Commutator.

If two observables A and B have linear operators

\hat{A}

and

\hat{B}

, the commutator is defined by,

\left[\hat{A},\hat{B}\right]=\hat{A}\hat{B}-\hat{B}\hat{A}

The commutator is itself a (composite) operator. Acting the commutator on ψ gives:

\left[\hat{A},\hat{B}\right]\psi=\hat{A}\hat{B}\psi-\hat{B}\hat{A}\psi.

If ψ is an eigenfunction with eigenvalues a and b for observables A and B respectively, and if the operators commute:

\left[\hat{A},\hat{B}\right]\psi=0,

then the observables A and B can be measured simultaneously with infinite precision, i.e., the uncertainties satisfy \Delta A = 0 and \Delta B = 0 simultaneously. ψ is then said to be a simultaneous eigenfunction of A and B. To illustrate this:

\begin{align} \left[\hat{A},\hat{B}\right]\psi&=\hat{A}\hat{B}\psi-\hat{B}\hat{A}\psi\\ &=a(b\psi)-b(a\psi)\\ &=0.\\ \end{align}

It shows that measurement of A and B does not cause any shift of state, i.e., initial and final states are same (no disturbance due to measurement). Suppose we measure A to get value a. We then measure B to get the value b. We measure A again. We still get the same value a. Clearly the state (ψ) of the system is not destroyed and so we are able to measure A and B simultaneously with infinite precision.

If the operators do not commute:

\left[\hat{A},\hat{B}\right]\psi \neq 0,

they cannot be prepared simultaneously to arbitrary precision, and there is an uncertainty relation between the observables

\Delta A \, \Delta B \geq \left| \frac{1}{2} \langle [A,B] \rangle \right|

The above relation holds even if ψ is an eigenfunction. Notable pairs are the position-and-momentum and energy-and-time uncertainty relations, and the angular momenta (spin, orbital and total) about any two orthogonal axes (such as L_x and L_y, or s_y and s_z, etc.).
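A non-vanishing commutator can be computed explicitly for spin components. The sketch below uses the standard Pauli matrices, working in units where the spin operators are the Pauli matrices themselves (that is, dropping the factor ħ/2); the helper names are hypothetical.

```python
def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutator(a, b):
    """[A, B] = AB - BA."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]

sigma_x = [[0, 1], [1, 0]]
sigma_y = [[0, -1j], [1j, 0]]
sigma_z = [[1, 0], [0, -1]]

comm_xy = commutator(sigma_x, sigma_y)   # equals 2i * sigma_z
comm_zz = commutator(sigma_z, sigma_z)   # an operator commutes with itself
print(comm_xy, comm_zz)
```

Because the commutator of sigma_x and sigma_y is proportional to sigma_z rather than zero, no state is a simultaneous eigenvector of both, in line with the uncertainty relation above.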

Expectation values of operators on Ψ

The expectation value (equivalently the average or mean value) is the average measurement of an observable, for a particle in region R. The expectation value

\left\langle\hat{A}\right\rangle

of the operator

\hat{A}

is calculated from:^[2]

\left\langle\hat{A}\right\rangle=\int_R\psi^*\left(r\right)\hat{A}\psi\left(r\right)d^3r=\left\langle\psi\left|\hat{A}\right|\psi\right\rangle.

This can be generalized to any function F of an operator:

\left\langle F\left(\hat{A}\right)\right\rangle = \int_R \psi(r)^* \left[F\left(\hat{A}\right)\psi(r)\right] d^3r = \left\langle\psi\left|F\left(\hat{A}\right)\right|\psi\right\rangle.

An example of F is the 2-fold action of A on ψ, i.e. squaring an operator or doing it twice:

\begin{align} F\left(\hat{A}\right) &= \hat{A}^2 \\ \Rightarrow \left\langle\hat{A}^2\right\rangle &= \int_R \psi^*\left(r\right)\hat{A}^2\psi\left(r\right)d^3r = \left\langle\psi\left\vert\hat{A}^2\right\vert\psi\right\rangle \end{align}
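In a finite-dimensional setting the integrals become sums, and the same expectation values can be evaluated directly. This sketch assumes a two-component state and takes the Pauli matrix sigma_z as the operator; both are illustrative choices, and the helper names are hypothetical.

```python
def apply(matrix, vec):
    """Matrix-vector product A|psi>."""
    return [sum(matrix[i][j] * vec[j] for j in range(len(vec)))
            for i in range(len(matrix))]

def inner(bra, ket):
    """<bra|ket>, with complex conjugation applied to the bra components."""
    return sum(b.conjugate() * k for b, k in zip(bra, ket))

def expectation(matrix, psi):
    """<psi|A|psi>, which is real for a Hermitian matrix."""
    return inner(psi, apply(matrix, psi)).real

sigma_z = [[1, 0], [0, -1]]
psi = [complex(0.6), complex(0, 0.8)]   # normalized: 0.36 + 0.64 = 1

mean = expectation(sigma_z, psi)                                  # <A>
mean_sq = inner(psi, apply(sigma_z, apply(sigma_z, psi))).real    # <A^2>
variance = mean_sq - mean ** 2
print(mean, mean_sq, variance)
```

The square root of `variance` is the uncertainty ΔA that enters the uncertainty relation.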

Hermitian operators

See main article: Self-adjoint operator.

The definition of a Hermitian operator is:^[1]

\hat{A}=\hat{A}^\dagger

Following from this, in bra–ket notation:

\left\langle\phi_i\left|\hat{A}\right|\phi_j\right\rangle=\left\langle\phi_j\left|\hat{A}\right|\phi_i\right\rangle^*.

Important properties of Hermitian operators include:

real eigenvalues,
eigenvectors with different eigenvalues are orthogonal,
eigenvectors can be chosen to form a complete orthonormal basis.
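The first two properties can be checked on a small example. The 2 × 2 matrix below is an arbitrary Hermitian matrix chosen for illustration, and the closed-form eigenvalue and eigenvector helpers are assumptions that apply only to 2 × 2 matrices with a nonzero off-diagonal entry.

```python
import cmath

def dagger(m):
    """Conjugate transpose of a matrix stored as nested lists."""
    return [[m[j][i].conjugate() for j in range(len(m))] for i in range(len(m[0]))]

def is_hermitian(m, tol=1e-12):
    d = dagger(m)
    n = len(m)
    return all(abs(m[i][j] - d[i][j]) < tol for i in range(n) for j in range(n))

def eigenvalues_2x2(m):
    """Roots of lambda^2 - tr(m) lambda + det(m) = 0."""
    tr = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    disc = cmath.sqrt(tr * tr - 4 * det)
    return (tr - disc) / 2, (tr + disc) / 2

def eigenvector_2x2(m, lam):
    """An unnormalized eigenvector for eigenvalue lam, valid when m[0][1] != 0."""
    return [m[0][1], lam - m[0][0]]

def inner(u, v):
    """<u|v> with conjugation on the first argument."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

A = [[2 + 0j, 1 - 1j], [1 + 1j, 3 + 0j]]
lams = eigenvalues_2x2(A)
v1 = eigenvector_2x2(A, lams[0])
v2 = eigenvector_2x2(A, lams[1])
overlap = inner(v1, v2)
print(is_hermitian(A), lams, overlap)
```

Both eigenvalues come out with zero imaginary part, and the overlap of the two eigenvectors vanishes, as expected for distinct eigenvalues of a Hermitian matrix.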

Operators in matrix mechanics

An operator can be written in matrix form to map one basis vector to another. Since the operators are linear, the matrix is a linear transformation (aka transition matrix) between bases. Each basis element \phi_j can be connected to another,^[2] by the expression:

A_{ij} = \left\langle\phi_i\left|\hat{A}\right|\phi_j\right\rangle,

which is a matrix element:

\hat{A} = \begin{pmatrix} A_{11} & A_{12} & \cdots & A_{1n} \\ A_{21} & A_{22} & \cdots & A_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ A_{n1} & A_{n2} & \cdots & A_{nn} \end{pmatrix}

A further property of a Hermitian operator is that eigenfunctions corresponding to different eigenvalues are orthogonal.^[1] In matrix form, operators allow real eigenvalues to be found, corresponding to measurements. Orthogonality allows a suitable basis set of vectors to represent the state of the quantum system. The eigenvalues of the operator are also evaluated in the same way as for the square matrix, by solving the characteristic polynomial:

\det\left(\hat{A}-a\hat{I}\right)=0,

where I is the n × n identity matrix; as an operator it corresponds to the identity operator. For a discrete basis:

\hat{I} = \sum_i |\phi_i\rangle\langle\phi_i|

while for a continuous basis:

\hat{I}=\int|\phi\rangle\langle\phi|d\phi
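Both the characteristic polynomial and the discrete resolution of the identity can be illustrated with the Pauli matrix sigma_x, whose eigenvalues are +1 and -1. The sketch is restricted to the 2 × 2 case, and the helper names are hypothetical.

```python
import math

def char_poly_2x2(m, a):
    """det(m - a I) for a 2x2 matrix."""
    return (m[0][0] - a) * (m[1][1] - a) - m[0][1] * m[1][0]

def outer(ket, bra_source):
    """|ket><bra_source| as a matrix: entry (i, j) = ket[i] * conj(bra_source[j])."""
    return [[k * complex(b).conjugate() for b in bra_source] for k in ket]

def add(a, b):
    """Entrywise sum of two matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

sigma_x = [[0, 1], [1, 0]]

s = 1 / math.sqrt(2)
phi_plus = [s, s]      # eigenvector of sigma_x with eigenvalue +1
phi_minus = [s, -s]    # eigenvector of sigma_x with eigenvalue -1

# Sum of projectors onto an orthonormal eigenbasis reproduces the identity.
identity = add(outer(phi_plus, phi_plus), outer(phi_minus, phi_minus))
print(char_poly_2x2(sigma_x, 1), char_poly_2x2(sigma_x, -1), identity)
```

The characteristic polynomial vanishes at both eigenvalues, and the summed projectors give the identity matrix up to rounding.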

Inverse of an operator

A non-singular operator

\hat{A}

has an inverse

\hat{A}^{-1}

defined by:

\hat{A}\hat{A}^{-1} = \hat{A}^{-1}\hat{A} = \hat{I}

If an operator has no inverse, it is a singular operator. In a finite-dimensional space, an operator is non-singular if and only if its determinant is nonzero:

\det\left(\hat{A}\right) \neq 0

and hence the determinant is zero for a singular operator.
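The determinant criterion can be sketched for 2 × 2 matrices, where the inverse has a closed form. The matrices below are arbitrary illustrative values, and the helper names are hypothetical.

```python
def det_2x2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def inverse_2x2(m):
    """Inverse of a 2x2 matrix; raises ValueError for a singular operator."""
    d = det_2x2(m)
    if d == 0:
        raise ValueError("singular operator: determinant is zero")
    return [[m[1][1] / d, -m[0][1] / d],
            [-m[1][0] / d, m[0][0] / d]]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

A = [[2.0, 1.0], [1.0, 1.0]]        # determinant 1, so non-singular
A_inv = inverse_2x2(A)
product = matmul(A, A_inv)          # should be the identity matrix

singular = [[1.0, 2.0], [2.0, 4.0]]  # determinant 0, so no inverse exists
try:
    inverse_2x2(singular)
    singular_rejected = False
except ValueError:
    singular_rejected = True

print(A_inv, product, singular_rejected)
```

The exact comparison `d == 0` is adequate for this small example; with general floating-point data a tolerance would usually be a safer test.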

Table of QM operators

The operators used in quantum mechanics are collected in the table below (see for example^[1] ^[3]). The bold-face vectors with circumflexes are not unit vectors; they are 3-vector operators, with all three spatial components taken together.

Operator (common name/s)	Cartesian component	General definition	SI unit	Dimension
Position	\hat{x} = x, \quad \hat{y} = y, \quad \hat{z} = z	\hat{\mathbf{r}} = \mathbf{r}	m	[L]
Momentum (general)	\hat{p}_x = -i\hbar\frac{\partial}{\partial x}, \quad \hat{p}_y = -i\hbar\frac{\partial}{\partial y}, \quad \hat{p}_z = -i\hbar\frac{\partial}{\partial z}	\hat{\mathbf{p}} = -i\hbar\nabla	J s m⁻¹ = N s	[M] [L] [T]⁻¹
Momentum (electromagnetic field; uses kinetic momentum; A, vector potential)	\hat{p}_x = -i\hbar\frac{\partial}{\partial x} - qA_x, \quad \hat{p}_y = -i\hbar\frac{\partial}{\partial y} - qA_y, \quad \hat{p}_z = -i\hbar\frac{\partial}{\partial z} - qA_z	\hat{\mathbf{p}} = -i\hbar\nabla - q\mathbf{A}	J s m⁻¹ = N s	[M] [L] [T]⁻¹
Kinetic energy (translation)	\hat{T}_x = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2}, \quad \hat{T}_y = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial y^2}, \quad \hat{T}_z = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial z^2}	\hat{T} = \frac{1}{2m}\hat{\mathbf{p}}\cdot\hat{\mathbf{p}} = \frac{1}{2m}(-i\hbar\nabla)\cdot(-i\hbar\nabla) = -\frac{\hbar^2}{2m}\nabla^2	J	[M] [L]² [T]⁻²
Kinetic energy (electromagnetic field; A, vector potential)	\hat{T}_x = \frac{1}{2m}\left(-i\hbar\frac{\partial}{\partial x} - qA_x\right)^2, \quad \hat{T}_y = \frac{1}{2m}\left(-i\hbar\frac{\partial}{\partial y} - qA_y\right)^2, \quad \hat{T}_z = \frac{1}{2m}\left(-i\hbar\frac{\partial}{\partial z} - qA_z\right)^2	\hat{T} = \frac{1}{2m}\hat{\mathbf{p}}\cdot\hat{\mathbf{p}} = \frac{1}{2m}(-i\hbar\nabla - q\mathbf{A})\cdot(-i\hbar\nabla - q\mathbf{A}) = \frac{1}{2m}(-i\hbar\nabla - q\mathbf{A})^2	J	[M] [L]² [T]⁻²
Kinetic energy (rotation; I, moment of inertia)	\hat{T}_{xx} = \frac{\hat{J}_x^2}{2I_{xx}}, \quad \hat{T}_{yy} = \frac{\hat{J}_y^2}{2I_{yy}}, \quad \hat{T}_{zz} = \frac{\hat{J}_z^2}{2I_{zz}}	\hat{T} = \frac{\hat{\mathbf{J}}\cdot\hat{\mathbf{J}}}{2I}	J	[M] [L]² [T]⁻²
Potential energy	N/A	\hat{V} = V\left(\mathbf{r},t\right) = V	J	[M] [L]² [T]⁻²
Total energy	N/A	Time-dependent potential: \hat{E} = i\hbar\frac{\partial}{\partial t}; time-independent: \hat{E} = E	J	[M] [L]² [T]⁻²
Hamiltonian		\hat{H} = \hat{T} + \hat{V} = \frac{1}{2m}\hat{\mathbf{p}}\cdot\hat{\mathbf{p}} + V = \frac{1}{2m}\hat{p}^2 + V	J	[M] [L]² [T]⁻²
Angular momentum operator	\hat{L}_x = -i\hbar\left(y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\right), \quad \hat{L}_y = -i\hbar\left(z\frac{\partial}{\partial x} - x\frac{\partial}{\partial z}\right), \quad \hat{L}_z = -i\hbar\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right)	\hat{\mathbf{L}} = \mathbf{r} \times (-i\hbar\nabla)	J s = N s m	[M] [L]² [T]⁻¹
Spin angular momentum	\hat{S}_x = \frac{\hbar}{2}\sigma_x, \quad \hat{S}_y = \frac{\hbar}{2}\sigma_y, \quad \hat{S}_z = \frac{\hbar}{2}\sigma_z	\hat{\mathbf{S}} = \frac{\hbar}{2}\boldsymbol{\sigma}	J s = N s m	[M] [L]² [T]⁻¹
Total angular momentum	\hat{J}_x = \hat{L}_x + \hat{S}_x, \quad \hat{J}_y = \hat{L}_y + \hat{S}_y, \quad \hat{J}_z = \hat{L}_z + \hat{S}_z	\hat{\mathbf{J}} = \hat{\mathbf{L}} + \hat{\mathbf{S}} = -i\hbar\,\mathbf{r}\times\nabla + \frac{\hbar}{2}\boldsymbol{\sigma}	J s = N s m	[M] [L]² [T]⁻¹
Transition dipole moment (electric)	\hat{d}_x = q\hat{x}, \quad \hat{d}_y = q\hat{y}, \quad \hat{d}_z = q\hat{z}	\hat{\mathbf{d}} = q\mathbf{r}	C m	[I] [T] [L]

where

\sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad \sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}

are the Pauli matrices for spin-1/2 particles, and σ is the vector whose components are the Pauli matrices.

Examples of applying quantum operators

The procedure for extracting information from a wave function is as follows. Consider the momentum p of a particle as an example. The momentum operator in position basis in one dimension is:

\hat{p} = -i\hbar\frac{\partial}{\partial x}

Letting this act on ψ we obtain:

\hat{p}\psi = -i\hbar\frac{\partial}{\partial x}\psi.

If ψ is an eigenfunction of \hat{p}, then the momentum eigenvalue p is the value of the particle's momentum, found by:

-i\hbar\frac{\partial}{\partial x}\psi = p\psi.
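The eigenvalue equation can be tested numerically on a plane wave, which is the standard example of a momentum eigenfunction. The sketch below assumes natural units (ħ = 1), a wave number k = 2.5 and a central-difference derivative; all three are illustrative choices, and the function names are hypothetical.

```python
import cmath

HBAR = 1.0   # natural units, an assumption made for this sketch

def momentum_operator(psi, h=1e-5):
    """Return the function (p psi)(x) = -i hbar d(psi)/dx via a central difference."""
    return lambda x: -1j * HBAR * (psi(x + h) - psi(x - h)) / (2.0 * h)

k = 2.5

def plane_wave(x):
    """psi(x) = exp(i k x), expected to have momentum hbar * k."""
    return cmath.exp(1j * k * x)

p_psi = momentum_operator(plane_wave)
x0 = 0.3
eigenvalue_estimate = p_psi(x0) / plane_wave(x0)   # close to hbar * k
print(eigenvalue_estimate)
```

For a function that is not an eigenfunction, such as a Gaussian, the ratio (p ψ)(x) / ψ(x) would depend on x, so no single eigenvalue could be read off.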

For three dimensions the momentum operator uses the nabla operator to become:

\hat{\mathbf{p}} = -i\hbar\nabla.

In Cartesian coordinates (using the standard Cartesian basis vectors e_x, e_y, e_z) this can be written:

\mathbf{e}_x\hat{p}_x + \mathbf{e}_y\hat{p}_y + \mathbf{e}_z\hat{p}_z = -i\hbar\left(\mathbf{e}_x\frac{\partial}{\partial x} + \mathbf{e}_y\frac{\partial}{\partial y} + \mathbf{e}_z\frac{\partial}{\partial z}\right),

that is:

\hat{p}_x = -i\hbar\frac{\partial}{\partial x}, \quad \hat{p}_y = -i\hbar\frac{\partial}{\partial y}, \quad \hat{p}_z = -i\hbar\frac{\partial}{\partial z}

The process of finding eigenvalues is the same. Since this is a vector and operator equation, if ψ is an eigenfunction, then each component of the momentum operator will have an eigenvalue corresponding to that component of momentum. Acting \hat{\mathbf{p}} on ψ obtains:

\begin{align} \hat{p}_x\psi &= -i\hbar\frac{\partial}{\partial x}\psi = p_x\psi \\ \hat{p}_y\psi &= -i\hbar\frac{\partial}{\partial y}\psi = p_y\psi \\ \hat{p}_z\psi &= -i\hbar\frac{\partial}{\partial z}\psi = p_z\psi \end{align}
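As a worked illustration of the component equations, consider a plane wave with wave vector k. This is a standard textbook choice rather than something taken from the text above, and the symbols k_x, k_y, k_z are introduced here only for the example.

```latex
\begin{align}
\psi(\mathbf{r}) &= e^{i\mathbf{k}\cdot\mathbf{r}} = e^{i(k_x x + k_y y + k_z z)} \\
\hat{p}_x\psi &= -i\hbar\frac{\partial}{\partial x} e^{i(k_x x + k_y y + k_z z)} = \hbar k_x\,\psi \\
\hat{p}_y\psi &= \hbar k_y\,\psi, \qquad \hat{p}_z\psi = \hbar k_z\,\psi \\
\hat{\mathbf{p}}\psi &= \hbar\mathbf{k}\,\psi
\end{align}
```

Each component eigenvalue is p_j = ħ k_j, so the plane wave is a simultaneous eigenfunction of all three momentum components.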

Notes and References

[1] Molecular Quantum Mechanics Parts I and II: An Introduction to Quantum Chemistry (Volume 1), P.W. Atkins, Oxford University Press, 1977.
[2] Quantum Mechanics Demystified, D. McMahon, McGraw Hill (USA), 2006.
[3] Operators - The Feynman Lectures on Physics, https://feynmanlectures.caltech.edu/III_20.html
