In mathematics, spectral theory is an inclusive term for theories extending the eigenvector and eigenvalue theory of a single square matrix to a much broader theory of the structure of operators in a variety of mathematical spaces.[1] It is a result of studies of linear algebra and the solutions of systems of linear equations and their generalizations.[2] The theory is connected to that of analytic functions because the spectral properties of an operator are related to analytic functions of the spectral parameter.[3]
The name spectral theory was introduced by David Hilbert in his original formulation of Hilbert space theory, which was cast in terms of quadratic forms in infinitely many variables. The original spectral theorem was therefore conceived as a version of the theorem on principal axes of an ellipsoid, in an infinite-dimensional setting. The later discovery in quantum mechanics that spectral theory could explain features of atomic spectra was therefore fortuitous. Hilbert himself was surprised by the unexpected application of this theory, noting that "I developed my theory of infinitely many variables from purely mathematical interests, and even called it 'spectral analysis' without any presentiment that it would later find application to the actual spectrum of physics."[4]
There have been three main ways to formulate spectral theory, each of which finds use in different domains. After Hilbert's initial formulation, the later development of abstract Hilbert spaces and the spectral theory of single normal operators on them were well suited to the requirements of physics, exemplified by the work of von Neumann.[5] Further theory was built on this to address Banach algebras in general. This development leads to the Gelfand representation, which covers the commutative case, and further into non-commutative harmonic analysis.
The difference can be seen in making the connection with Fourier analysis. The Fourier transform on the real line is in one sense the spectral theory of differentiation as a differential operator. But for that to cover the phenomena one has already to deal with generalized eigenfunctions (for example, by means of a rigged Hilbert space). On the other hand, it is simple to construct a group algebra, the spectrum of which captures the Fourier transform's basic properties, and this is carried out by means of Pontryagin duality.
One can also study the spectral properties of operators on Banach spaces. For example, compact operators on Banach spaces have many spectral properties similar to that of matrices.
The background in the physics of vibrations has been explained in this way:[6]

"Spectral theory is connected with the investigation of localized vibrations of a variety of different objects, from atoms and molecules in chemistry to obstacles in acoustic waveguides. These vibrations have frequencies, and the issue is to decide when such localized vibrations occur, and how to go about computing the frequencies. This is a very complicated problem since every object has not only a fundamental tone but also a complicated series of overtones, which vary radically from one body to another."
Such physical ideas have nothing to do with the mathematical theory on a technical level, but there are examples of indirect involvement (see for example Mark Kac's question Can you hear the shape of a drum?). Hilbert's adoption of the term "spectrum" has been attributed (by Jean Dieudonné) to an 1897 paper of Wilhelm Wirtinger on the Hill differential equation, and it was taken up by his students during the first decade of the twentieth century, among them Erhard Schmidt and Hermann Weyl. The conceptual basis for Hilbert space was developed from Hilbert's ideas by Erhard Schmidt and Frigyes Riesz.[7] [8]

It was almost twenty years later, when quantum mechanics was formulated in terms of the Schrödinger equation, that the connection was made to atomic spectra; a connection with the mathematical physics of vibration had been suspected before, as remarked by Henri Poincaré, but rejected for simple quantitative reasons, absent an explanation of the Balmer series.[9] The later discovery in quantum mechanics that spectral theory could explain features of atomic spectra was therefore fortuitous, rather than being an object of Hilbert's spectral theory.
See main article: Spectrum (functional analysis). Consider a bounded linear transformation T defined everywhere over a general Banach space. We form the transformation:

R_\zeta = (\zeta I - T)^{-1} .

Here I is the identity operator and ζ is a complex number. The inverse of an operator T, that is T^{-1}, is defined by:

T T^{-1} = T^{-1} T = I .
If the inverse exists, T is called regular. If it does not exist, T is called singular.
With these definitions, the resolvent set of T is the set of all complex numbers ζ such that Rζ exists and is bounded. This set often is denoted as ρ(T). The spectrum of T is the set of all complex numbers ζ such that Rζ fails to exist or is unbounded. Often the spectrum of T is denoted by σ(T). The function Rζ for all ζ in ρ(T) (that is, wherever Rζ exists as a bounded operator) is called the resolvent of T. The spectrum of T is therefore the complement of the resolvent set of T in the complex plane.[10] Every eigenvalue of T belongs to σ(T), but σ(T) may contain non-eigenvalues.[11]
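In finite dimensions this machinery can be made concrete: for a matrix, the spectrum is exactly the set of eigenvalues, and the resolvent exists for every other ζ. A minimal numerical sketch (the 3×3 matrix is an arbitrary illustration; NumPy assumed):

```python
import numpy as np

# An arbitrary 3x3 matrix standing in for the operator T.
T = np.array([[2.0, 1.0, 0.0],
              [0.0, 3.0, 1.0],
              [0.0, 0.0, 5.0]])
I = np.eye(3)

print(np.linalg.eigvals(T))            # sigma(T) = {2, 3, 5} for this T

zeta = 4.0                             # not an eigenvalue, so zeta is in rho(T)
R = np.linalg.inv(zeta * I - T)        # resolvent R_zeta = (zeta I - T)^(-1)
print(np.linalg.norm(R, 2))            # bounded, as the definition requires

print(np.linalg.det(3.0 * I - T))      # ~0: at an eigenvalue, zeta I - T is singular
```

In infinite dimensions the spectrum is generally larger than the set of eigenvalues, so no such finite computation captures it completely.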
This definition applies to a Banach space, but of course other types of space exist as well; for example, topological vector spaces include Banach spaces, but can be more general.[12] [13] On the other hand, Banach spaces include Hilbert spaces, and it is these spaces that find the greatest application and the richest theoretical results.[14] With suitable restrictions, much can be said about the structure of the spectra of transformations in a Hilbert space. In particular, for self-adjoint operators, the spectrum lies on the real line and (in general) is a spectral combination of a point spectrum of discrete eigenvalues and a continuous spectrum.[15]
See main article: Spectral theorem.
See also: Eigenvalue, eigenvector and eigenspace. In functional analysis and linear algebra the spectral theorem establishes conditions under which an operator can be expressed in simple form as a sum of simpler operators. As a full rigorous presentation is not appropriate for this article, we take an approach that sacrifices much of the rigor of a formal treatment, with the aim of being more comprehensible to a non-specialist.
This topic is easiest to describe by introducing the bra–ket notation of Dirac for operators.[16] [17] As an example, a very particular linear operator L might be written as a dyadic product:[18] [19]
L = |k_1\rangle \langle b_1| ,

in terms of the "bra" ⟨b_1| and the "ket" |k_1⟩. A function f is described by a ket as |f⟩. The function f(x) defined on the coordinates (x_1, x_2, x_3, \dots) is written as

f(x) = \langle x, f \rangle

and the magnitude of f by

\|f\|^2 = \langle f, f \rangle = \int \langle f, x \rangle \langle x, f \rangle \, dx = \int f^*(x) f(x) \, dx ,

where the notation * denotes a complex conjugate.
The effect of L upon a function f is then described as:
L |f\rangle = |k_1\rangle \langle b_1 | f \rangle ,

expressing the result that the effect of L on f is to produce a new function |k_1⟩ multiplied by the inner product represented by ⟨b_1|f⟩.
A more general linear operator L might be expressed as:

L = \lambda_1 |e_1\rangle \langle f_1| + \lambda_2 |e_2\rangle \langle f_2| + \lambda_3 |e_3\rangle \langle f_3| + \dots ,

where the \{\lambda_i\} are scalars, the \{|e_i\rangle\} are a basis and the \{\langle f_i|\} a reciprocal basis for the space. The relation between the basis and the reciprocal basis is described, in part, by:

\langle f_i | e_j \rangle = \delta_{ij} .

If such a formalism applies, the \{\lambda_i\} are eigenvalues of L and the functions \{|e_i\rangle\} are eigenfunctions of L. The eigenvalues are in the spectrum of L.[20]
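In finite dimensions this expansion is just the diagonalization of a matrix, with the reciprocal basis supplied by the inverse of the eigenvector matrix. A small sketch (the 2×2 matrix is an arbitrary illustration; NumPy assumed):

```python
import numpy as np

# An arbitrary diagonalizable (not necessarily symmetric) matrix.
L = np.array([[4.0, 1.0],
              [2.0, 3.0]])

lam, E = np.linalg.eig(L)      # columns of E are the kets |e_i>
F = np.linalg.inv(E)           # rows of F are the bras <f_i|, the reciprocal basis

print(np.allclose(F @ E, np.eye(2)))   # biorthogonality: <f_i|e_j> = delta_ij

# Dyad expansion: L = sum_i lambda_i |e_i><f_i|
L_rebuilt = sum(lam[i] * np.outer(E[:, i], F[i, :]) for i in range(2))
print(np.allclose(L_rebuilt, L))       # True
```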
Some natural questions are: under what circumstances does this formalism work, and for what operators L are expansions in series of other operators like this possible? Can any function f be expressed in terms of the eigenfunctions (are they a Schauder basis) and under what circumstances does a point spectrum or a continuous spectrum arise? How do the formalisms for infinite-dimensional spaces and finite-dimensional spaces differ, or do they differ? Can these ideas be extended to a broader class of spaces? Answering such questions is the realm of spectral theory and requires considerable background in functional analysis and matrix algebra.
This section continues in the rough and ready manner of the above section using the bra–ket notation, and glossing over the many important details of a rigorous treatment.[21] A rigorous mathematical treatment may be found in various references.[22] In particular, the dimension n of the space will be finite.
Using the bra–ket notation of the above section, the identity operator may be written as:
I = \sum_{i=1}^{n} |e_i\rangle \langle f_i|

where it is supposed as above that the \{|e_i\rangle\} are a basis and the \{\langle f_i|\} a reciprocal basis for the space satisfying the relation:

\langle f_i | e_j \rangle = \delta_{ij} .
This expression of the identity operation is called a representation or a resolution of the identity.[21] [22] This formal representation satisfies the basic property of the identity:
I^k = I ,

valid for every positive integer k.
Applying the resolution of the identity to any function in the space |ψ⟩, one obtains:

I |\psi\rangle = |\psi\rangle = \sum_{i=1}^{n} |e_i\rangle \langle f_i | \psi \rangle = \sum_{i=1}^{n} c_i |e_i\rangle ,

which is the generalized Fourier expansion of ψ in terms of the basis functions \{e_i\}, with coefficients c_i = \langle f_i | \psi \rangle.
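Concretely, in finite dimensions the expansion coefficients are computed by applying the reciprocal basis, as in this continuation of the sketch above (same illustrative matrix; NumPy assumed):

```python
import numpy as np

L = np.array([[4.0, 1.0],
              [2.0, 3.0]])
lam, E = np.linalg.eig(L)      # columns of E: |e_i>
F = np.linalg.inv(E)           # rows of F: <f_i|

psi = np.array([1.0, -2.0])    # an arbitrary vector |psi>

c = F @ psi                    # c_i = <f_i|psi>
print(np.allclose(E @ c, psi)) # sum_i c_i |e_i> reconstructs |psi>: I|psi> = |psi>
```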
Given some operator equation of the form:

O |\psi\rangle = |h\rangle ,

with h in the space, this equation can be solved in the above basis through the formal manipulations:

O |\psi\rangle = \sum_{i=1}^{n} c_i \left( O |e_i\rangle \right) = \sum_{i=1}^{n} |e_i\rangle \langle f_i | h \rangle ,

\langle f_j | O |\psi\rangle = \sum_{i=1}^{n} c_i \langle f_j | O | e_i \rangle = \sum_{i=1}^{n} \langle f_j | e_i \rangle \langle f_i | h \rangle = \langle f_j | h \rangle , \quad \forall j ,

which converts the operator equation to a matrix equation determining the unknown coefficients c_i in terms of the generalized Fourier coefficients \langle f_j | h \rangle of h and the matrix elements O_{ji} = \langle f_j | O | e_i \rangle of the operator O.
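As a sketch of this reduction (an arbitrary operator and basis for illustration; NumPy assumed), the operator equation becomes an ordinary linear system for the coefficients:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 4

# Arbitrary invertible operator O and right-hand side h, for illustration.
O = rng.standard_normal((n, n)) + 5.0 * np.eye(n)
h = rng.standard_normal(n)

# Any basis E with reciprocal basis F = inv(E); here a random one.
E = rng.standard_normal((n, n))
F = np.linalg.inv(E)

O_mat = F @ O @ E                    # matrix elements O_ji = <f_j|O|e_i>
c = np.linalg.solve(O_mat, F @ h)    # matrix equation for the coefficients c_i

psi = E @ c                          # |psi> = sum_i c_i |e_i>
print(np.allclose(O @ psi, h))       # True: O|psi> = |h>
```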
The role of spectral theory arises in establishing the nature and existence of the basis and the reciprocal basis. In particular, the basis might consist of the eigenfunctions of some linear operator L:
L |e_i\rangle = \lambda_i |e_i\rangle ;

with the \{\lambda_i\} the eigenvalues of L, from the spectrum of L. Then the resolution of the identity above provides the dyad expansion of L:
L I = L = \sum_{i=1}^{n} L |e_i\rangle \langle f_i| = \sum_{i=1}^{n} \lambda_i |e_i\rangle \langle f_i| .
See main article: Resolvent formalism.
See also: Green's function and Dirac delta function. Using spectral theory, the resolvent operator R:
R = (\lambda I - L)^{-1} ,
can be evaluated in terms of the eigenfunctions and eigenvalues of L, and the Green's function corresponding to L can be found.
Applying R to some arbitrary function in the space, say φ,

R |\varphi\rangle = (\lambda I - L)^{-1} |\varphi\rangle = \sum_{i=1}^{n} \frac{1}{\lambda - \lambda_i} |e_i\rangle \langle f_i | \varphi \rangle .
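For a matrix this expansion can be verified directly against the inverse (continuing the illustrative 2×2 example; NumPy assumed):

```python
import numpy as np

L = np.array([[4.0, 1.0],
              [2.0, 3.0]])           # eigenvalues 2 and 5
lam_i, E = np.linalg.eig(L)
F = np.linalg.inv(E)

phi = np.array([1.0, 1.0])
lam = 3.0                            # any lambda outside the spectrum {2, 5}

direct = np.linalg.solve(lam * np.eye(2) - L, phi)   # (lambda I - L)^(-1) |phi>
expansion = sum((F[i] @ phi) / (lam - lam_i[i]) * E[:, i] for i in range(2))
print(np.allclose(direct, expansion))                # True
```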
Viewed as a function of the complex variable λ, R|φ⟩ has poles at each eigenvalue of L. Thus, using the calculus of residues:

\frac{1}{2\pi i} \oint_C R |\varphi\rangle \, d\lambda = -\sum_{i=1}^{n} |e_i\rangle \langle f_i | \varphi \rangle = -|\varphi\rangle ,
where the line integral is over a contour C that includes all the eigenvalues of L.
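The residue calculation can also be checked numerically by discretizing the contour integral (a sketch for the same illustrative matrix; NumPy assumed). Note the orientation: a counterclockwise contour yields +|φ⟩, so the minus sign above corresponds to the opposite traversal of C:

```python
import numpy as np

L = np.array([[4.0, 1.0],
              [2.0, 3.0]])          # eigenvalues 2 and 5
phi = np.array([1.0, 1.0])
I2 = np.eye(2)

# Counterclockwise circle lambda = 3.5 + 3 e^{i theta}, enclosing both eigenvalues.
m = 2000
result = np.zeros(2, dtype=complex)
for th in np.linspace(0.0, 2.0 * np.pi, m, endpoint=False):
    lam = 3.5 + 3.0 * np.exp(1j * th)
    dlam = 1j * 3.0 * np.exp(1j * th) * (2.0 * np.pi / m)
    result += np.linalg.solve(lam * I2 - L, phi) * dlam   # R_lambda |phi> d(lambda)
result /= 2j * np.pi

print(np.allclose(result, phi))     # True: the residues sum to |phi>
```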
Suppose our functions are defined over some coordinates, that is:
\langle x, \varphi \rangle = \varphi(x_1, x_2, \dots) .
Introducing the notation
\langle x, y \rangle = \delta(x - y) ,

where δ(x − y) = δ(x_1 − y_1) δ(x_2 − y_2) δ(x_3 − y_3) ⋯ is the Dirac delta function,[24] we can write

\langle x, \varphi \rangle = \int \langle x, y \rangle \langle y, \varphi \rangle \, dy .
Then:
\begin{align}
\left\langle x, \frac{1}{2\pi i} \oint_C \frac{\varphi}{\lambda I - L} \, d\lambda \right\rangle
&= \frac{1}{2\pi i} \oint_C d\lambda \left\langle x, \frac{\varphi}{\lambda I - L} \right\rangle \\
&= \frac{1}{2\pi i} \oint_C d\lambda \int dy \left\langle x, \frac{y}{\lambda I - L} \right\rangle \langle y, \varphi \rangle
\end{align}
The function G(x, y; λ) defined by:
\begin{align}
G(x, y; \lambda) &= \left\langle x, \frac{y}{\lambda I - L} \right\rangle \\
&= \sum_{i=1}^{n} \sum_{j=1}^{n} \langle x, e_i \rangle \left\langle f_i, \frac{e_j}{\lambda I - L} \right\rangle \langle f_j, y \rangle \\
&= \sum_{i=1}^{n} \frac{\langle x, e_i \rangle \langle f_i, y \rangle}{\lambda - \lambda_i} \\
&= \sum_{i=1}^{n} \frac{e_i(x) \, f_i^*(y)}{\lambda - \lambda_i} ,
\end{align}
is called the Green's function for operator L, and satisfies:[25]
\frac{1}{2\pi i} \oint_C G(x, y; \lambda) \, d\lambda = -\sum_{i=1}^{n} \langle x, e_i \rangle \langle f_i, y \rangle = -\langle x, y \rangle = -\delta(x - y) .
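For a concrete finite-dimensional check, one can discretize an operator and compare the eigen-expansion of G with a direct matrix inverse. A sketch using a discretized 1D Laplacian (an illustrative choice; since it is symmetric, the reciprocal basis coincides with the eigenbasis; NumPy assumed):

```python
import numpy as np

# Second-difference matrix: a discretized 1D Laplacian with Dirichlet ends.
n = 50
L = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)

lam_i, E = np.linalg.eigh(L)      # orthonormal eigenvectors, so f_i = e_i
lam = -1.0                        # any lambda outside the (positive) spectrum

# G(x, y; lambda) = sum_i e_i(x) e_i(y) / (lambda - lambda_i)
G = sum(np.outer(E[:, i], E[:, i]) / (lam - lam_i[i]) for i in range(n))

# Same matrix computed directly as (lambda I - L)^(-1).
print(np.allclose(G, np.linalg.inv(lam * np.eye(n) - L)))   # True
```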
See also: Spectral theory of ordinary differential equations and Integral equation. Consider the operator equation:
(O - \lambda I) |\psi\rangle = |h\rangle ;
in terms of coordinates:
\int \langle x, (O - \lambda I) y \rangle \langle y, \psi \rangle \, dy = h(x) .
A particular case is λ = 0.
The Green's function of the previous section is:
\langle y, G(\lambda) z \rangle = \left\langle y, (O - \lambda I)^{-1} z \right\rangle = G(y, z; \lambda) ,
and satisfies:
\int \langle x, (O - \lambda I) y \rangle \langle y, G(\lambda) z \rangle \, dy = \int \langle x, (O - \lambda I) y \rangle \left\langle y, (O - \lambda I)^{-1} z \right\rangle dy = \langle x, z \rangle = \delta(x - z) .
Using this Green's function property:
\int \langle x, (O - \lambda I) y \rangle \, G(y, z; \lambda) \, dy = \delta(x - z) .
Then, multiplying both sides of this equation by h(z) and integrating:
\int dz \, h(z) \int dy \, \langle x, (O - \lambda I) y \rangle G(y, z; \lambda) = \int dy \, \langle x, (O - \lambda I) y \rangle \int dz \, h(z) G(y, z; \lambda) = h(x) ,
which suggests the solution is:
\psi(x) = \int h(z) \, G(x, z; \lambda) \, dz .
That is, the function ψ(x) satisfying the operator equation is found if we can find the spectrum of O, and construct G, for example by using:
G(x, z; \lambda) = \sum_{i=1}^{n} \frac{e_i(x) \, f_i^*(z)}{\lambda - \lambda_i} .
There are many other ways to find G, of course.[26] See the articles on Green's functions and on Fredholm integral equations. It must be kept in mind that the above mathematics is purely formal; a rigorous treatment involves fairly sophisticated mathematics, including a good background knowledge of functional analysis, Hilbert spaces, distributions and so forth. Consult these articles and the references for more detail.
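As a numerical illustration of this construction (the same discretized Laplacian as above, here playing the role of O; NumPy assumed), the eigen-expansion of G indeed solves the operator equation. Note the denominator: for (O - λI)ψ = h the expansion carries λ_i - λ, that is, the sign convention opposite to (λI - O)^{-1}:

```python
import numpy as np

n = 50
O = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)   # illustrative operator
lam_i, E = np.linalg.eigh(O)      # symmetric, so f_i = e_i

h = np.sin(np.linspace(0.0, np.pi, n))   # an arbitrary right-hand side
lam = -1.0                               # lambda outside the spectrum of O

# psi(x) = sum_z h(z) G(x, z; lambda), assembled from the eigen-expansion;
# denominators are (lambda_i - lambda) because the equation is (O - lambda I) psi = h.
psi = sum((E[:, i] @ h) / (lam_i[i] - lam) * E[:, i] for i in range(n))

print(np.allclose((O - lam * np.eye(n)) @ psi, h))   # True
```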
Optimization problems provide perhaps the most useful examples of the combinatorial significance of the eigenvalues and eigenvectors of symmetric matrices, especially for the Rayleigh quotient x^T M x / x^T x with respect to a matrix M.
Theorem. Let M be a symmetric matrix and let x be a non-zero vector that maximizes the Rayleigh quotient with respect to M. Then, x is an eigenvector of M with eigenvalue equal to the Rayleigh quotient. Moreover, this eigenvalue is the largest eigenvalue of M.
Proof. Assume the spectral theorem. Let the eigenvalues of M be

\lambda_1 \leq \lambda_2 \leq \dots \leq \lambda_n ,

with corresponding orthonormal eigenvectors \{v_i\}. Since the \{v_i\} form an orthonormal basis, any vector x can be expressed in this basis as

x = \sum_i (v_i^T x) \, v_i .

This formula is straightforward to verify: for any index j, orthonormality gives

\begin{align}
v_j^T \sum_i (v_i^T x) \, v_i &= \sum_i (v_i^T x)(v_j^T v_i) \\
&= (v_j^T x)(v_j^T v_j) \\
&= v_j^T x ,
\end{align}

so both sides have the same coefficients with respect to the basis \{v_i\}.
Evaluating the Rayleigh quotient with respect to x:

\begin{align}
x^T M x &= \left( \sum_i (v_i^T x) \, v_i \right)^T M \left( \sum_j (v_j^T x) \, v_j \right) \\
&= \left( \sum_i (v_i^T x) \, v_i^T \right) \left( \sum_j (v_j^T x) \, \lambda_j v_j \right) \\
&= \sum_{i,j} (v_i^T x)(v_j^T x) \, \lambda_j \, (v_i^T v_j) \\
&= \sum_j (v_j^T x)(v_j^T x) \, \lambda_j \\
&= \sum_j (v_j^T x)^2 \lambda_j \leq \lambda_n \sum_j (v_j^T x)^2 \\
&= \lambda_n \, x^T x ,
\end{align}

where the last equality uses \sum_j (v_j^T x)^2 = x^T x, which follows from the expansion of x above. So the Rayleigh quotient satisfies

\frac{x^T M x}{x^T x} \leq \lambda_n

for every non-zero x, with equality when x = v_n. Hence the maximizing vector is an eigenvector of M, and the maximal value of the Rayleigh quotient is the largest eigenvalue λ_n.
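The theorem is easy to test numerically; a minimal sketch (random symmetric matrix for illustration; NumPy assumed):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((5, 5))
M = (A + A.T) / 2.0                # an arbitrary symmetric matrix

lam, V = np.linalg.eigh(M)         # ascending eigenvalues, orthonormal eigenvectors

def rayleigh(x):
    return (x @ M @ x) / (x @ x)

# The quotient never exceeds the largest eigenvalue ...
samples = rng.standard_normal((1000, 5))
print(max(rayleigh(x) for x in samples) <= lam[-1] + 1e-12)   # True

# ... and equality is attained at the top eigenvector.
print(np.isclose(rayleigh(V[:, -1]), lam[-1]))                # True
```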