In mathematics, ergodicity expresses the idea that a point of a moving system, either a dynamical system or a stochastic process, will eventually visit all parts of the space that the system moves in, in a uniform and random sense. This implies that the average behavior of the system can be deduced from the trajectory of a "typical" point. Equivalently, a sufficiently large collection of random samples from a process can represent the average statistical properties of the entire process. Ergodicity is a property of the system; it is a statement that the system cannot be reduced or factored into smaller components. Ergodic theory is the study of systems possessing ergodicity.
Ergodic systems occur in a broad range of systems in physics and in geometry. This can be roughly understood to be due to a common phenomenon: the motion of particles, that is, geodesics on a hyperbolic manifold are divergent; when that manifold is compact, that is, of finite size, those orbits return to the same general area, eventually filling the entire space.
Ergodic systems capture the common-sense, every-day notions of randomness, such that smoke might come to fill all of a smoke-filled room, or that a block of metal might eventually come to have the same temperature throughout, or that flips of a fair coin may come up heads and tails half the time. A stronger concept than ergodicity is that of mixing, which aims to mathematically describe the common-sense notions of mixing, such as mixing drinks or mixing cooking ingredients.
The proper mathematical formulation of ergodicity is founded on the formal definitions of measure theory and dynamical systems, and rather specifically on the notion of a measure-preserving dynamical system. The origins of ergodicity lie in statistical physics, where Ludwig Boltzmann formulated the ergodic hypothesis.
Ergodicity occurs in broad settings in physics and mathematics. All of these settings are unified by a common mathematical description, that of the measure-preserving dynamical system. Equivalently, ergodicity can be understood in terms of stochastic processes. They are one and the same, despite using dramatically different notation and language.
The mathematical definition of ergodicity aims to capture ordinary every-day ideas about randomness. This includes ideas about systems that move in such a way as to (eventually) fill up all of space, such as diffusion and Brownian motion, as well as common-sense notions of mixing, such as mixing paints, drinks, cooking ingredients, industrial process mixing, smoke in a smoke-filled room, the dust in Saturn's rings and so on. To provide a solid mathematical footing, descriptions of ergodic systems begin with the definition of a measure-preserving dynamical system. This is written as
(X,l{A},\mu,T).
The set
X
\mu
X
l{A}
A\subsetX
\mu(A)
l{A}
X
l{A}
T:X\toX
A\subsetX
T(A)
A
T(A)
A
A formal difficulty arises when one tries to reconcile the volume of sets with the need to preserve their size under a map. The problem arises because, in general, several different points in the domain of a function can map to the same point in its range; that is, there may be
x\ney
T(x)=T(y)
x\inX
T-1:l{A}\tol{A}
A\subsetX
T-1(A)\inl{A}
l{A}\tol{A}
X\toX
\mu(A)=\mud\left(T-1(A)\right)
T-1(A)
A
One is now interested in studying the time evolution of the system. If a set
A\inl{A}
X
Tn(A)
X
n
A
A
Mixing is a stronger statement than ergodicity. Mixing asks for this ergodic property to hold between any two sets
A,B
A
X
A,B\inl{A}
N
A,B
n>N
Tn(A)\capB\ne\varnothing
\cap
\varnothing
See main article: Ergodic process.
The above discussion appeals to a physical sense of a volume. The volume does not have to literally be some portion of 3D space; it can be some abstract volume. This is generally the case in statistical systems, where the volume (the measure) is given by the probability. The total volume corresponds to probability one. This correspondence works because the axioms of probability theory are identical to those of measure theory; these are the Kolmogorov axioms.[1]
The idea of a volume can be very abstract. Consider, for example, the set of all possible coin-flips: the set of infinite sequences of heads and tails. Assigning the volume of 1 to this space, it is clear that half of all such sequences start with heads, and half start with tails. One can slice up this volume in other ways: one can say "I don't care about the first
n-1
n
(*, … ,*,h,*, … )
*
h
The above is enough to build up a measure-preserving dynamical system, in its entirety. The sets of
h
t
n
l{A}
X
\mu
h
m
t
k
h
t
m
k
For the coin-flip process, the time-evolution operator
T
(x1,x2, … )
T(x1,x2, … )=(x2,x3, … )
A\inl{A}
x1=*
\mu(A)
\mu(A)=\mu(T(A))
T-1
T-1(x1,x2, … )=(*,x1,x2, … )
\mud\left(T-1(A)\right)=\mu(A)
A
T-1
The above development takes a random process, the Bernoulli process, and converts it to a measure-preserving dynamical system
(X,l{A},\mu,T).
X
X
X
X
There are several important points to be made about the Bernoulli process. If one writes 0 for tails and 1 for heads, one gets the set of all infinite strings of binary digits. These correspond to the base-two expansion of real numbers. Explicitly, given a sequence
(x1,x2, … )
infty | |
y=\sum | |
n=1 |
xn | |
2n |
The statement that the Bernoulli process is ergodic is equivalent to the statement that the real numbers are uniformly distributed. The set of all such strings can be written in a variety of ways:
\{h,t\}infty=\{h,t\}\omega=\{0,1\}\omega=2\omega=2N.
C(x)=
infty | |
\sum | |
n=1 |
xn | |
3n |
In the end, these are all "the same thing".
The Cantor set plays key roles in many branches of mathematics. In recreational mathematics, it underpins the period-doubling fractals; in analysis, it appears in a vast variety of theorems. A key one for stochastic processes is the Wold decomposition, which states that any stationary process can be decomposed into a pair of uncorrelated processes, one deterministic, and the other being a moving average process.
The Ornstein isomorphism theorem states that every stationary stochastic process is equivalent to a Bernoulli scheme (a Bernoulli process with an N-sided (and possibly unfair) gaming die). Other results include that every non-dissipative ergodic system is equivalent to the Markov odometer, sometimes called an "adding machine" because it looks like elementary-school addition, that is, taking a base-N digit sequence, adding one, and propagating the carry bits. The proof of equivalence is very abstract; understanding the result is not: by adding one at each time step, every possible state of the odometer is visited, until it rolls over, and starts again. Likewise, ergodic systems visit each state, uniformly, moving on to the next, until they have all been visited.
Systems that generate (infinite) sequences of N letters are studied by means of symbolic dynamics. Important special cases include subshifts of finite type and sofic systems.
The term ergodic is commonly thought to derive from the Greek words Greek, Modern (1453-);: ἔργον (ergon: "work") and Greek, Modern (1453-);: ὁδός (hodos: "path", "way"), as chosen by Ludwig Boltzmann while he was working on a problem in statistical mechanics. At the same time it is also claimed to be a derivation of ergomonode, coined by Boltzmann in a relatively obscure paper from 1884. The etymology appears to be contested in other ways as well.[2]
The idea of ergodicity was born in the field of thermodynamics, where it was necessary to relate the individual states of gas molecules to the temperature of a gas as a whole and its time evolution thereof. In order to do this, it was necessary to state what exactly it means for gases to mix well together, so that thermodynamic equilibrium could be defined with mathematical rigor. Once the theory was well developed in physics, it was rapidly formalized and extended, so that ergodic theory has long been an independent area of mathematics in itself. As part of that progression, more than one slightly different definition of ergodicity and multitudes of interpretations of the concept in different fields coexist.
For example, in classical physics the term implies that a system satisfies the ergodic hypothesis of thermodynamics,[3] the relevant state space being position and momentum space.
In dynamical systems theory the state space is usually taken to be a more general phase space. On the other hand in coding theory the state space is often discrete in both time and state, with less concomitant structure. In all those fields the ideas of time average and ensemble average can also carry extra baggage as well - as is the case with the many possible thermodynamically relevant partition functions used to define ensemble averages in physics, back again. As such the measure theoretic formalization of the concept also serves as a unifying discipline. In 1913 Michel Plancherel proved the strict impossibility of ergodicity for a purely mechanical system.[4]
A review of ergodicity in physics, and in geometry follows. In all cases, the notion of ergodicity is exactly the same as that for dynamical systems; there is no difference, except for outlook, notation, style of thinking and the journals where results are published.
Physical systems can be split into three categories: classical mechanics, which describes machines with a finite number of moving parts, quantum mechanics, which describes the structure of atoms, and statistical mechanics, which describes gases, liquids, solids; this includes condensed matter physics. These presented below.
This section reviews ergodicity in statistical mechanics. The above abstract definition of a volume is required as the appropriate setting for definitions of ergodicity in physics. Consider a container of liquid, or gas, or plasma, or other collection of atoms or particles. Each and every particle
xi
R6.
N
6N
R6N.
R6N
W x H x L
\left(W x H x L x R3\right)N.
N
A physical system is said to be ergodic if any representative point of the system eventually comes to visit the entire volume of the system. For the above example, this implies that any given atom not only visits every part of the box
W x H x L
Formal mathematical proofs of ergodicity in statistical physics are hard to come by; most high-dimensional many-body systems are assumed to be ergodic, without mathematical proof. Exceptions include the dynamical billiards, which model billiard ball-type collisions of atoms in an ideal gas or plasma. The first hard-sphere ergodicity theorem was for Sinai's billiards, which considers two balls, one of them taken as being stationary, at the origin. As the second ball collides, it moves away; applying periodic boundary conditions, it then returns to collide again. By appeal to homogeneity, this return of the "second" ball can instead be taken to be "just some other atom" that has come into range, and is moving to collide with the atom at the origin (which can be taken to be just "any other atom".) This is one of the few formal proofs that exist; there are no equivalent statements e.g. for atoms in a liquid, interacting via van der Waals forces, even if it would be common sense to believe that such systems are ergodic (and mixing). More precise physical arguments can be made, though.
The formal study of ergodicity can be approached by examining fairly simple dynamical systems. Some of the primary ones are listed here.
The irrational rotation of a circle is ergodic: the orbit of a point is such that eventually, every other point in the circle is visited. Such rotations are a special case of the interval exchange map. The beta expansions of a number are ergodic: beta expansions of a real number are done not in base-N, but in base-
\beta
\beta.
Ergodicity is a widespread phenomenon in the study of symplectic manifolds and Riemannian manifolds. Symplectic manifolds provide the generalized setting for classical mechanics, where the motion of a mechanical system is described by a geodesic. Riemannian manifolds are a special case: the cotangent bundle of a Riemannian manifold is always a symplectic manifold. In particular, the geodesics on a Riemannian manifold are given by the solution of the Hamilton–Jacobi equations.
The geodesic flow of a flat torus following any irrational direction is ergodic; informally this means that when drawing a straight line in a square starting at any point, and with an irrational angle with respect to the sides, if every time one meets a side one starts over on the opposite side with the same angle, the line will eventually meet every subset of positive measure. More generally on any flat surface there are many ergodic directions for the geodesic flow.
For non-flat surfaces, one has that the geodesic flow of any negatively curved compact Riemann surface is ergodic. A surface is "compact" in the sense that it has finite surface area. The geodesic flow is a generalization of the idea of moving in a "straight line" on a curved surface: such straight lines are geodesics. One of the earliest cases studied is Hadamard's billiards, which describes geodesics on the Bolza surface, topologically equivalent to a donut with two holes. Ergodicity can be demonstrated informally, if one has a sharpie and some reasonable example of a two-holed donut: starting anywhere, in any direction, one attempts to draw a straight line; rulers are useful for this. It doesn't take all that long to discover that one is not coming back to the starting point. (Of course, crooked drawing can also account for this; that's why we have proofs.)
These results extend to higher dimensions. The geodesic flow for negatively curved compact Riemannian manifolds is ergodic. A classic example for this is the Anosov flow, which is the horocycle flow on a hyperbolic manifold. This can be seen to be a kind of Hopf fibration. Such flows commonly occur in classical mechanics, which is the study in physics of finite-dimensional moving machinery, e.g. the double pendulum and so-forth. Classical mechanics is constructed on symplectic manifolds. The flows on such systems can be deconstructed into stable and unstable manifolds; as a general rule, when this is possible, chaotic motion results. That this is generic can be seen by noting that the cotangent bundle of a Riemannian manifold is (always) a symplectic manifold; the geodesic flow is given by a solution to the Hamilton–Jacobi equations for this manifold. In terms of the canonical coordinates
(q,p)
H=\tfrac{1}{2}\sumijgij(q)pipj
with
gij
pi
E=\tfrac{1}{2}mv2
Ergodicity results have been provided in translation surfaces, hyperbolic groups and systolic geometry. Techniques include the study of ergodic flows, the Hopf decomposition, and the Ambrose–Kakutani–Krengel–Kubo theorem. An important class of systems are the Axiom A systems.
A number of both classification and "anti-classification" results have been obtained. The Ornstein isomorphism theorem applies here as well; again, it states that most of these systems are isomorphic to some Bernoulli scheme. This rather neatly ties these systems back into the definition of ergodicity given for a stochastic process, in the previous section. The anti-classification results state that there are more than a countably infinite number of inequivalent ergodic measure-preserving dynamical systems. This is perhaps not entirely a surprise, as one can use points in the Cantor set to construct similar-but-different systems. See measure-preserving dynamical system for a brief survey of some of the anti-classification results.
All of the previous sections considered ergodicty either from the point of view of a measurable dynamical system, or from the dual notion of tracking the motion of individual particle trajectories. A closely related concept occurs in (non-linear) wave mechanics. There, the resonant interaction allows for the mixing of normal modes, often (but not always) leading to the eventual thermalization of the system. One of the earliest systems to be rigorously studied in this context is the Fermi–Pasta–Ulam–Tsingou problem, a string of weakly coupled oscillators.
A resonant interaction is possible whenever the dispersion relations for the wave media allow three or more normal modes to sum in such a way as to conserve both the total momentum and the total energy. This allows energy concentrated in one mode to bleed into other modes, eventually distributing that energy uniformly across all interacting modes.
Resonant interactions between waves helps provide insight into the distinction between high-dimensional chaos (that is, turbulence) and thermalization. When normal modes can be combined so that energy and momentum are exactly conserved, then the theory of resonant interactions applies, and energy spreads into all of the interacting modes. When the dispersion relations only allow an approximate balance, turbulence or chaotic motion results. The turbulent modes can then transfer energy into modes that do mix, eventually leading to thermalization, but not before a preceding interval of chaotic motion.
As to quantum mechanics, there is no universal quantum definition of ergodocity or even chaos (see quantum chaos).[5] However, there is a quantum ergodicity theorem stating that the expectation value of an operator converges to the corresponding microcanonical classical average in the semiclassical limit
\hbar → 0
Ergodic measures provide one of the cornerstones with which ergodicity is generally discussed. A formal definition follows.
See main article: Conservative system. Let
(X,lB)
T
X
\mu
(X,lB)
\mud\left(T-1(A)\right)=\mu(A)
A\inlB
T
\mu;
\mu
T
A measurable function
T
\mu
\mu
T
T
\mu
For any
A\inlB
T-1(A)=A
\mu(A)=0
\mu(A)=1
In other words, there are no T
\mu
Some authors[16] relax the requirement that
T
\mu
T
\mu
N
T-1(N)
T(N)
The simplest example is when
X
\mu
X
\mu
T
x,y\inX
k\inN
y=Tk(x)
X=\{1,2,\ldots,n\}
(12 … n)
(12)(34 … n)
\{1,2\}
\{3,4,\ldots,n\}
The definition given above admits the following immediate reformulations:
A\inlB
\mud\left(T-1(A)triangleupA\right)=0
\mu(A)=0
\mu(A)=1
triangleup
A\inlB
A,B\inlB
n>0
\mud\left(\left(T-n(A)\right)\capB\right)>0
f:X\toR
f\circT=f
Importantly for applications, the condition in the last characterisation can be restricted to square-integrable functions only:
f\inL2(X,\mu)
f\circT=f
f
See also: Bernoulli shift. Let
S
X=SZ
\mu
S
T
T\left((sk)k)\right)=(sk+1)k
There are many more ergodic measures for the shift map
T
X
See also: Irrational rotation. Let
X
\{z\inC,|z|=1\}
\mu
\theta\inR
X
\theta
T\theta(z)=e2i\pi\thetaz
\theta\inQ
T\theta
\theta
T\theta
See also: Arnold's cat map. Let
X=R2/Z2
g\inSL2(Z)
X
g\left(Z2\right)=Z2
If
\mu
X
T
f:X\toR
\mu
x\inX
x
f
The mean ergodic theorem of J. von Neumann is a similar, weaker statement about averaged translates of square-integrable functions.
An immediate consequence of the definition of ergodicity is that on a topological space
X
lB
T
\mu
\mu
T
\mu
This is not an equivalence since for a transformation which is not uniquely ergodic, but for which there is an ergodic measure with full support
\mu0
\mu1
T
See main article: Mixing (mathematics). A transformation
T
(X,\mu)
\mu
A,B\subsetX
\mu
T
The definition is essentially the same for continuous-time dynamical systems as for a single transformation. Let
(X,lB)
t\inR+
Tt
X
t,s\inR+
Ts+t=Ts\circTt
R+ x X\toX
\mu
(X,lB)
Tt
\mu
\mu
T
Tt
\mu
For any
A\inlB
t\inR+
-1 | |
T | |
t |
(A)\subsetA
\mu(A)=0
\mu(A)=1
As in the discrete case the simplest example is that of a transitive action, for instance the action on the circle given by
Tt(z)=e2i\piz
An example with infinitely many orbits is given by the flow along an irrational slope on the torus: let
X=S1 x S1
\alpha\inR
Tt(z1,z2)=\left(e2i\piz1,e2\alphaz2\right)
\alpha\not\inQ
Further examples of ergodic flows are:
If
X
X
A very powerful alternate definition of ergodic measures can be given using the theory of Banach spaces. Radon measures on
X
lP(X)
X
T
X
lP(X)T
T
T
In the setting above it follows from the Banach-Alaoglu theorem that there always exists extremal points in
lP(X)T
See main article: Conservative system. In general an invariant measure need not be ergodic, but as a consequence of Choquet theory it can always be expressed as the barycenter of a probability measure on the set of ergodic measures. This is referred to as the ergodic decomposition of the measure.
In the case of
X=\{1,\ldots,n\}
T=(12)(34 … n)
T
\mu1,\mu2
\{1,2\}
\{3,\ldots,n\}
T
t\mu1+(1-t)\mu2
t\in[0,1]
Everything in this section transfers verbatim to continuous actions of
R
R+
The transformation
T
\mu
X
T
In the examples considered above, irrational rotations of the circle are uniquely ergodic; shift maps are not.
See main article: Ergodic process.
If
\left(Xn\right)n
\Omega
\OmegaN
\left(xn\right)n\mapsto\left(xn+1\right)n
The simplest case is that of an independent and identically distributed process which corresponds to the shift map described above. Another important case is that of a Markov chain which is discussed in detail below.
A similar interpretation holds for continuous-time stochastic processes though the construction of the measurable structure of the action is more complicated.
Let
S
S
P\in[0,1]S
P(s1,s2)
s1
s2
s\inS
P
\nu
S
\nuP=\nu
s\inS
Using this data we can define a probability measure
\mu\nu
X=SZ
Stationarity of
\nu
\mu\nu
T\left(\left(sk\right)k)\right)=\left(sk+1\right)k
The measure
\mu\nu
The hypotheses above imply that there is a unique stationary measure for the Markov chain. In terms of the matrix
P
P
P
C
Note that in probability theory the Markov chain is called ergodic if in addition each state is aperiodic (the times where the return probability is positive are not multiples of a single integer >1). This is not necessary for the invariant measure to be ergodic; hence the notions of "ergodicity" for a Markov chain and the associated shift-invariant measure are different (the one for the chain is strictly stronger).[18]
Moreover the criterion is an "if and only if" if all communicating classes in the chain are recurrent and we consider all stationary measures.
If
P(s,s')=1/|S|
s,s'\inS
\muP
Markov chains with recurring communicating classes which are not irreducible are not ergodic, and this can be seen immediately as follows. If
S1,S2\subsetneqS
\nu1,\nu2
S1,S2
Z | |
S | |
1 |
Z | |
S | |
2 |
S=\{1,2\}
The Markov chain on
S=\{1,2\}
\mu
\{1,2\}Z
and
we have
The definition of ergodicity also makes sense for group actions. The classical theory (for invertible transformations) corresponds to actions of
Z
R
For non-abelian groups there might not be invariant measures even on compact metric spaces. However the definition of ergodicity carries over unchanged if one replaces invariant measures by quasi-invariant measures.
Important examples are the action of a semisimple Lie group (or a lattice therein) on its Furstenberg boundary.
A measurable equivalence relation it is said to be ergodic if all saturated subsets are either null or conull.