Kramers–Moyal expansion explained

In stochastic processes, Kramers–Moyal expansion refers to a Taylor series expansion of the master equation, named after Hans Kramers and José Enrique Moyal.[1] [2] [3] In many textbooks, the expansion is used only to derive the Fokker–Planck equation, and never used again. In general, continuous stochastic processes are essentially all Markovian, and so Fokker–Planck equations are sufficient for studying them. The higher-order Kramers–Moyal expansion only come into play when the process is jumpy. This usually means it is a Poisson-like process.[4]

For a real stochastic process, one can compute its central moment functions from experimental data on the process, from which one can then compute its Kramers–Moyal coefficients, and thus empirically measure its Kolmogorov forward and backward equations. This is implemented as a python package [5]

Statement

\partialp(x,t)
\partialt

=\intp(x,t|x0,t0)p(x0,t0)dx0

where

p(x,t|x0,t0)

is the transition probability function, and

p(x,t)

is the probability density at time

t

. The Kramers–Moyal expansion transforms the above to an infinite order partial differential equation[6] [7] [8]

\partialtp(x,t)=

infty
\sum
n=1
n[D
(-\partial
n(x,t)

p(x,t)]

and also\partial_t p(x, t|x_0, t_0) = \sum_^\infty (-\partial_x)^n [D_n(x, t) p(x, t|x_0, t_0) ]

where

Dn(x,t)

are the Kramers–Moyal coefficients, defined byD_n(x, t) = \frac\lim_ \frac \mu_n(t|x, t-\tau)and

\mun

are the central moment functions, defined by

\mun(t'|x,t)=

infty
\int
-infty

(x'-x)np(x',t'\midx,t)dx'.

The Fokker–Planck equation is obtained by keeping only the first two terms of the series in which

D1

is the drift and

D2

is the diffusion coefficient.[9]

Also, the moments, assuming they exist, evolves as

\frac\left\langle x^n\right\rangle=\sum_^n \frac\left\langle x^ D^(x, t)\right\ranglewhere angled brackets mean taking the expectation:

\left\langlef\right\rangle=\intf(x)p(x,t)dx

.

n-dimensional version

The above version is the one-dimensional version. It generalizes to n-dimensions. (Section 4.7)

Proof

In usual probability, where the probability density does not change, the moments of a probability density function determines the probability density itself by a Fourier transform (details may be found at the characteristic function page):p(x) = \frac \int e^\tilde p(k)dk = \sum_^\infty \frac\delta^(x)\mu_n \tilde p(k) = \int e^ p(x) dx = \sum_^\infty\frac \mu_n Similarly, p(x, t| x_0, t_0) = \sum_^\infty \frac\delta^(x-x_0) \mu_n(t|x_0, t_0)Now we need to integrate away the Dirac delta function. Fixing a small

\tau>0

, we have by the Chapman-Kolmogorov equation,\beginp(x, t) &= \int p(x,t|x', t-\tau) p(x', t-\tau) dx' \\&= \sum_^\infty \frac\int p(x', t-\tau) \delta^(x-x') \mu_n(t|x', t-\tau) dx' \\&= \sum_^\infty \frac \partial_x^n (p(x, t-\tau) \mu_n(t|x, t-\tau))\end The

n=0

term is just

p(x,t-\tau)

, so taking derivative with respect to time,\partial_t p(x, t) = \lim_\frac 1\tau \sum_^\infty \frac \partial_x^n (p(x, t-\tau) \mu_n(t|x, t-\tau)) = \sum_^\infty (-\partial_x)^n (p(x, t) D_n(x, t))

The same computation with

p(x,t|x0,t0)

gives the other equation.

Forward and backward equations

The equation can be recast into a linear operator form, using the idea of infinitesimal generator. Define the linear operator \mathcal A f := \sum_^\infty (-\partial_x)^n[D_n(x,t) f(x,t)] then the equation above states \begin\partial_t p(x, t) &= \mathcal p(x, t) \\\partial_t p(x, t|x_0, t_0) &= \mathcal p(x, t|x_0, t_0)\endIn this form, the equations are precisely in the form of a general Kolmogorov forward equation. The backward equation then states that\partial_t p(x_1, t_1|x, t) = -\mathcal^\dagger p(x_1, t_1|x, t)where\mathcal A^\dagger f := \sum_^\infty D_n(x,t) \partial_x^n[f(x,t)] is the Hermitian adjoint of

lA

.

Computing the Kramers–Moyal coefficients

By definition,D_n(x, t) = \frac\lim_ \frac \mu_n(t|x, t-\tau)This definition works because

\mun(t|x,t)=0

, as those are the central moments of the Dirac delta function. Since the even central moments are nonnegative, we have

D2n\geq0

for all

n\geq1

. When the stochastic process is the Markov process

dX=bdt+\sigmadWt

, we can directly solve for

p(x,t|x,t-\tau)

as approximated by a normal distribution with mean

x+b(x)\tau

and variance

\sigma2\tau

. This then allows us to compute the central moments, and soD_1 = b, \quad D_2 = \frac 12 \sigma^2, \quad D_3=D_4=\cdots = 0This then gives us the 1-dimensional Fokker–Planck equation:\partial_t p = -\partial_x(bp) + \frac 12 \partial_x^2(\sigma^2 p)

Pawula theorem

Pawula theorem states that either the sequence

D1,D2,D3,...

becomes zero at the third term, or all its even terms are positive.[10] [11]

Proof

By Cauchy–Schwarz inequality, the central moment functions satisfy

2
\mu
n+m

\leq\mu2n\mu2m

. So, taking the limit, we have
2
D
n+m

\leq

(2n)!(2m)!
(n+m)!2

D2nD2m

. If some

D2+n0

for some

n\geq1

, then

D2D2+2n>0

. In particular,

D2+n,D2+2n,D2+4n,...>0

. So the existence of any nonzero coefficient of order

\geq3

implies the existence of nonzero coefficients of arbitrarily large order. Also, if

Dn0

, then

D2D2n-2>0,D4D2n-4>0,...

. So the existence of any nonzero coefficient of order

n

implies all coefficients of order

2,4,...,2n-2

are positive.

Interpretation

Let the operator

lAm

be defined such

lAmf:=

m
\sum
n=1
n[D
(-\partial
n(x,t)

f(x,t)]

. The probability density evolves by

\partialt\rholAm\rho

. Different order of

m

gives different level of approximation.

m=0

: the probability density does not evolve

m=1

: it evolves by deterministic drift only.

m=2

: it evolves by drift and Brownian motion (Fokker-Planck equation)

m=infty

: the fully exact equation.

Pawula theorem means that if truncating to the second term is not exact, that is,

lA2lA

, then truncating to any term is still not exact. Usually, this means that for any truncation

lAm

, there exists a probability density function

\rho

that can become negative during its evolution

\partialt\rholAm\rho

(and thus fail to be a probability density function). However, this doesn't mean that Kramers-Moyal expansions truncated at other choices of

m

is useless. Though the solution must have negative values at least for sufficiently small times, the resulting approximation probability density may still be better than the

m=2

approximation.

Notes and References

  1. Kramers . H. A. . 1940 . Brownian motion in a field of force and the diffusion model of chemical reactions . Physica . 7 . 4 . 284–304 . 10.1016/S0031-8914(40)90098-2 . 1940Phy.....7..284K . 33337019 .
  2. Moyal . J. E. . 1949 . Stochastic processes and statistical physics . . Series B (Methodological) . 11 . 2 . 150–210 . 10.1111/j.2517-6161.1949.tb00030.x . 2984076 .
  3. Book: Risken . Hannes . The Fokker-Planck Equation: Methods of Solution and Applications . 6 December 2012 . Springer . 9783642968075.
  4. Book: Spinney . Richard . Ford . Ian . Klages . Rainer . Just . Wolfram . Jarzynski . Christopher . 1201.6381 . Fluctuation relations: a pedagogical overview . 10.1002/9783527658701.ch1 . 978-3-527-41094-1 . Weinheim . 3308060 . 3–56 . Wiley-VCH . Reviews of Nonlinear Dynamics and Complexity . Nonequilibrium Statistical Physics of Small Systems: Fluctuation relations and beyond . 2013.
  5. Rydin Gorjão . L. . Meirinhos . F. . 2019 . kramersmoyal: Kramers--Moyal coefficients for stochastic processes . . 4 . 44 . 1693 . 1912.09737 . 2019JOSS....4.1693G . 10.21105/joss.01693 . free.
  6. Book: Gardiner, C. . 2009 . Stochastic Methods . 4th . Berlin . Springer . 978-3-642-08962-6 .
  7. Book: Van Kampen, N. G. . 1992 . Stochastic Processes in Physics and Chemistry . Elsevier . 0-444-89349-0 .
  8. Book: Risken, H. . 1996 . The Fokker–Planck Equation . 63–95 . Springer . Berlin, Heidelberg . 3-540-61530-X .
  9. Book: Wolfgang . Paul . Jörg . Baschnagel . A Brief Survey of the Mathematics of Probability Theory . Stochastic Processes . 17–61 [esp. 33–35] . Springer . 2013 . 978-3-319-00326-9. 10.1007/978-3-319-00327-6_2 .
  10. Pawula . R. F. . 1967 . Generalizations and extensions of the Fokker–Planck–Kolmogorov equations . IEEE Transactions on Information Theory . 13 . 1 . 33–41 . 10.1109/TIT.1967.1053955.
  11. Pawula . R. F. . 1967 . Approximation of the linear Boltzmann equation by the Fokker–Planck equation . Physical Review . 162 . 1 . 186–188 . 1967PhRv..162..186P . 10.1103/PhysRev.162.186.