Edgeworth series explained

The Gram–Charlier A series (named in honor of Jørgen Pedersen Gram and Carl Charlier), and the Edgeworth series (named in honor of Francis Ysidro Edgeworth) are series that approximate a probability distribution in terms of its cumulants.[1] The series are the same; but, the arrangement of terms (and thus the accuracy of truncating the series) differ.[2] The key idea of these expansions is to write the characteristic function of the distribution whose probability density function is to be approximated in terms of the characteristic function of a distribution with known and suitable properties, and to recover through the inverse Fourier transform.

Gram–Charlier A series

We examine a continuous random variable. Let

\hat{f}

be the characteristic function of its distribution whose density function is, and

\kappar

its cumulants. We expand in terms of a known distribution with probability density function, characteristic function

\hat{\psi}

, and cumulants

\gammar

. The density is generally chosen to be that of the normal distribution, but other choices are possible as well. By the definition of the cumulants, we have (see Wallace, 1958)[3]

\hat{f}(t)=

infty\kappa
\exp\left[\sum
r(it)r
r!

\right]

and
infty\gamma
\hat{\psi}(t)=\exp\left[\sum
r(it)r
r!

\right],

which gives the following formal identity:
infty(\kappa
\hat{f}(t)=\exp\left[\sum
r-\gamma
r)(it)r
r!

\right]\hat{\psi}(t).

By the properties of the Fourier transform,

(it)r\hat{\psi}(t)

is the Fourier transform of

(-1)r[Dr\psi](-x)

, where is the differential operator with respect to . Thus, after changing

x

with

-x

on both sides of the equation, we find for the formal expansion

f(x)=

infty(\kappa
\exp\left[\sum
r

-

\gamma
r)(-D)r
r!

\right]\psi(x).

If is chosen as the normal density

\phi(x)=

1\sigma}\exp\left[-
\sqrt{2\pi
(x-\mu)2
2\sigma2

\right]

with mean and variance as given by, that is, mean

\mu=\kappa1

and variance

\sigma2=\kappa2

, then the expansion becomes

f(x)=

infty\kappa
\exp\left[\sum
r(-D)r
r!

\right]\phi(x),

since

\gammar=0

for all > 2, as higher cumulants of the normal distribution are 0. By expanding the exponential and collecting terms according to the order of the derivatives, we arrive at the Gram–Charlier A series. Such an expansion can be written compactly in terms of Bell polynomials as
infty\kappa
\exp\left[\sum
r(-D)r
r!

\right]=

infty
\sum
n=0

Bn(0,0,\kappa3,\ldots,\kappa

n)(-D)n
n!

.

Since the n-th derivative of the Gaussian function

\phi

is given in terms of Hermite polynomial as

\phi(n)(x)=

(-1)n
\sigman

Hen\left(

x-\mu
\sigma

\right)\phi(x),

this gives us the final expression of the Gram–Charlier A series as

f(x)=\phi(x)

infty
\sum
n=0
1
n!\sigman

Bn(0,0,\kappa3,\ldots,\kappan)Hen\left(

x-\mu
\sigma

\right).

F(x)=

x
\int
-infty

f(u)du=\Phi(x)-\phi(x)

infty
\sum
n=3
1
n!\sigman-1

Bn(0,0,\kappa3,\ldots,\kappan)Hen-1\left(

x-\mu
\sigma

\right),

where

\Phi

is the CDF of the normal distribution.

If we include only the first two correction terms to the normal distribution, we obtain

f(x)

1\sigma}\exp\left[-
\sqrt{2\pi
(x-\mu)2\right]\left[1+
2\sigma2
\kappa3
3!\sigma3
He\right)+
3\left(x-\mu
\sigma
\kappa4
4!\sigma4
He
4\left(x-\mu
\sigma

\right)\right],

with

3-3x
He
3(x)=x
and
4
He
4(x)=x

-6x2+3

.

Note that this expression is not guaranteed to be positive, and is therefore not a valid probability distribution. The Gram–Charlier A series diverges in many cases of interest—it converges only if

f(x)

falls off faster than

\exp(-(x2)/4)

at infinity (Cramér 1957). When it does not converge, the series is also not a true asymptotic expansion, because it is not possible to estimate the error of the expansion. For this reason, the Edgeworth series (see next section) is generally preferred over the Gram–Charlier A series.

The Edgeworth series

Edgeworth developed a similar expansion as an improvement to the central limit theorem.[4] The advantage of the Edgeworth series is that the error is controlled, so that it is a true asymptotic expansion.

Let

\{Zi\}

be a sequence of independent and identically distributed random variables with finite mean

\mu

and variance

\sigma2

, and let

Xn

be their standardized sums:

Xn=

1
\sqrt{n
} \sum_^n \frac.

Let

Fn

denote the cumulative distribution functions of the variables

Xn

. Then by the central limit theorem,

\limn\toinftyFn(x)=\Phi(x)\equiv

x
\int
-infty
-1q2
2
\tfrac{1}{\sqrt{2\pi}}e

dq

for every

x

, as long as the mean and variance are finite.

The standardization of

\{Zi\}

ensures that the first two cumulants of

Xn

are
Fn
\kappa
1

=0

and
Fn
\kappa
2

=1.

Now assume that, in addition to having mean

\mu

and variance

\sigma2

, the i.i.d. random variables

Zi

have higher cumulants

\kappar

. From the additivity and homogeneity properties of cumulants, the cumulants of

Xn

in terms of the cumulants of

Zi

are for

r\geq2

,
Fn
\kappa
r

=

n\kappar
\sigmarnr/2

=

λr
nr/2

whereλr=

\kappar
\sigmar

.

If we expand the formal expression of the characteristic function

\hat{f}n(t)

of

Fn

in terms of the standard normal distribution, that is, if we set
\phi(x)=1
\sqrt{2\pi
}\exp(-\tfracx^2),

then the cumulant differences in the expansion are

Fn
\kappa
1-\gamma

1=0,

Fn
\kappa
2-\gamma

2=0,

Fn
\kappa
r-\gamma

r=

λr
nr/2-1

;    r\geq3.

The Gram–Charlier A series for the density function of

Xn

is now

fn(x)=\phi(x)

infty
\sum
r=0
1
r!

Br\left(0,0,

λ3,\ldots,
n1/2
λr
nr/2-1

\right)Her(x).

The Edgeworth series is developed similarly to the Gram–Charlier A series, only that now terms are collected according to powers of

n

. The coefficients of nm/2 term can be obtained by collecting the monomials of the Bell polynomials corresponding to the integer partitions of m. Thus, we have the characteristic function as

\hat{f}n(t)=\left[1+\sum

infty
j=1
Pj(it)
nj/2

\right]\exp(-t2/2),

where

Pj(x)

is a polynomial of degree

3j

. Again, after inverse Fourier transform, the density function

fn

follows as

fn(x)=\phi(x)+

infty
\sum
j=1
Pj(-D)
nj/2

\phi(x).

Likewise, integrating the series, we obtain the distribution function

Fn(x)=\Phi(x)+

infty
\sum
j=1
1
nj/2
Pj(-D)
D

\phi(x).

We can explicitly write the polynomial

Pm(-D)

as

Pm(-D)=\sum\prodi

1\left(
ki!
λ
li
li!
ki
\right)

(-D)s,

where the summation is over all the integer partitions of m such that

\sumiiki=m

and

li=i+2

and

s=\sumikili.

For example, if m = 3, then there are three ways to partition this number: 1 + 1 + 1 = 2 + 1 = 3. As such we need to examine three cases:

Thus, the required polynomial is

\begin{align} P3(-D)&=

1\left(
3!
λ3
3!

\right)3(-D)9+

1\left(
1!1!
λ3
3!

\right)\left(

λ4
4!

\right)(-D)7+

1\left(
1!
λ5
5!

\right)(-D)5\\ &=

3
λ
3
1296

(-D)9+

λ3λ4
144

(-D)7+

λ5
120

(-D)5. \end{align}

The first five terms of the expansion are

\begin{align} fn(x)&=\phi(x)\\ &   -

1
1
2
n
(3)
\left(\tfrac{1}{6}λ
3\phi

(x)\right)\\ &   +

1
n
(4)
\left(\tfrac{1}{24}λ
4\phi

(x)+

2\phi
\tfrac{1}{72}λ
3

(6)(x)\right)\\ &   -

1
3
2
n
(5)
\left(\tfrac{1}{120}λ
5\phi

(x)+\tfrac{1}{144}λ

(7)
4\phi

(x)+

3\phi
\tfrac{1}{1296}λ
3

(9)(x)\right)\\ &+

1
n2
(6)
\left(\tfrac{1}{720}λ
6\phi

(x)+

2
\left(\tfrac{1}{1152}λ
4

+\tfrac{1}{720}λ

(8)
5\right)\phi

(x)+

(10)
\tfrac{1}{1728}λ
4\phi

(x)+

4\phi
\tfrac{1}{31104}λ
3

(12)(x)\right)\\ &+O\left

-5
2
(n

\right). \end{align}

Here, is the j-th derivative of at point x. Remembering that the derivatives of the density of the normal distribution are related to the normal density by

\phi(n)(x)=(-1)nHen(x)\phi(x)

, (where

Hen

is the Hermite polynomial of order n), this explains the alternative representations in terms of the density function. Blinnikov and Moessner (1998) have given a simple algorithm to calculate higher-order terms of the expansion.

Note that in case of a lattice distributions (which have discrete values), the Edgeworth expansion must be adjusted to account for the discontinuous jumps between lattice points.[5]

Illustration: density of the sample mean of three χ² distributions

Take

Xi\sim\chi2(k=2),i=1,2,3(n=3)

and the sample mean

\barX=

1
3
3
\sum
i=1

Xi

.

We can use several distributions for

\barX

:

\barX\simGamma\left(\alpha=nk/2,\theta=2/n\right)=Gamma\left(\alpha=3,\theta=2/3\right)

.

\barX\xrightarrow{n\toinfty}N(k,2 ⋅ k/n)=N(2,4/3)

.

Discussion of results

[0,1]

.

See also

Further reading

Notes and References

  1. Stuart, A., & Kendall, M. G. (1968). The advanced theory of statistics. Hafner Publishing Company.
  2. Book: Kolassa . John E. . Series approximation methods in statistics . 2006 . Springer . 0387322272 . 3rd.
  3. Wallace . D. L. . 1958 . Asymptotic Approximations to Distributions . Annals of Mathematical Statistics . 29 . 3 . 635–654 . 2237255 . 10.1214/aoms/1177706528. free .
  4. Hall, P. (2013). The bootstrap and Edgeworth expansion. Springer Science & Business Media.
  5. Kolassa . John E. . Peter . McCullagh . Edgeworth series for lattice distributions . . 18 . 2 . 1990 . 981–985 . 2242145 . 10.1214/aos/1176347637. free .