The Gram–Charlier A series (named in honor of Jørgen Pedersen Gram and Carl Charlier), and the Edgeworth series (named in honor of Francis Ysidro Edgeworth) are series that approximate a probability distribution in terms of its cumulants.[1] The series are the same; but, the arrangement of terms (and thus the accuracy of truncating the series) differ.[2] The key idea of these expansions is to write the characteristic function of the distribution whose probability density function is to be approximated in terms of the characteristic function of a distribution with known and suitable properties, and to recover through the inverse Fourier transform.
We examine a continuous random variable. Let
\hat{f}
\kappar
\hat{\psi}
\gammar
\hat{f}(t)=
infty\kappa | ||||
\exp\left[\sum | ||||
|
\right]
infty\gamma | ||||
\hat{\psi}(t)=\exp\left[\sum | ||||
|
\right],
infty(\kappa | |
\hat{f}(t)=\exp\left[\sum | |
r-\gamma |
|
\right]\hat{\psi}(t).
By the properties of the Fourier transform,
(it)r\hat{\psi}(t)
(-1)r[Dr\psi](-x)
x
-x
f(x)=
infty(\kappa | |
\exp\left[\sum | |
r |
-
\gamma | ||||
|
\right]\psi(x).
If is chosen as the normal density
\phi(x)=
1 | \sigma}\exp\left[- | |
\sqrt{2\pi |
(x-\mu)2 | |
2\sigma2 |
\right]
with mean and variance as given by, that is, mean
\mu=\kappa1
\sigma2=\kappa2
f(x)=
infty\kappa | ||||
\exp\left[\sum | ||||
|
\right]\phi(x),
since
\gammar=0
infty\kappa | ||||
\exp\left[\sum | ||||
|
\right]=
infty | |
\sum | |
n=0 |
Bn(0,0,\kappa3,\ldots,\kappa
|
.
Since the n-th derivative of the Gaussian function
\phi
\phi(n)(x)=
(-1)n | |
\sigman |
Hen\left(
x-\mu | |
\sigma |
\right)\phi(x),
this gives us the final expression of the Gram–Charlier A series as
f(x)=\phi(x)
infty | |
\sum | |
n=0 |
1 | |
n!\sigman |
Bn(0,0,\kappa3,\ldots,\kappan)Hen\left(
x-\mu | |
\sigma |
\right).
F(x)=
x | |
\int | |
-infty |
f(u)du=\Phi(x)-\phi(x)
infty | |
\sum | |
n=3 |
1 | |
n!\sigman-1 |
Bn(0,0,\kappa3,\ldots,\kappan)Hen-1\left(
x-\mu | |
\sigma |
\right),
where
\Phi
If we include only the first two correction terms to the normal distribution, we obtain
f(x) ≈
1 | \sigma}\exp\left[- | |
\sqrt{2\pi |
(x-\mu)2 | \right]\left[1+ | |
2\sigma2 |
\kappa3 | |
3!\sigma3 |
He | \right)+ | ||||
|
\kappa4 | |
4!\sigma4 |
He | ||||
|
\right)\right],
with
3-3x | |
He | |
3(x)=x |
4 | |
He | |
4(x)=x |
-6x2+3
Note that this expression is not guaranteed to be positive, and is therefore not a valid probability distribution. The Gram–Charlier A series diverges in many cases of interest—it converges only if
f(x)
\exp(-(x2)/4)
Edgeworth developed a similar expansion as an improvement to the central limit theorem.[4] The advantage of the Edgeworth series is that the error is controlled, so that it is a true asymptotic expansion.
Let
\{Zi\}
\mu
\sigma2
Xn
Xn=
1 | |
\sqrt{n |
Let
Fn
Xn
\limn\toinftyFn(x)=\Phi(x)\equiv
x | |
\int | |
-infty |
| |||||
\tfrac{1}{\sqrt{2\pi}}e |
dq
for every
x
The standardization of
\{Zi\}
Xn
Fn | |
\kappa | |
1 |
=0
Fn | |
\kappa | |
2 |
=1.
\mu
\sigma2
Zi
\kappar
Xn
Zi
r\geq2
Fn | |
\kappa | |
r |
=
n\kappar | |
\sigmarnr/2 |
=
λr | |
nr/2 |
where λr=
\kappar | |
\sigmar |
.
If we expand the formal expression of the characteristic function
\hat{f}n(t)
Fn
\phi(x)= | 1 |
\sqrt{2\pi |
then the cumulant differences in the expansion are
Fn | |
\kappa | |
1-\gamma |
1=0,
Fn | |
\kappa | |
2-\gamma |
2=0,
Fn | |
\kappa | |
r-\gamma |
r=
λr | |
nr/2-1 |
; r\geq3.
The Gram–Charlier A series for the density function of
Xn
fn(x)=\phi(x)
infty | |
\sum | |
r=0 |
1 | |
r! |
Br\left(0,0,
λ3 | ,\ldots, | |
n1/2 |
λr | |
nr/2-1 |
\right)Her(x).
The Edgeworth series is developed similarly to the Gram–Charlier A series, only that now terms are collected according to powers of
n
\hat{f}n(t)=\left[1+\sum
infty | |
j=1 |
Pj(it) | |
nj/2 |
\right]\exp(-t2/2),
where
Pj(x)
3j
fn
fn(x)=\phi(x)+
infty | |
\sum | |
j=1 |
Pj(-D) | |
nj/2 |
\phi(x).
Likewise, integrating the series, we obtain the distribution function
Fn(x)=\Phi(x)+
infty | |
\sum | |
j=1 |
1 | |
nj/2 |
Pj(-D) | |
D |
\phi(x).
We can explicitly write the polynomial
Pm(-D)
Pm(-D)=\sum\prodi
1 | \left( | |
ki! |
| |||||
li! |
ki | |
\right) |
(-D)s,
where the summation is over all the integer partitions of m such that
\sumiiki=m
li=i+2
s=\sumikili.
For example, if m = 3, then there are three ways to partition this number: 1 + 1 + 1 = 2 + 1 = 3. As such we need to examine three cases:
Thus, the required polynomial is
\begin{align} P3(-D)&=
1 | \left( | |
3! |
λ3 | |
3! |
\right)3(-D)9+
1 | \left( | |
1!1! |
λ3 | |
3! |
\right)\left(
λ4 | |
4! |
\right)(-D)7+
1 | \left( | |
1! |
λ5 | |
5! |
\right)(-D)5\\ &=
| |||||||
1296 |
(-D)9+
λ3λ4 | |
144 |
(-D)7+
λ5 | |
120 |
(-D)5. \end{align}
The first five terms of the expansion are
\begin{align} fn(x)&=\phi(x)\\ & -
1 | |||||||
|
(3) | |
\left(\tfrac{1}{6}λ | |
3\phi |
(x)\right)\\ & +
1 | |
n |
(4) | |
\left(\tfrac{1}{24}λ | |
4\phi |
(x)+
2\phi | |
\tfrac{1}{72}λ | |
3 |
(6)(x)\right)\\ & -
1 | |||||||
|
(5) | |
\left(\tfrac{1}{120}λ | |
5\phi |
(x)+\tfrac{1}{144}λ3λ
(7) | |
4\phi |
(x)+
3\phi | |
\tfrac{1}{1296}λ | |
3 |
(9)(x)\right)\\ & +
1 | |
n2 |
(6) | |
\left(\tfrac{1}{720}λ | |
6\phi |
(x)+
2 | |
\left(\tfrac{1}{1152}λ | |
4 |
+\tfrac{1}{720}λ3λ
(8) | |
5\right)\phi |
(x)+
(10) | |
\tfrac{1}{1728}λ | |
4\phi |
(x)+
4\phi | |
\tfrac{1}{31104}λ | |
3 |
(12)(x)\right)\\ & +O\left
| ||||
(n |
\right). \end{align}
Here, is the j-th derivative of at point x. Remembering that the derivatives of the density of the normal distribution are related to the normal density by
\phi(n)(x)=(-1)nHen(x)\phi(x)
Hen
Note that in case of a lattice distributions (which have discrete values), the Edgeworth expansion must be adjusted to account for the discontinuous jumps between lattice points.[5]
Take
Xi\sim\chi2(k=2),i=1,2,3(n=3)
\barX=
1 | |
3 |
3 | |
\sum | |
i=1 |
Xi
We can use several distributions for
\barX
\barX\simGamma\left(\alpha=n ⋅ k/2,\theta=2/n\right)=Gamma\left(\alpha=3,\theta=2/3\right)
\barX\xrightarrow{n\toinfty}N(k,2 ⋅ k/n)=N(2,4/3)
[0,1]