In probability theory, the de Moivre–Laplace theorem, which is a special case of the central limit theorem, states that the normal distribution may be used as an approximation to the binomial distribution under certain conditions. In particular, the theorem shows that the probability mass function of the random number of "successes" observed in a series of
n
p
n
np
n
p
0
1
The theorem appeared in the second edition of The Doctrine of Chances by Abraham de Moivre, published in 1738. Although de Moivre did not use the term "Bernoulli trials", he wrote about the probability distribution of the number of times "heads" appears when a coin is tossed 3600 times.[1]
This is one derivation of the particular Gaussian function used in the normal distribution.
It is a special case of the central limit theorem because a Bernoulli process can be thought of as the drawing of independent random variables from a bimodal discrete distribution with non-zero probability only for values 0 and 1. In this case, the binomial distribution models the number of successes (i.e., the number of 1s), whereas the central limit theorem states that, given sufficiently large n, the distribution of the sample means will be approximately normal. However, because in this case the fraction of successes (i.e., the number of 1s divided by the number of trials, n) is equal to the sample mean, the distribution of the fractions of successes (described by the binomial distribution divided by the constant n) and the distribution of the sample means (approximately normal with large n due to the central limit theorem) are equivalent.
As n grows large, for k in the neighborhood of np we can approximate[2] [3]
{n\choosek}pkqn-k\simeq
1 | |
\sqrt{2\pinpq |
The theorem can be more rigorously stated as follows:
\left(X-np\right)/\sqrt{npq}
styleX
n\toinfty
X
c
X
k
k=np+c\sqrt{npq}
For example, with
c
k
The normal distribution with mean
\mu
\sigma
f'(x)=- | x-\mu |
\sigma2 |
f(x)
infty | |
\int | |
-infty |
f(x)dx=1
stylep(k+1)-p(k)
stylen\toinfty
f'(x) | |
f(x) |
⋅ \left(-
\sigma2 | |
x-\mu |
\right)\to1
n\toinfty
The required result can be shown directly:
\begin{align} | f'(x) |
f(x) |
npq | &= | |
np-k |
p\left(n,k+1\right)-p\left(n,k\right) | |
p\left(n,k\right) |
\sqrt{npq | |
The last holds because the term
-cnpq
n\toinfty
As
stylek
stylec
style{0.5}/\sqrt{npq}
The proof consists of transforming the left-hand side (in the statement of the theorem) to the right-hand side by three approximations.
First, according to Stirling's formula, the factorial of a large number n can be replaced with the approximation
n!\simeqnne-n\sqrt{2\pin} asn\toinfty.
Thus
\begin{align}{n\choosek}pkqn-k&=
n! | |
k!(n-k)! |
pkqn-k\\&\simeq
nne-n\sqrt{2\pin | |
}{k |
ke-k\sqrt{2\pik}(n-k)n-ke-(n-k)\sqrt{2\pi(n-k)}}pkqn-k\\&=\sqrt{
n | |
2\pik\left(n-k\right) |
Next, the approximation
\tfrac{k}{n}\top
\begin{align}{n\choosek}pkqn-k&\simeq\sqrt{
1 | ||||||||
|
Finally, the expression is rewritten as an exponential and the Taylor Series approximation for ln(1+x) is used:
ln\left(1+x\right)\simeqx-
x2 | + | |
2 |
x3 | |
3 |
- …
Then
\begin{align}{n\choosek}pkqn-k&\simeq
1 | |
\sqrt{2\pinpq |
Each "
\simeq