In probability theory, a compound Poisson distribution is the probability distribution of the sum of a number of independent identically-distributed random variables, where the number of terms to be added is itself a Poisson-distributed variable. The result can be either a continuous or a discrete distribution.
Suppose that
N\sim\operatorname{Poisson}(λ),
i.e., N is a random variable whose distribution is a Poisson distribution with expected value λ, and that
X1,X2,X3,...
are identically distributed random variables that are mutually independent and also independent of N. Then the probability distribution of the sum of
N
Y=
N | |
\sum | |
n=1 |
Xn
is a compound Poisson distribution.
In the case N = 0, then this is a sum of 0 terms, so the value of Y is 0. Hence the conditional distribution of Y given that N = 0 is a degenerate distribution.
The compound Poisson distribution is obtained by marginalising the joint distribution of (Y,N) over N, and this joint distribution can be obtained by combining the conditional distribution Y | N with the marginal distribution of N.
The expected value and the variance of the compound distribution can be derived in a simple way from law of total expectation and the law of total variance. Thus
\operatorname{E}(Y)=\operatorname{E}\left[\operatorname{E}(Y\midN)\right]=\operatorname{E}\left[N\operatorname{E}(X)\right]=\operatorname{E}(N)\operatorname{E}(X),
\begin{align} \operatorname{Var}(Y)&=\operatorname{E}\left[\operatorname{Var}(Y\midN)\right]+\operatorname{Var}\left[\operatorname{E}(Y\midN)\right]=\operatorname{E}\left[N\operatorname{Var}(X)\right]+\operatorname{Var}\left[N\operatorname{E}(X)\right],\\[6pt] &=\operatorname{E}(N)\operatorname{Var}(X)+\left(\operatorname{E}(X)\right)2\operatorname{Var}(N). \end{align}
Then, since E(N) = Var(N) if N is Poisson-distributed, these formulae can be reduced to
\operatorname{E}(Y)=\operatorname{E}(N)\operatorname{E}(X)=λ\operatorname{E}(X),
\operatorname{Var}(Y)=\operatorname{E}(N)(\operatorname{Var}(X)+(\operatorname{E}(X))2)=\operatorname{E}(N){\operatorname{E}(X2)}=λ{\operatorname{E}(X2)}.
The probability distribution of Y can be determined in terms of characteristic functions:
\varphiY(t)=\operatorname{E}(eitY)=\operatorname{E}\left(\left(\operatorname{E}(eitX\midN)\right)N\right)=\operatorname{E}
N\right), | |
\left((\varphi | |
X(t)) |
\varphiY(t)=
λ(\varphiX(t)-1) | |
rm{e} |
.
An alternative approach is via cumulant generating functions:
KY(t)=ln\operatorname{E}[etY]=ln\operatornameE[\operatornameE[etY\midN]]=ln\operatorname
NKX(t) | |
E[e |
]=KN(KX(t)).
Via the law of total cumulance it can be shown that, if the mean of the Poisson distribution λ = 1, the cumulants of Y are the same as the moments of X1.
Every infinitely divisible probability distribution is a limit of compound Poisson distributions.[1] And compound Poisson distributions is infinitely divisible by the definition.
When
X1,X2,X3,...
P(X1=k)=\alphak, (k=1,2,\ldots)
Y
PY(z)=
infty | |
\sum\limits | |
i=0 |
P(Y=i)zi=
infty | |
\exp\left(\sum\limits | |
k=1 |
\alphakλ(zk-1)\right), (|z|\le1)
has a discrete compound Poisson(DCP) distribution with parameters
(\alpha1λ,\alpha2λ,\ldots)\inRinfty
X\sim{DCP
Moreover, if
X\sim{\operatorname{DCP}}(λ{\alpha1},\ldots,λ{\alphar})
X
r
r=1,2
r=3,4
Feller's characterization of the compound Poisson distribution states that a non-negative integer valued r.v.
X
This distribution can model batch arrivals (such as in a bulk queue[5] [9]). The discrete compound Poisson distribution is also widely used in actuarial science for modelling the distribution of the total claim amount.[3]
When some
\alphak
Y
GY(z)=
infty | |
\sum\limits | |
i=0 |
P(Y=i)zi=
infty | |
\exp\left(\sum\limits | |
k=1 |
\alphakλ(zk-1)\right), (|z|\le1)
has a discrete pseudo compound Poisson distribution with parameters
(λ1,λ2,\ldots)=:(\alpha1λ,\alpha2λ,\ldots)\inRinfty
{\alphai}\inR,λ>0
If X has a gamma distribution, of which the exponential distribution is a special case, then the conditional distribution of Y | N is again a gamma distribution. The marginal distribution of Y is a Tweedie distribution[10] with variance power 1 < p < 2 (proof via comparison of characteristic function (probability theory)). To be more explicit, if
N\sim\operatorname{Poisson}(λ),
and
Xi\sim\operatorname{\Gamma}(\alpha,\beta)
i.i.d., then the distribution of
Y=
N | |
\sum | |
i=1 |
Xi
ED(\mu,\sigma2)
\begin{align} \operatorname{E}[Y]&=λ
\alpha | |
\beta |
=:\mu,\\[4pt] \operatorname{Var}[Y]&=λ
\alpha(1+\alpha) | |
\beta2 |
=:\sigma2\mup. \end{align}
The mapping of parameters Tweedie parameter
\mu,\sigma2,p
λ,\alpha,\beta
\begin{align} λ&=
\mu2-p | |
(2-p)\sigma2 |
, \\[4pt] \alpha&=
2-p | |
p-1 |
, \\[4pt] \beta&=
\mu1-p | |
(p-1)\sigma2 |
. \end{align}
See main article: Compound Poisson process.
A compound Poisson process with rate
λ>0
\{Y(t):t\geq0\}
Y(t)=
N(t) | |
\sum | |
i=1 |
Di,
where the sum is by convention equal to zero as long as N(t) = 0. Here,
\{N(t):t\geq0\}
λ
\{Di:i\geq1\}
\{N(t):t\geq0\}.
For the discrete version of compound Poisson process, it can be used in survival analysis for the frailty models.[12]
A compound Poisson distribution, in which the summands have an exponential distribution, was used by Revfeim to model the distribution of the total rainfall in a day, where each day contains a Poisson-distributed number of events each of which provides an amount of rainfall which has an exponential distribution.[13] Thompson applied the same model to monthly total rainfalls.[14]
There have been applications to insurance claims[15] [16] and x-ray computed tomography.[17] [18] [19]