In probability theory, the zero-truncated Poisson (ZTP) distribution is a certain discrete probability distribution whose support is the set of positive integers. This distribution is also known as the conditional Poisson distribution[1] or the positive Poisson distribution.[2] It is the conditional probability distribution of a Poisson-distributed random variable, given that the value of the random variable is not zero. Thus it is impossible for a ZTP random variable to be zero. Consider for example the random variable of the number of items in a shopper's basket at a supermarket checkout line. Presumably a shopper does not stand in line with nothing to buy (i.e., the minimum purchase is 1 item), so this phenomenon may follow a ZTP distribution.[3]
Since the ZTP is a truncated distribution with the truncation stipulated as k > 0, one can derive the probability mass function g(k;λ) from a standard Poisson distribution f(k;λ) as follows:[4]
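$$g(k;\lambda) = P(X = k \mid X > 0) = \frac{f(k;\lambda)}{1 - f(0;\lambda)} = \frac{\lambda^{k} e^{-\lambda}}{k!\left(1 - e^{-\lambda}\right)} = \frac{\lambda^{k}}{\left(e^{\lambda} - 1\right) k!}$$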
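The probability mass function can be evaluated directly. The following is a minimal Python sketch, not a reference implementation; the function name `ztp_pmf` and the choice to work in log space are assumptions made for illustration.

```python
import math

def ztp_pmf(k, lam):
    """P(X = k) for a zero-truncated Poisson with parameter lam > 0."""
    if k < 1:
        return 0.0  # zero and negative values lie outside the support
    # log(e^lam - 1) written as lam + log(1 - e^(-lam)) so that large lam
    # does not overflow; -expm1(-lam) evaluates 1 - e^(-lam) stably.
    log_norm = lam + math.log(-math.expm1(-lam))
    # log of lam^k / k!, using lgamma(k + 1) = log(k!)
    log_p = k * math.log(lam) - math.lgamma(k + 1) - log_norm
    return math.exp(log_p)

# The probabilities over the support should sum to 1 (up to rounding).
total = sum(ztp_pmf(k, 2.0) for k in range(1, 100))
print(total)
```

Working with logarithms avoids overflow in λ^k and k! for large k, at the cost of one extra exponential per evaluation.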
The mean is
$$\operatorname{E}[X] = \frac{\lambda}{1 - e^{-\lambda}} = \frac{\lambda e^{\lambda}}{e^{\lambda} - 1}$$
and the variance is
$$\operatorname{Var}[X] = \frac{\lambda + \lambda^{2}}{1 - e^{-\lambda}} - \frac{\lambda^{2}}{\left(1 - e^{-\lambda}\right)^{2}} = \operatorname{E}[X]\left(1 + \lambda - \operatorname{E}[X]\right)$$
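The closed forms for the mean and variance can be checked numerically against direct summation over the support. The sketch below assumes a moderate λ so that a few hundred terms suffice; the function names and the cutoff `kmax` are illustrative choices, not part of any standard library.

```python
import math

def ztp_mean_var(lam):
    """Closed-form (mean, variance) of a zero-truncated Poisson."""
    p_pos = -math.expm1(-lam)            # 1 - exp(-lam), computed stably
    mean = lam / p_pos
    var = mean * (1.0 + lam - mean)      # compact form of the variance
    return mean, var

def ztp_moments_by_summation(lam, kmax=200):
    """(mean, variance) from summing k*p(k) and k^2*p(k) for k = 1..kmax."""
    p = lam * math.exp(-lam) / -math.expm1(-lam)   # p(1)
    m1 = m2 = 0.0
    for k in range(1, kmax + 1):
        m1 += k * p
        m2 += k * k * p
        p *= lam / (k + 1)               # p(k+1) = p(k) * lam / (k+1)
    return m1, m2 - m1 * m1

mean, var = ztp_mean_var(3.0)
mean_sum, var_sum = ztp_moments_by_summation(3.0)
print(mean, mean_sum)
print(var, var_sum)
```

The two pairs of printed values should agree to within floating-point rounding.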
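The method of moments estimator $\widehat{\lambda}$ for the parameter $\lambda$ is obtained by solving
$$\frac{\widehat{\lambda}}{1 - e^{-\widehat{\lambda}}} = \bar{x},$$
where $\bar{x}$ is the sample mean.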
This equation has a solution in terms of the Lambert W function. In practice, a solution may be found using numerical methods.
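One possible numerical approach is bisection on the mean equation. The sketch below is an illustration under stated assumptions rather than a prescribed method: the names `ztp_mean` and `ztp_mom_estimate`, the tolerance, and the iteration cap are all hypothetical choices. It relies on the fact that λ / (1 − e^(−λ)) is increasing in λ, tends to 1 as λ tends to 0, and always exceeds λ, so for a sample mean above 1 the root lies between 0 and the sample mean.

```python
import math

def ztp_mean(lam):
    """Mean of a zero-truncated Poisson: lam / (1 - exp(-lam))."""
    return lam / -math.expm1(-lam)

def ztp_mom_estimate(xbar, tol=1e-12, max_iter=200):
    """Solve ztp_mean(lam) = xbar for lam by bisection."""
    if xbar <= 1.0:
        raise ValueError("the sample mean of positive counts must exceed 1")
    lo, hi = 0.0, xbar                   # the root is bracketed by (0, xbar)
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        if ztp_mean(mid) < xbar:
            lo = mid                     # mean too small: lam must be larger
        else:
            hi = mid
        if hi - lo <= tol * max(1.0, hi):
            break
    return 0.5 * (lo + hi)

lam_hat = ztp_mom_estimate(2.0)
print(lam_hat, ztp_mean(lam_hat))
```

If SciPy is available, the closed form `xbar + scipy.special.lambertw(-xbar * math.exp(-xbar)).real`, using the principal branch, should agree with the bisection result.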
Insurance claims:
Consider auto insurance claims, where each claim corresponds to a distinct event such as an accident or an occurrence of damage. If the data include only policyholders who have filed at least one claim, a count of zero cannot occur, and the ZTP distribution is a natural model for the number of claims.
Let X denote the random variable representing the number of insurance claims. If λ is the rate parameter of the underlying Poisson distribution, the ZTP probability mass function takes the form:
$$P(X = k) = \frac{\lambda^{k} e^{-\lambda}}{k!\left(1 - e^{-\lambda}\right)}$$
This formula gives the probability of observing k claims given that at least one claim has occurred. The denominator 1 − e^(−λ) renormalizes the Poisson probabilities once the zero-claim outcome is excluded. The same model applies in quality control: a manufacturer can use the zero-truncated Poisson distribution to analyze and predict the number of defects per product when only products with at least one defect are considered.
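As a worked illustration of the claim-count formula, assume a hypothetical rate λ = 1 (a value chosen only to keep the arithmetic simple, not taken from any data set). The first few probabilities are then

$$P(X=1) = \frac{e^{-1}}{1 - e^{-1}} = \frac{1}{e - 1} \approx 0.582, \qquad P(X=2) = \frac{1}{2(e - 1)} \approx 0.291, \qquad P(X=3) = \frac{1}{6(e - 1)} \approx 0.097$$

For comparison, the untruncated Poisson distribution with the same λ gives P(X=1) = e^(−1) ≈ 0.368; conditioning on at least one claim scales every probability by 1/(1 − e^(−1)) ≈ 1.582.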
Random variates from the zero-truncated Poisson distribution may be generated using algorithms derived from Poisson distribution sampling algorithms.[5]
init:
    Let k ← 1, t ← e^(−λ) / (1 − e^(−λ)) * λ, s ← t.
    Generate uniform random number u in [0,1].
while s < u do:
    k ← k + 1.
    t ← t * λ / k.
    s ← s + t.
return k.
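The pseudocode above can be sketched in Python as follows. The function name `sample_ztp`, the optional `rng` argument, and the two guards against floating-point underflow are assumptions added for this illustration and are not part of the cited algorithm.

```python
import math
import random

def sample_ztp(lam, rng=None):
    """Draw one zero-truncated Poisson variate by inverting the CDF."""
    if lam <= 0.0:
        raise ValueError("lam must be positive")
    k = 1
    t = lam * math.exp(-lam) / -math.expm1(-lam)   # P(X = 1)
    if t == 0.0:
        # exp(-lam) underflowed; this simple sketch cannot handle such lam
        raise ValueError("lam too large for this sketch")
    s = t                                          # running CDF, P(X <= k)
    u = (rng or random).random()                   # uniform on [0, 1)
    # The extra check t > 0.0 stops the loop if rounding keeps s below u.
    while s < u and t > 0.0:
        k += 1
        t *= lam / k                               # P(X = k) from P(X = k - 1)
        s += t
    return k

samples = [sample_ztp(2.0) for _ in range(5)]
print(samples)
```

Passing a seeded `random.Random` instance as `rng` makes the draws reproducible.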
The cost of the procedure above is linear in k, which may be large for large values of λ.