Exponential dispersion model explained

In probability and statistics, the class of exponential dispersion models (EDM), also called exponential dispersion family (EDF), is a set of probability distributions that represents a generalisation of the natural exponential family.[1] [2] [3] Exponential dispersion models play an important role in statistical theory, in particular in generalized linear models because they have a special structure which enables deductions to be made about appropriate statistical inference.

Definition

Univariate case

There are two versions to formulate an exponential dispersion model.

Additive exponential dispersion model

In the univariate case, a real-valued random variable

X

belongs to the additive exponential dispersion model with canonical parameter

\theta

and index parameter

λ

,

X\simED*(\theta,λ)

, if its probability density function can be written as

fX(x\mid\theta,λ)=h*(λ,x)\exp\left(\thetax-λA(\theta)\right).

Reproductive exponential dispersion model

The distribution of the transformed random variable

Y=X
λ
is called reproductive exponential dispersion model,

Y\simED(\mu,\sigma2)

, and is given by

fY(y\mid\mu,\sigma2)=h(\sigma2,y)\exp\left(

\thetay-A(\theta)
\sigma2

\right),

with

\sigma2=

1
λ
and

\mu=A'(\theta)

, implying

\theta=(A')-1(\mu)

.The terminology dispersion model stems from interpreting

\sigma2

as dispersion parameter. For fixed parameter

\sigma2

, the

ED(\mu,\sigma2)

is a natural exponential family.

Multivariate case

In the multivariate case, the n-dimensional random variable

X

has a probability density function of the following form[1]

fX(x|\boldsymbol{\theta},λ)=h(λ,x)\exp\left(λ(\boldsymbol\theta\topx-A(\boldsymbol\theta))\right),

where the parameter

\boldsymbol\theta

has the same dimension as

X

.

Properties

Cumulant-generating function

The cumulant-generating function of

Y\simED(\mu,\sigma2)

is given by

K(t;\mu,\sigma2)=log\operatorname{E}[etY]=

A(\theta+\sigma2t)-A(\theta)
\sigma2

,

with

\theta=(A')-1(\mu)

Mean and variance

Mean and variance of

Y\simED(\mu,\sigma2)

are given by

\operatorname{E}[Y]=\mu=A'(\theta),\operatorname{Var}[Y]=\sigma2A''(\theta)=\sigma2V(\mu),

with unit variance function

V(\mu)=A''((A')-1(\mu))

.

Reproductive

If

Y1,\ldots,Yn

are i.i.d. with
Y
i\simED\left(\mu,\sigma2
wi

\right)

, i.e. same mean

\mu

and different weights

wi

, the weighted mean is again an

ED

with
n
\sum
i=1
wiYi
w\bullet

\simED\left(\mu,

\sigma2
w\bullet

\right),

with

w\bullet=

n
\sum
i=1

wi

. Therefore

Yi

are called reproductive.

Unit deviance

The probability density function of an

ED(\mu,\sigma2)

can also be expressed in terms of the unit deviance

d(y,\mu)

as

fY(y\mid\mu,\sigma2)=\tilde{h}(\sigma2,y)\exp\left(-

d(y,\mu)
2\sigma2

\right),

where the unit deviance takes the special form

d(y,\mu)=yf(\mu)+g(\mu)+h(y)

or in terms of the unit variance function as

d(y,\mu)=2

y
\int
\mu
y-t
V(t)

dt

.

Examples

Many very common probability distributions belong to the class of EDMs, among them are: normal distribution, binomial distribution, Poisson distribution, negative binomial distribution, gamma distribution, inverse Gaussian distribution, and Tweedie distribution.

Notes and References

  1. Jørgensen, B. (1987). Exponential dispersion models (with discussion). Journal of the Royal Statistical Society, Series B, 49 (2), 127 - 162.
  2. Jørgensen, B. (1992). The theory of exponential dispersion models and analysis of deviance. Monografias de matemática, no. 51.
  3. Marriott, P. (2005) "Local Mixtures and Exponential DispersionModels" pdf