Negative multinomial distribution explained

In probability theory and statistics, the negative multinomial distribution is a generalization of the negative binomial distribution (NB(x0, p)) to more than two outcomes.[1]

As with the univariate negative binomial distribution, if the parameter

x0

is a positive integer, the negative multinomial distribution has an urn model interpretation. Suppose we have an experiment that generates m+1≥2 possible outcomes,, each occurring with non-negative probabilities respectively. If sampling proceeded until n observations were made, then would have been multinomially distributed. However, if the experiment is stopped once X0 reaches the predetermined value x0 (assuming x0 is a positive integer), then the distribution of the m-tuple is negative multinomial. These variables are not multinomially distributed because their sum X1+...+Xm is not fixed, being a draw from a negative binomial distribution.

Properties

Marginal distributions

If m-dimensional x is partitioned as follows\mathbf=\begin \mathbf^ \\ \mathbf^\end

\text\begin n \times 1 \\ (m-n) \times 1 \endand accordingly

\boldsymbol{p}

\boldsymbol p=\begin \boldsymbol p^ \\ \boldsymbol p^\end\text\begin n \times 1 \\ (m-n) \times 1 \endand letq = 1-\sum_i p_i^ = p_0+\sum_i p_i^

The marginal distribution of

\boldsymbolX(1)

is

NM(x0,p0/q,\boldsymbolp(1)/q)

. That is the marginal distribution is also negative multinomial with the

\boldsymbolp(2)

removed and the remaining ps properly scaled so as to add to one.

The univariate marginal

m=1

is said to have a negative binomial distribution.

Conditional distributions

The conditional distribution of

X(1)

given

X(2)=x(2)

is \mathrm(x_0+\sum,\mathbf^) . That is,\Pr(\mathbf^\mid \mathbf^, x_0, \mathbf)= \Gamma\!\left(\sum_^m\right)\frac\prod_^n.

Independent sums

If

X1\simNM(r1,p)

and If

X2\simNM(r2,p)

are independent, then

X1+X2\simNM(r1+r2,p)

. Similarly and conversely, it is easy to see from the characteristic function that the negative multinomial is infinitely divisible.

Aggregation

If\mathbf = (X_1, \ldots, X_m)\sim\operatorname(x_0, (p_1,\ldots,p_m))then, if the random variables with subscripts i and j are dropped from the vector and replaced by their sum,\mathbf' = (X_1, \ldots, X_i + X_j, \ldots, X_m)\sim\operatorname (x_0, (p_1, \ldots, p_i + p_j, \ldots, p_m)).

This aggregation property may be used to derive the marginal distribution of

Xi

mentioned above.

Correlation matrix

The entries of the correlation matrix are\rho(X_i,X_i) = 1.\rho(X_i,X_j) = \frac = \sqrt.

Parameter estimation

Method of Moments

If we let the mean vector of the negative multinomial be\boldsymbol=\frac\mathbfand covariance matrix\boldsymbol=\tfrac\,\mathbf\mathbf' + \tfrac\,\operatorname(\mathbf),then it is easy to show through properties of determinants that |\boldsymbol| = \frac\prod_^m. From this, it can be shown thatx_0=\frac

-\prod
and \mathbf= \frac
-\prod
\sum
\boldsymbol.

Substituting sample moments yields the method of moments estimates\hat_0=\frac

-\prod_^
and\hat=\left(\frac
-\prod_^
\sum_^
\right)\boldsymbol

Related distributions

References

  1. Le Gall, F. The modes of a negative multinomial distribution, Statistics & Probability Letters, Volume 76, Issue 6, 15 March 2006, Pages 619-624, ISSN 0167-7152, 10.1016/j.spl.2005.09.009.

Waller LA and Zelterman D. (1997). Log-linear modeling with the negative multi-nomial distribution. Biometrics 53: 971–82.

Further reading

Book: Johnson . Norman L. . Kotz . Samuel . Balakrishnan . N.. Discrete Multivariate Distributions . Chapter 36: Negative Multinomial and Other Multinomial-Related Distributions . 1997 . Wiley . 978-0-471-12844-1.