In probability theory, the Bapat–Beg theorem gives the joint probability distribution of order statistics of independent but not necessarily identically distributed random variables in terms of the cumulative distribution functions of the random variables. Ravindra Bapat and M.I. Beg published the theorem in 1989,[1] though they did not offer a proof. A simple proof was offered by Hande in 1994.[2]
Often, all elements of the sample are obtained from the same population and thus have the same probability distribution. The Bapat–Beg theorem describes the order statistics when each element of the sample is obtained from a different statistical population and therefore has its own probability distribution.
Let
X1,X2,\ldots,Xn
F1(x),F2(x),\ldots,Fn(x)
X(1),X(2),\ldots,X(n)
n1,n2\ldots,nk
n1<n2< … <nk
x1<x2< … <xk
\begin{align}
F | |||||||||||
|
(x1,\ldots,xk) &=\Pr(
X | |
(n1) |
\leqx1\land
X | |
(n2) |
\leqx2\land … \land
X | |
(nk) |
\leqxk)\\ &=
n | |
\sum | |
ik=nk |
i3 | |
… \sum | |
i2=n2 |
\sum
i2 | |
i1=n1 |
| |||||||
i1!(i2-i1)! … (n-ik)! |
,\end{align}
where
\begin{align} P | |
i1,\ldots,ik |
(x1,\ldots,xk)=\operatorname{per} \begin{bmatrix} F1(x1) … F1(x1)&F1(x2)-F1(x1) … F1(x2)-F1(x1)& … &1-F1(xk) … 1-F1(xk)\\ F2(x1) … F2(x1)&F2(x2)-F2(x1) … F2(x2)-F2(x1)& … &1-F2(xk) … 1-F1(xk)\\ \vdots&\vdots&&\vdots\\ \underbrace{Fn(x1) … Fn(x1)
} | |
i1 |
&\underbrace{Fn(x2)-Fn(x1) … Fn(x2)-Fn(x1)}
i2-i1 |
& … &\underbrace{1-Fn(xk) … 1-Fn(xk)
} | |
n-ik |
\end{bmatrix} \end{align}
is the permanent of the given block matrix. (The figures under the braces show the number of columns.)
In the case when the variables
X1,X2,\ldots,Xn
Fi=F
\begin{align} F | |||||||||||
|
(x1,\ldots,xk) =
n | |
\sum | |
ik=nk |
…
i3 | |
\sum | |
i2=n2 |
i2 | |
\sum | |
i1=n1 |
n!
| |||||||
i1! |
| |||||||
(n-ik)! |
k | |
\prod\limits | |
j=2 |
| |||||||
(ij-ij-1)! |
. \end{align}
Glueck et al. note that the Bapat‒Beg formula is computationally intractable, because it involves an exponential number of permanents of the size of the number of random variables.[3] However, when the random variables have only two possible distributions, the complexity can be reduced to
O(m2k)
m
k