In the mathematical theory of probability, the Indian buffet process (IBP) is a stochastic process defining a probability distribution over sparse binary matrices with a finite number of rows and an infinite number of columns. This distribution is suitable to use as a prior for models with potentially infinite number of features. The form of the prior ensures that only a finite number of features will be present in any finite set of observations but more features may appear as more data points are observed.
Let
Z
N x K
Z
p(Z)=
| |||||||||||||||
|
\exp\{-\alphaHN\}\prod
K+ | |
k=1 |
(N-mk)!(mk-1)! | |
N! |
where
{K}
Z
mk
k
Z
HN
N
(i) | |
K | |
1 |
i
\alpha
In the Indian buffet process, the rows of
Z
Poisson(\alpha)
i
mk/i
mk
k
Poisson(\alpha/i)
znk
n
k
This process is infinitely exchangeable for an equivalence class of binary matrices defined by a left-ordered many-to-one function.
\operatorname{lof}(Z)
Z