In statistics, the Vuong closeness test is a likelihood-ratio-based test for model selection using the Kullback–Leibler information criterion. This statistic makes probabilistic statements about two models. They can be nested, strictly non-nested or partially non-nested (also called overlapping). The statistic tests the null hypothesis that the two models are equally close to the true data generating process, against the alternative that one model is closer. It cannot make any decision whether the "closer" model is the true model.
With strictly non-nested models and iid exogenous variables, model 1 (2) is preferred with significance level α, if the z statistic
Z= | \operatorname{LR |
N(\beta |
ML,1,\betaML,2)}{\sqrt{N}\omegaN}
with
\operatorname{LR}N(\betaML,1,\betaML,2)=
2 | ||||
L | ||||
|
logN
exceeds the positive (falls below the negative) (1 - α)-quantile of the standard normal distribution. Here K1 and K2 are the numbers of parameters in models 1 and 2 respectively.
The numerator is the difference between the maximum likelihoods of the two models, corrected for the number of coefficients analogous to the BIC, the term in the denominator of the expression for Z,
\omegaN
2 | |
\omega | |
N |
\elli
\elli=log
f1(yi\midxi,\betaML,1) | |
f2(yi\midxi,\betaML,2) |
.
For nested or partially non-nested (overlapping) models the statistic
2\operatorname{LR}N(\betaML,1,\betaML,2)
has to be compared to critical values from a weighted sum of chi squared distributions. This can be approximated by a gamma distribution (in shape-rate form):
Mm( ⋅ ,λ)\sim\Gamma(b,p)
with
λ=(λ1,λ2,...,λm),
m=K1+K2, b=
1 | |
2 |
| |||||||
|
and
p= | 1 |
2 |
\sumλi | ||||||
|
.
λ
Vuong's test for non-nested models has been used in model selection to compare a zero-inflated count model to its non-zero-inflated counterpart (e.g., zero-inflated Poisson model versus ordinary Poisson model). Wilson (2015) argues that such use of Vuong's test is invalid as a non-zero-inflated model is neither strictly non-nested nor partially non-nested in its zero-inflated counterpart. The core of the misunderstanding appears to be the terminology, which offers itself to being incorrectly understood to imply that all pairs of non-nested models are either strictly non-nested or partially non-nested (aka overlapping). Crucially, the definitions of strictly non-nested and partially non-nested in Vuong (1989) do not unite to mean "all pairs of models that are not nested". In other words, there are non-nested models that are neither strictly non-nested nor partially non-nested. The zero-inflated Poisson model and its non-zero-inflated counterpart are an example of such a pair of non-nested models. Consequently, Vuong's test is not a valid test for discriminating between them.
Vuong (1989) gives two examples of strictly non-nested models:
Yt=\theta1+\theta2'Zt+\epsilon
A | |
t |
Yt=\exp(\gamma1+\gamma2'Zt)+\epsilon
B | |
t |
\theta2 ≠ 0,\gamma2 ≠ 0
Zt
Vuong (1989) also gives an intuitive example of partially non-nested (aka overlapping) models: