Local asymptotic normality explained
In statistics, local asymptotic normality is a property of a sequence of statistical models, which allows this sequence to be asymptotically approximated by a normal location model, after an appropriate rescaling of the parameter. An important example when the local asymptotic normality holds is in the case of i.i.d sampling from a regular parametric model.
The notion of local asymptotic normality was introduced by and is fundamental in the treatment of estimator and test efficiency.[1]
Definition
A sequence of parametric statistical models is said to be locally asymptotically normal (LAN) at θ if there exist matrices rn and Iθ and a random vector such that, for every converging sequence,
ln
=h'\Deltan,\theta-
\thetah+
(1),
where the derivative here is a Radon–Nikodym derivative, which is a formalised version of the
likelihood ratio, and where
o is a type of
big O in probability notation. In other words, the local likelihood ratio must converge in distribution to a normal random variable whose mean is equal to minus one half the variance:
ln
\xrightarrow{d} l{N}({-\tfrac12}h'I\thetah, h'I\thetah).
The sequences of distributions
and
are
contiguous.
Example
The most straightforward example of a LAN model is an iid model whose likelihood is twice continuously differentiable. Suppose is an iid sample, where each Xi has density function . The likelihood function of the model is equal to
pn,\theta(x1,\ldots,xn;\theta)=
f(xi,\theta).
If
f is twice continuously differentiable in
θ, then
\begin{align}
lnpn,\theta+\delta\theta& ≈ lnpn,\theta+\delta\theta'
| \partiallnpn,\theta |
\partial\theta |
+
| \partial2lnpn,\theta |
\partial\theta\partial\theta' |
\delta\theta\\
&=lnpn,\theta+\delta\theta'
| n | \partiallnf(xi,\theta) | \partial\theta |
|
\sum | |
| i=1 |
+
| n | \partial2lnf(xi,\theta) | \partial\theta\partial\theta' |
|
[\sum | |
| i=1 |
]\delta\theta.
\end{align}
Plugging in
, gives
} = h' \Bigg(\frac \sum_^n\frac\Bigg) \;-\; \frac12 h' \Bigg(\frac1n \sum_^n - \frac \Bigg) h \;+\; o_p(1). By the
central limit theorem, the first term (in parentheses) converges in distribution to a normal random variable, whereas by the
law of large numbers the expression in second parentheses converges in probability to
Iθ, which is the Fisher information matrix:
I\theta=E[{-
| \partial2lnf(Xi,\theta) |
\partial\theta\partial\theta' |
}\bigg] = \mathrm\bigg[\bigg(\frac{\partial \ln f(X_i,\theta)}{\partial\theta}\bigg)\bigg(\frac{\partial \ln f(X_i,\theta)}{\partial\theta}\bigg)'\,\bigg]. Thus, the definition of the local asymptotic normality is satisfied, and we have confirmed that the parametric model with iid observations and twice continuously differentiable likelihood has the LAN property.
See also
Notes
References
- Book: Ibragimov . I.A. . Has’minskiĭ . R.Z. . Statistical estimation: asymptotic theory . 1981 . Springer-Verlag . 0-387-90523-5 .
- Le Cam . L. . Locally asymptotically normal families of distributions . 1960 . University of California Publications in Statistics . 3 . 37–98 .
- Book: van der Vaart, A.W.
. Asymptotic statistics . 1998 . Cambridge University Press . 978-0-521-78450-4 .
Notes and References
- Book: Vaart, A. W. van der . Asymptotic Statistics . 1998-10-13 . Cambridge University Press . 978-0-511-80225-6.