Hodges' estimator explained

In statistics, Hodges' estimator (or the Hodges–Le Cam estimator), named for Joseph Hodges, is a famous counterexample of an estimator which is "superefficient", i.e. it attains smaller asymptotic variance than regular efficient estimators. The existence of such a counterexample is the reason for the introduction of the notion of regular estimators.

Hodges' estimator improves upon a regular estimator at a single point. In general, any superefficient estimator may surpass a regular estimator at most on a set of Lebesgue measure zero.

Although Hodges discovered the estimator he never published it; the first publication was in the doctoral thesis of Lucien Le Cam.[1]

Construction

Suppose

\hat{\theta}n

is a "common" estimator for some parameter

\theta

: it is consistent, and converges to some asymptotic distribution

L\theta

(usually this is a normal distribution with mean zero and variance which may depend on

\theta

) at the

\sqrt{n}

-rate:

\sqrt{n}(\hat\thetan-\theta)\xrightarrow{d}L\theta.

Then the Hodges' estimator

H
\hat{\theta}
n
is defined as
H
\hat\theta
n

=\begin{cases}\hat\thetan,&if|\hat\thetan|\geqn-1/4,and\ 0,&if|\hat\thetan|<n-1/4.\end{cases}

This estimator is equal to

\hat{\theta}n

everywhere except on the small interval

[-n-1/4,n-1/4]

, where it is equal to zero. It is not difficult to see that this estimator is consistent for

\theta

, and its asymptotic distribution is

\begin{align} &

H
n
n

-\theta)\xrightarrow{d} 0,    when\theta=0,\\

H
&\sqrt{n}(\hat\theta
n

-\theta)\xrightarrow{d}L\theta,when\theta0, \end{align}

for any

\alpha\inR

. Thus this estimator has the same asymptotic distribution as

\hat{\theta}n

for all

\theta0

, whereas for

\theta=0

the rate of convergence becomes arbitrarily fast. This estimator is superefficient, as it surpasses the asymptotic behavior of the efficient estimator

\hat{\theta}n

at least at one point

\theta=0

.

It is not true that the Hodges estimator is equivalent to the sample mean, but much better when the true mean is 0. The correct interpretation is that, for finite

n

, the truncation can lead to worse square error than the sample mean estimator for

E[X]

close to 0, as is shown in the example in the following section.[2]

Le Cam shows that this behaviour is typical: superefficiency at the point θ implies the existence of a sequence

\thetan\theta

such that

\liminfE\thetan\ell(\sqrtn(\hat\thetan-\thetan))

is strictly larger than the Cramér-Rao bound. For theextreme case where the asymptotic risk at θ is zero, the

\liminf

is even infinite for a sequence

\thetan\theta

.[3]

In general, superefficiency may only be attained on a subset of Lebesgue measure zero of the parameter space

\Theta

.[4]

Example

Suppose x1, ..., xn is an independent and identically distributed (IID) random sample from normal distribution with unknown mean but known variance. Then the common estimator for the population mean θ is the arithmetic mean of all observations: \scriptstyle\bar. The corresponding Hodges' estimator will be \scriptstyle\hat\theta^H_n \;=\; \bar\cdot\mathbf\

\,\geq\,n^\
, where 1 denotes the indicator function.

The mean square error (scaled by n) associated with the regular estimator

x is constant and equal to 1 for all θs. At the same time the mean square error of the Hodges' estimator \scriptstyle\hat\theta_n^H behaves erratically in the vicinity of zero, and even becomes unbounded as . This demonstrates that the Hodges' estimator is not regular, and its asymptotic properties are not adequately described by limits of the form (θ fixed,).

See also

Notes

References

Notes and References

  1. Book: Le Cam, Lucien M. . On some asymptotic properties of maximum likelihood estimates and related Bayes' estimates. . University of California . Berkeley. . 1953 . University of California press . University of California publications in statistics; v. 1, no. 11 . Berkeley.
  2. Vaart AW van der. Asymptotic Statistics. Cambridge University Press; 1998.
  3. van der Vaart, A. W., & Wellner, J. A. (1996). Weak Convergence and Empirical Processes. In Springer Series in Statistics. Springer New York. https://doi.org/10.1007/978-1-4757-2545-2
  4. Vaart AW van der. Asymptotic Statistics. Cambridge University Press; 1998.