Hoeffding's independence test explained
In statistics, Hoeffding's test of independence, named after Wassily Hoeffding, is a test based on the population measure of deviation from independence
where
is the joint distribution function of two random variables, and
and
are their
marginal distribution functions.Hoeffding derived an unbiased estimator of
that can be used to test for
independence, and is
consistent for any continuous
alternative. The test should only be applied to data drawn from a continuous distribution, since
has a defect for discontinuous
, namely that it is not necessarily zero when
. This drawback can be overcome by taking an
integration with respect to
. This modified measure is known as Blum–Kiefer–Rosenblatt coefficient.
[1] A paper published in 2008[2] describes both the calculation of a sample based version of this measure for use as a test statistic, and calculation of the null distribution of this test statistic.
See also
Primary sources
- Wassily Hoeffding, A non-parametric test of independence, Annals of Mathematical Statistics 19: 293 - 325, 1948. (JSTOR)
- Hollander and Wolfe, Non-parametric statistical methods (Section 8.7), 1999. Wiley.
Notes and References
- Blum. J.R.. Kiefer. J.. Rosenblatt. M.. Distribution free tests of independence based on the sample distribution function. The Annals of Mathematical Statistics. 1961 . 32. 485–498. 2. 10.1214/aoms/1177705055 . 2237758 .
- Wilding, G.E., Mudholkar, G.S. (2008) "Empirical approximations for Hoeffding's test of bivariate independence using two Weibull extensions", Statistical Methodology, 5 (2), 160- - 170