In statistical hypothesis testing, a uniformly most powerful (UMP) test is a hypothesis test which has the greatest power
1-\beta
Let
X
f\theta(x)
\theta\in\Theta
\Theta
\Theta0
\Theta1
H0
\theta\in\Theta0
H1
\theta\in\Theta1
\varphi(x)
R
\varphi(x)=\begin{cases} 1&ifx\inR\\ 0&ifx\inRc \end{cases}
H1
X\inR
H0
X\inRc
R\cupRc
A test function
\varphi(x)
\alpha
\varphi'(x)
\sup | |
\theta\in\Theta0 |
\operatorname{E}[\varphi'(X)|\theta]=\alpha'\leq\alpha=\sup | |
\theta\in\Theta0 |
\operatorname{E}[\varphi(X)|\theta]
\forall\theta\in\Theta1, \operatorname{E}[\varphi'(X)|\theta]=1-\beta'(\theta)\leq1-\beta(\theta)=\operatorname{E}[\varphi(X)|\theta].
The Karlin–Rubin theorem can be regarded as an extension of the Neyman–Pearson lemma for composite hypotheses.[1] Consider a scalar measurement having a probability density function parameterized by a scalar parameter θ, and define the likelihood ratio
l(x)=
f | |
\theta1 |
(x)/
f | |
\theta0 |
(x)
l(x)
x
\theta1\geq\theta0
x
H1
\varphi(x)=\begin{cases} 1&ifx>x0\\ 0&ifx<x0 \end{cases}
where
x0
\operatorname{E} | |
\theta0 |
\varphi(X)=\alpha
is the UMP test of size α for testing
H0:\theta\leq\theta0vs.H1:\theta>\theta0.
Note that exactly the same test is also UMP for testing
H0:\theta=\theta0vs.H1:\theta>\theta0.
Although the Karlin-Rubin theorem may seem weak because of its restriction to scalar parameter and scalar measurement, it turns out that there exist a host of problems for which the theorem holds. In particular, the one-dimensional exponential family of probability density functions or probability mass functions with
f\theta(x)=g(\theta)h(x)\exp(η(\theta)T(x))
T(x)
η(\theta)
Let
X=(X0,\ldots,XM-1)
N
\thetam
R
\begin{align} f\theta(X)={}&(2\pi)-MN/2|R|-M/2\exp\left\{-
1 | |
2 |
M-1 | |
\sum | |
n=0 |
(Xn-\thetam)TR-1(Xn-\thetam)\right\}\\[4pt] ={}&(2\pi)-MN/2|R|-M/2\exp\left\{-
1 | |
2 |
M-1 | |
\sum | |
n=0 |
\left(\theta2mTR-1m\right)\right\}\\[4pt] &\exp\left\{-
1 | |
2 |
M-1 | |
\sum | |
n=0 |
T | |
X | |
n |
R-1Xn\right\}\exp\left\{\thetamTR-1
M-1 | |
\sum | |
n=0 |
Xn\right\} \end{align}
which is exactly in the form of the exponential family shown in the previous section, with the sufficient statistic being
T(X)=mTR-1
M-1 | |
\sum | |
n=0 |
Xn.
Thus, we conclude that the test
\varphi(T)=\begin{cases}1&T>t0\ 0&T<t0\end{cases}
\operatorname{E} | |
\theta0 |
\varphi(T)=\alpha
is the UMP test of size
\alpha
H0:\theta\leqslant\theta0
H1:\theta>\theta0
Finally, we note that in general, UMP tests do not exist for vector parameters or for two-sided tests (a test in which one hypothesis lies on both sides of the alternative). The reason is that in these situations, the most powerful test of a given size for one possible value of the parameter (e.g. for
\theta1
\theta1>\theta0
\theta2
\theta2<\theta0