In statistics, the multivariate Behrens–Fisher problem is the problem of testing for the equality of means from two multivariate normal distributions when the covariance matrices are unknown and possibly not equal. Since this is a generalization of the univariate Behrens-Fisher problem, it inherits all of the difficulties that arise in the univariate problem.
Let
Xij\siml{N}p(\mui,\Sigmai) (j=1,...,ni; i=1,2)
p
\mui
\Sigmai
i
j
i
Xij
The multivariate Behrens–Fisher problem is to test the null hypothesis
H0
H1
H0:\mu1=\mu2 vs H1:\mu1 ≠ \mu2.
Define some statistics, which are used in the various attempts to solve the multivariate Behrens–Fisher problem, by
\begin{align} \bar{Xi}&=
1 | |
ni |
ni | |
\sum | |
j=1 |
Xij,\\ Ai&=
ni | |
\sum | |
j=1 |
(Xij-\bar{Xi})(Xij-\bar{Xi})',\\ Si&=
1 | |
ni-1 |
Ai,\\ \tilde{Si}&=
1 | |
ni |
Si,\\ \tilde{S}&=\tilde{S1}+\tilde{S2}, and\\ T2&=(\bar{X1}-
-1 | |
\bar{X | |
2})'\tilde{S} |
(\bar{X1}-\bar{X2}). \end{align}
The sample means
\bar{Xi}
Ai
\mui,\Sigmai, (i=1,2)
\bar{Xi}
Ai
\begin{align} \bar{Xi}&\siml{N}p\left(\mui,\Sigmai/ni\right),\\ Ai&\simWp(\Sigmai,ni-1). \end{align}
In the case where the dispersion matrices are equal, the distribution of the
T2
The main problem is that when the true values of the dispersion matrix are unknown, then under the null hypothesis the probability of rejecting
H0
T2
Now, the mean vectors are independently and normally distributed,
\bar{Xi}\siml{N}p\left(\mui,\Sigmai/ni\right),
but the sum
A1+A2
Proposed solutions are based on a few main strategies:
T2
F
Below,
tr
(as cited by)
T2\sim
\nup | |
\nu-p+1 |
Fp,\nu-p+1,
where
\begin{align} \nu&=\left[
1 | \left( | |
n1 |
\bar{X | |
d'\tilde{S} |
-1\tilde{S}1\tilde{S}-1\bar{Xd}}
-1 | |
{\bar{X} | |
d'\tilde{S} |
2 | |
\bar{X} | |
d} \right) |
+
1 | \left( | |
n2 |
\bar{X | |
d |
'\tilde{S}-1\tilde{S}2\tilde{S}-1
-1 | |
X | |
d |
\bar_d & = \bar_-\bar_2. \end
(as cited by)
T2\simqFp,\nu,
where
\begin{align} q&=p+2D-
6D | |
p(p-1)+2 |
,\\ \nu&=
p(p+2) | |
3D |
,\\ \end{align}
and
\begin{align} D=
1 | |
2 |
2 | |
\sum | |
i=1 |
1 | |
ni |
\{ &tr\left[{\left(I-
-1 | |
(\tilde{S} | |
1 |
+
-1 | |
\tilde{S} | |
2 |
)-1
-1 | |
\tilde{S} | |
i |
\right)}2\right]\\ &{}+{\left[tr\left(I
-1 | |
-(\tilde{S} | |
1 |
+
-1 | |
\tilde{S} | |
2 |
)-1
-1 | |
\tilde{S} | |
i |
\right)\right]}2 \}.\\ \end{align}
(as cited by)
T2\sim
\nup | |
\nu-p+1 |
Fp,\nu-p+1,
where
\nu=
tr(\tilde{S | |
2) |
+[tr(\tilde{S})]2} {
1 | |
n1 |
\left\{tr(\tilde{S1
Kim (1992) proposed a solution that is based on a variant of
T2
Krishnamoorthy and Yu (2004) proposed a procedure which adjusts in Nel and Var der Merwe (1986)'s approximate df for the denominator of
T2
\left[min\{n1-1,n2-1\},n1+n2-2\right]
The test statistic
T2
T2\sim\nupFp,\nu-p+1/(\nu-p+1),
\nu=
p+p2 | ||||
|
1\tilde{S}-1
2]+[tr(\tilde{S} | |
) | |
1 |
\tilde{S}-1)]2\}+
1 | |
n2-1 |
\{tr[(\tilde{S}2\tilde{S}-1
2]+[tr(\tilde{S} | |
) | |
2 |
\tilde{S}-1)]2\} }.