The transmission disequilibrium test (TDT) was proposed by Spielman, McGinnis and Ewens (1993)[1] as a family-based association test for the presence of genetic linkage between a genetic marker and a trait. It is an application of McNemar's test.
A specificity of the TDT is that it will detect genetic linkage only in the presence of genetic association.While genetic association can be caused by population structure, genetic linkage will not be affected, which makes the TDT robust to the presence of population structure.
We first describe the TDT in the case where families consist of trios (two parents and one affected child). Our description follows the notations used in Spielman, McGinnis & Ewens (1993).
The TDT measures the over-transmission of an allele from heterozygous parents to affected offsprings.The n affected offsprings have 2n parents. These can be represented by the transmitted and the non-transmitted alleles
M1
M2
Non-transmitted allele | ||||
---|---|---|---|---|
Transmitted allele | M1 | M2 | Total | |
M1 | a | b | a + b | |
M2 | c | d | c + d | |
Total | a + c | b + d | 2n |
The derivation of the TDT shows that one should only use the heterozygous parents (total number b+c).The TDT tests whether the proportions b/(b+c) and c/(b+c) are compatible with probabilities (0.5, 0.5).This hypothesis can be tested using a binomial (asymptotically chi-square) test with one degree of freedom:
\chi2=
[b-(b+c)/2]2 | |
(b+c)/2 |
+
[c-(b+c)/2]2 | |
(b+c)/2 |
=
(b-c)2 | |
b+c |
A derivation of the test consists of using a population genetics model to obtain the expected proportions for the quantities
a,b,c
d
b
c
\chi2
On the other hand, one can also show that under such models the proportions
a,b,c
d
(a+b)/2n
(c+d)/2n
(a+c)/2n
(b+d)/2n
\chi2
The TDT can be readily extended beyond the case of trios. We keep following the notations of Spielman, McGinnis & Ewens (1993). Consider a total of
h
i
M1
h-i-j
M1
M2
j
M2
Using the notations of the previous paragraph we have:
b=2i+(h-i-j)=h+i-j
c=2j+(h-i-j)=h-i+j
leading to the chi-squared test statistic:
2 | |
\chi | |
tdt |
=
4(i-j)2 | |
h |
.
The comparison with the more traditional (at least at the time when the TDT was proposed) linkage test proposed by Blackwelder and Elston 1985[2] is informative.The Blackwelder and Elston approach uses the total number of haplotypes identical by descent (mean haplotype sharing). This measure ignores the allelic state of a marker and simply compares the number of times a parent transmits the same allele to both affected children with the number of times a different allele is transmitted.The test statistic is:
2 | |
\chi | |
hs |
=
(2i+2j-h)2 | |
h |
.
Under the null hypothesis of no linkage the expected proportions of (i, h - i - j, j) are (0.25, 0.5, 0.25). One can derive a simple chi-square statistic with 2 degrees of freedom:
2 | |
\chi | |
total |
=
(i-h/4)2 | |
h/4 |
+
(h-i-j-h/2)2 | |
h/2 |
+
(j-h/4)2 | |
h/4 |
=
2 | |
\chi | |
tdt |
+
2 | |
\chi | |
hs |
.
It clearly appears that the total statistic (with two degree of freedom) is the sum of two independent components: one is the traditional linkage measure and the other is the TDT statistic.
More recently, Wittkowski KM, Liu X. (2002/2004)[3] proposed a modification to the TDT that can be more powerful under some alternatives, although the asymptotic properties under the null hypothesis are equivalent.
The motivating idea for this modification is the fact that, while the transmissions of both allele from parents to a child are independent, the effects of other filial genetic or environmental covariates on penetrance are the same for both alleles transmitted to the same child. This situation can be important if, for example, the genetic marker is linked to a disease locus with a strong selection against heterozygous individuals. This observation suggests to shift the statistical model from a set of independent transmissions to a set of independent children (see Sasieni (1997)[4] for the corresponding problem in case-control association tests). While this observation does not affect the distribution under the null hypothesis of no linkage, it allows, for some disease models, to design a more powerful test.
In this modified TDT test the children are stratified by parental type and the modified test statistic becomes:
\chi2=
\left[[nPQ-nQQ]PQ+2 x [nPP-nQQ]PQ+[nPP-nPQ]PP\right]2 | |
[nPQ+nQQ]PQ+4 x [nPP+nQQ]PQ+[nPQ+nPP]PP |
where
[nPQ]PQ