Quartet distance explained

The quartet distance[1] is a way of measuring the distance between two phylogenetic trees. It is defined as the number of subsets of four leaves that are not related by the same topology in both trees.

Computing the quartet distance

The most straightforward computation of the quartet distance would require

O(N4)

time, where

N

is the number of leaves in the trees.

For binary trees, better algorithms have been found to compute the distance in

O(N2)

time[2]

O(Nlog2N)

time[3] and

O(NlogN)

time[4]

Gerth Stølting Brodal et al. found an algorithm that takes

O(DNlogN)

time to compute the quartet distance between two multifurcating trees when

D

is the maximum degree of the trees,[5] which is accessible in C, perl, and the R package Quartet.

References

  1. Estabrook. George F.. McMorris. F. R.. Meacham. Christopher A.. Comparison of Undirected Phylogenetic Trees Based on Subtrees of Four Evolutionary Units. Systematic Zoology. 1985. 34. 2. 193–200. 2413326. 10.2307/2413326.
  2. Bryant. D.. J. Tsang. P. E. Kearney. M. Li.. Computing the quartet distance between evolutionary trees. Proceedings of the Eleventh Annual ACM-SIAM Symposium on Discrete Algorithms. 285–286. ACM Press. N.Y.. 11 Jan 2000.
  3. Book: Brodal . Gerth Stølting . Fagerberg . Rolf . Pedersen . Christian N. S. . 2001 . Computing the Quartet Distance between Evolutionary Trees in Time

    O(nlog2n)

    . Algorithms and Computation . Lecture Notes in Computer Science . 2223 . 731–742 . 10.1007/3-540-45678-3_62 . 978-3-540-42985-2.
  4. Brodal. Gerth Stølting. Gerth Stølting Brodal. Rolf Fagerberg . Christian Nørgaard Storm Pedersen. Algorithmica . 2. 377–395. Computing the Quartet Distance Between Evolutionary Trees in Time

    O(nlogn)

    . 38. 2003 . 10.1007/s00453-003-1065-y. 6911940.
  5. Brodal. Gerth Stølting. Gerth Stølting Brodal. Rolf Fagerberg . T Mailund . Christian Nørgaard Storm Pedersen . A Sand. Proceedings of the Twenty-Fourth Annual ACM-SIAM Symposium on Discrete Algorithms. SIAM. 2013. Efficient algorithms for computing the triplet and quartet distance between trees of arbitrary degree. 1814–1832. 10.1137/1.9781611973105.130. 978-1-61197-251-1. free.