Sequence homology explained

Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a speciation event (orthologs), or a duplication event (paralogs), or else a horizontal (or lateral) gene transfer event (xenologs).[1]

Homology among DNA, RNA, or proteins is typically inferred from their nucleotide or amino acid sequence similarity. Significant similarity is strong evidence that two sequences are related by evolutionary changes from a common ancestral sequence. Alignments of multiple sequences are used to indicate which regions of each sequence are homologous.

Identity, similarity, and conservation

The term "percent homology" is often used to mean "sequence similarity”, that is the percentage of identical residues (percent identity), or the percentage of residues conserved with similar physicochemical properties (percent similarity), e.g. leucine and isoleucine, is usually used to "quantify the homology." Based on the definition of homology specified above this terminology is incorrect since sequence similarity is the observation, homology is the conclusion.[2] Sequences are either homologous or not.[2] This involves that the term "percent homology" is a misnomer.[3]

As with morphological and anatomical structures, sequence similarity might occur because of convergent evolution, or, as with shorter sequences, by chance, meaning that they are not homologous. Homologous sequence regions are also called conserved. This is not to be confused with conservation in amino acid sequences, where the amino acid at a specific position has been substituted with a different one that has functionally equivalent physicochemical properties.

Partial homology can occur where a segment of the compared sequences has a shared origin, while the rest does not. Such partial homology may result from a gene fusion event.

Orthology

Homologous sequences are orthologous if they are inferred to be descended from the same ancestral sequence separated by a speciation event: when a species diverges into two separate species, the copies of a single gene in the two resulting species are said to be orthologous. Orthologs, or orthologous genes, are genes in different species that originated by vertical descent from a single gene of the last common ancestor. The term "ortholog" was coined in 1970 by the molecular evolutionist Walter Fitch.[4]

For instance, the plant Flu regulatory protein is present both in Arabidopsis (multicellular higher plant) and Chlamydomonas (single cell green algae). The Chlamydomonas version is more complex: it crosses the membrane twice rather than once, contains additional domains and undergoes alternative splicing. However it can fully substitute the much simpler Arabidopsis protein, if transferred from algae to plant genome by means of genetic engineering. Significant sequence similarity and shared functional domains indicate that these two genes are orthologous genes,[5] inherited from the shared ancestor.

Orthology is strictly defined in terms of ancestry. Given that the exact ancestry of genes in different organisms is difficult to ascertain due to gene duplication and genome rearrangement events, the strongest evidence that two similar genes are orthologous is usually found by carrying out phylogenetic analysis of the gene lineage. Orthologs often, but not always, have the same function.[6]

Orthologous sequences provide useful information in taxonomic classification and phylogenetic studies of organisms. The pattern of genetic divergence can be used to trace the relatedness of organisms. Two organisms that are very closely related are likely to display very similar DNA sequences between two orthologs. Conversely, an organism that is further removed evolutionarily from another organism is likely to display a greater divergence in the sequence of the orthologs being studied.

Databases of orthologous genes

Given their tremendous importance for biology and bioinformatics, orthologous genes have been organized in several specialized databases that provide tools to identify and analyze orthologous gene sequences. These resources employ approaches that can be generally classified into those that use heuristic analysis of all pairwise sequence comparisons, and those that use phylogenetic methods. Sequence comparison methods were first pioneered in the COGs database in 1997.[7] These methods have been extended and automated in twelve different databases the most advanced being AYbRAH Analyzing Yeasts by Reconstructing Ancestry of Homologs[8] as well as these following databases right now.

Tree-based phylogenetic approaches aim to distinguish speciation from gene duplication events by comparing gene trees with species trees, as implemented in databases and software tools such as:

A third category of hybrid approaches uses both heuristic and phylogenetic methods to construct clusters and determine trees, for example:

Paralogy

Paralogous genes are genes that are related via duplication events in the last common ancestor (LCA) of the species being compared. They result from the mutation of duplicated genes during separate speciation events. When descendants from the LCA share mutated homologs of the original duplicated genes then those genes are considered paralogs.[1]

As an example, in the LCA, one gene (gene A) may get duplicated to make a separate similar gene (gene B), those two genes will continue to get passed to subsequent generations. During speciation, one environment will favor a mutation in gene A (gene A1), producing a new species with genes A1 and B. Then in a separate speciation event, one environment will favor a mutation in gene B (gene B1) giving rise to a new species with genes A and B1. The descendants' genes A1 and B1 are paralogous to each other because they are homologs that are related via a duplication event in the last common ancestor of the two species.[1]

Additional classifications of paralogs include alloparalogs (out-paralogs) and symparalogs (in-paralogs). Alloparalogs are paralogs that evolved from gene duplications that preceded the given speciation event. In other words, alloparalogs are paralogs that evolved from duplication events that happened in the LCA of the organisms being compared. The example above is an example alloparalogy. Symparalogs are paralogs that evolved from gene duplication of paralogous genes in subsequent speciation events. From the example above, if the descendant with genes A1 and B underwent another speciation event where gene A1 duplicated, the new species would have genes B, A1a, and A1b. In this example, genes A1a and A1b are symparalogs.[1]

Paralogous genes can shape the structure of whole genomes and thus explain genome evolution to a large extent. Examples include the Homeobox (Hox) genes in animals. These genes not only underwent gene duplications within chromosomes but also whole genome duplications. As a result, Hox genes in most vertebrates are clustered across multiple chromosomes with the HoxA-D clusters being the best studied.

Another example are the globin genes which encode myoglobin and hemoglobin and are considered to be ancient paralogs. Similarly, the four known classes of hemoglobins (hemoglobin A, hemoglobin A2, hemoglobin B, and hemoglobin F) are paralogs of each other. While each of these proteins serves the same basic function of oxygen transport, they have already diverged slightly in function: fetal hemoglobin (hemoglobin F) has a higher affinity for oxygen than adult hemoglobin. Function is not always conserved, however. Human angiogenin diverged from ribonuclease, for example, and while the two paralogs remain similar in tertiary structure, their functions within the cell are now quite different.

It is often asserted that orthologs are more functionally similar than paralogs of similar divergence, but several papers have challenged this notion.[36] [37] [38]

Regulation

Paralogs are often regulated differently, e.g. by having different tissue-specific expression patterns (see Hox genes). However, they can also be regulated differently on the protein level. For instance, Bacillus subtilis encodes two paralogues of glutamate dehydrogenase: GudB is constitutively transcribed whereas RocG is tightly regulated. In their active, oligomeric states, both enzymes show similar enzymatic rates. However, swaps of enzymes and promoters cause severe fitness losses, thus indicating promoter–enzyme coevolution. Characterization of the proteins shows that, compared to RocG, GudB's enzymatic activity is highly dependent on glutamate and pH.[39]

Paralogous chromosomal regions

Sometimes, large regions of chromosomes share gene content similar to other chromosomal regions within the same genome.[40] They are well characterised in the human genome, where they have been used as evidence to support the 2R hypothesis. Sets of duplicated, triplicated and quadruplicated genes, with the related genes on different chromosomes, are deduced to be remnants from genome or chromosomal duplications. A set of paralogy regions is together called a paralogon.[41] Well-studied sets of paralogy regions include regions of human chromosome 2, 7, 12 and 17 containing Hox gene clusters, collagen genes, keratin genes and other duplicated genes,[42] regions of human chromosomes 4, 5, 8 and 10 containing neuropeptide receptor genes, NK class homeobox genes and many more gene families,[43] [44] [45] and parts of human chromosomes 13, 4, 5 and X containing the ParaHox genes and their neighbors.[46] The Major histocompatibility complex (MHC) on human chromosome 6 has paralogy regions on chromosomes 1, 9 and 19.[47] Much of the human genome seems to be assignable to paralogy regions.[48]

Ohnology

Ohnologous genes are paralogous genes that have originated by a process of whole-genome duplication. The name was first given in honour of Susumu Ohno by Ken Wolfe.[49] Ohnologues are useful for evolutionary analysis because all ohnologues in a genome have been diverging for the same length of time (since their common origin in the whole genome duplication). Ohnologues are also known to show greater association with cancers, dominant genetic disorders, and pathogenic copy number variations.[50] [51] [52] [53] [54]

Xenology

Homologs resulting from horizontal gene transfer between two organisms are termed xenologs. Xenologs can have different functions if the new environment is vastly different for the horizontally moving gene. In general, though, xenologs typically have similar function in both organisms. The term was coined by Walter Fitch.[4]

Homoeology

Homoeologous (also spelled homeologous) chromosomes or parts of chromosomes are those brought together following inter-species hybridization and allopolyploidization to form a hybrid genome, and whose relationship was completely homologous in an ancestral species.[55] In allopolyploids, the homologous chromosomes within each parental sub-genome should pair faithfully during meiosis, leading to disomic inheritance; however in some allopolyploids, the homoeologous chromosomes of the parental genomes may be nearly as similar to one another as the homologous chromosomes, leading to tetrasomic inheritance (four chromosomes pairing at meiosis), intergenomic recombination, and reduced fertility.

Gametology

Gametology denotes the relationship between homologous genes on non-recombining, opposite sex chromosomes. The term was coined by García-Moreno and Mindell.[56] 2000. Gametologs result from the origination of genetic sex determination and barriers to recombination between sex chromosomes. Examples of gametologs include CHDW and CHDZ in birds.

See also

Notes and References

  1. Koonin EV . Orthologs, paralogs, and evolutionary genomics . Annual Review of Genetics . 39 . 309–38 . 2005 . 16285863 . 10.1146/annurev.genet.39.073003.114725 .
  2. Reeck GR, de Haën C, Teller DC, Doolittle RF, Fitch WM, Dickerson RE, Chambon P, McLachlan AD, Margoliash E, Jukes TH . 6 . "Homology" in proteins and nucleic acids: a terminology muddle and a way out of it . Cell . 50 . 5 . 667 . August 1987 . 3621342 . 10.1016/0092-8674(87)90322-9 . 42949514 .
  3. Holman C . January 2004 . Protein Similarity Score: A Simplified Version of the Blast Score as a Superior Alternative to Percent Identity for Claiming Genuses of Related Protein Sequences. Santa Clara High Technology Law Journal. 21. 1. 55. 0882-3383.
  4. Fitch WM . Distinguishing homologous from analogous proteins . Systematic Zoology . 19 . 2 . 99–113 . June 1970 . 5449325 . 10.2307/2412448 . 2412448 . Where the homology is the result of gene duplication so that both copies have descended side by side during the history of an organism (for example, a and b hemoglobin) the genes should be called paralogous (para = in parallel). Where the homology is the result of speciation so that the history of the gene reflects the history of the species (for example a hemoglobin in man and mouse) the genes should be called orthologous (ortho = exact). .
  5. Falciatore A, Merendino L, Barneche F, Ceol M, Meskauskiene R, Apel K, Rochaix JD . The FLP proteins act as regulators of chlorophyll synthesis in response to light and plastid signals in Chlamydomonas . Genes & Development . 19 . 1 . 176–87 . January 2005 . 15630026 . 540235 . 10.1101/gad.321305 .
  6. Fang G, Bhardwaj N, Robilotto R, Gerstein MB . Getting started in gene orthology and functional analysis . PLOS Computational Biology . 6 . 3 . e1000703 . March 2010 . 20361041 . 2845645 . 10.1371/journal.pcbi.1000703 . 2010PLSCB...6E0703F . free .
  7. https://www.ncbi.nlm.nih.gov/COG/ COGs: Clusters of Orthologous Groups of proteins
  8. Correia K, Yu SM, Mahadevan R . January 2019 . AYbRAH: a curated ortholog database for yeasts and fungi spanning 600 million years of evolution . Database . 2019 . 10.1093/database/baz022 . 6425859 . 30893420.
  9. http://eggnog.embl.de/ eggNOG: evolutionary genealogy of genes: Non-supervised Orthologous Groups
  10. Powell S, Forslund K, Szklarczyk D, Trachana K, Roth A, Huerta-Cepas J, Gabaldón T, Rattei T, Creevey C, Kuhn M, Jensen LJ, von Mering C, Bork P . 6 . eggNOG v4.0: nested orthology inference across 3686 organisms . Nucleic Acids Research . 42 . Database issue . D231-9 . January 2014 . 24297252 . 3964997 . 10.1093/nar/gkt1253 .
  11. http://www.greenphyl.org GreenPhylDB
  12. Rouard M, Guignon V, Aluome C, Laporte MA, Droc G, Walde C, Zmasek CM, Périn C, Conte MG . 6 . GreenPhylDB v2.0: comparative and functional genomics in plants . Nucleic Acids Research . 39 . Database issue . D1095-102 . January 2011 . 20864446 . 3013755 . 10.1093/nar/gkq811 .
  13. http://inparanoid.sbc.su.se/cgi-bin/index.cgi Inparanoid: Eukaryotic Ortholog Groups
  14. Sonnhammer EL, Östlund G . InParanoid 8: orthology analysis between 273 proteomes, mostly eukaryotic . Nucleic Acids Research . 43 . Database issue . D234-9 . January 2015 . 25429972 . 4383983 . 10.1093/nar/gku1203 .
  15. Singh PP, Arora J, Isambert H . Identification of Ohnolog Genes Originating from Whole Genome Duplication in Early Vertebrates, Based on Synteny Comparison across Multiple Genomes . PLOS Computational Biology . 11 . 7 . e1004394 . July 2015 . 26181593 . 4504502 . 10.1371/journal.pcbi.1004394 . 2015PLSCB..11E4394S . free .
  16. Web site: Vertebrate Ohnologs. ohnologs.curie.fr. 2018-10-12.
  17. Altenhoff AM, Glover NM, Train CM, Kaleb K, Warwick Vesztrocy A, Dylus D, de Farias TM, Zile K, Stevenson C, Long J, Redestig H, Gonnet GH, Dessimoz C . 6 . The OMA orthology database in 2018: retrieving evolutionary relationships among all domains of life through richer web and programmatic interfaces . Nucleic Acids Research . 46 . D1 . D477–D485 . January 2018 . 29106550 . 5753216 . 10.1093/nar/gkx1019 .
  18. Zdobnov EM, Tegenfeldt F, Kuznetsov D, Waterhouse RM, Simão FA, Ioannidis P, Seppey M, Loetscher A, Kriventseva EV . 6 . OrthoDB v9.1: cataloging evolutionary and functional annotations for animal, fungal, plant, archaeal, bacterial and viral orthologs . Nucleic Acids Research . 45 . D1 . D744–D749 . January 2017 . 27899580 . 5210582 . 10.1093/nar/gkw1119 .
  19. Nevers Y, Kress A, Defosset A, Ripp R, Linard B, Thompson JD, Poch O, Lecompte O . 6 . OrthoInspector 3.0: open portal for comparative genomics . Nucleic Acids Research . 47 . D411–D418 . January 2019 . D1 . 30380106 . 10.1093/nar/gky1068 . free . 6323921 .
  20. http://nypg.bio.nyu.edu/orthologid/ OrthologID
  21. Book: Egan M, Lee EK, Chiu JC, Coruzzi G, Desalle R . Bioinformatics for DNA Sequence Analysis . Gene orthology assessment with OrthologID . 537 . 23–38 . 2009 . 19378138 . 10.1007/978-1-59745-251-9_2 . Humana Press . 978-1-59745-251-9 . Methods in Molecular Biology . Posada D .
  22. http://www.orthomam.univ-montp2.fr OrthoMaM
  23. Douzery EJ, Scornavacca C, Romiguier J, Belkhir K, Galtier N, Delsuc F, Ranwez V . OrthoMaM v8: a database of orthologous exons and coding sequences for comparative genomics in mammals . Molecular Biology and Evolution . 31 . 7 . 1923–8 . July 2014 . 24723423 . 10.1093/molbev/msu132 . free .
  24. Scornavacca C, Belkhir K, Lopez J, Dernat R, Delsuc F, Douzery EJ, Ranwez V . OrthoMaM v10: Scaling-Up Orthologous Coding Sequence and Exon Alignments with More than One Hundred Mammalian Genomes . Molecular Biology and Evolution . 36 . 4 . 861–862 . April 2019 . 30698751 . 6445298 . 10.1093/molbev/msz015 .
  25. http://www.orthomcl.org OrthoMCL: Identification of Ortholog Groups for Eukaryotic Genomes
  26. Fischer S, Brunk BP, Chen F, Gao X, Harb OS, Iodice JB, Shanmugam D, Roos DS, Stoeckert CJ . 6 . Using OrthoMCL to assign proteins to OrthoMCL-DB groups or to cluster proteomes into new ortholog groups . Current Protocols in Bioinformatics . Chapter 6 . 1 . Unit 6.12.1–19 . September 2011 . 21901743 . 3196566 . 10.1002/0471250953.bi0612s35 . 978-0471250951 .
  27. http://roundup.hms.harvard.edu/ Roundup
  28. http://www.treefam.org/ TreeFam: Tree families database
  29. http://www.treefam.org/ TreeFam: Tree families database
  30. Schreiber F, Patricio M, Muffato M, Pignatelli M, Bateman A . TreeFam v9: a new website, more species and orthology-on-the-fly . Nucleic Acids Research . 42 . Database issue . D922-5 . January 2014 . 24194607 . 3965059 . 10.1093/nar/gkt1055 .
  31. https://davidemms.github.io/ OrthoFinder: Orthologs from gene trees
  32. Vilella AJ, Severin J, Ureta-Vidal A, Heng L, Durbin R, Birney E . EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates . Genome Research . 19 . 2 . 327–35 . February 2009 . 19029536 . 2652215 . 10.1101/gr.073585.107 .
  33. Thanki AS, Soranzo N, Haerty W, Davey RP . GeneSeqToFamily: a Galaxy workflow to find gene families based on the Ensembl Compara GeneTrees pipeline . GigaScience . 7 . 3 . 1–10 . March 2018 . 29425291 . 5863215 . 10.1093/gigascience/giy005 .
  34. Sayers EW, Barrett T, Benson DA, Bolton E, Bryant SH, Canese K, Chetvernin V, Church DM, DiCuccio M, Federhen S, Feolo M, Fingerman IM, Geer LY, Helmberg W, Kapustin Y, Landsman D, Lipman DJ, Lu Z, Madden TL, Madej T, Maglott DR, Marchler-Bauer A, Miller V, Mizrachi I, Ostell J, Panchenko A, Phan L, Pruitt KD, Schuler GD, Sequeira E, Sherry ST, Shumway M, Sirotkin K, Slotta D, Souvorov A, Starchenko G, Tatusova TA, Wagner L, Wang Y, Wilbur WJ, Yaschenko E, Ye J . 6 . Database resources of the National Center for Biotechnology Information . Nucleic Acids Research . 39 . Database issue . D38-51 . January 2011 . 21097890 . 3013733 . 10.1093/nar/gkq1172 .
  35. Fulton DL, Li YY, Laird MR, Horsman BG, Roche FM, Brinkman FS . Improving the specificity of high-throughput ortholog prediction . BMC Bioinformatics . 7 . 270 . May 2006 . 16729895 . 1524997 . 10.1186/1471-2105-7-270 . free .
  36. Studer RA, Robinson-Rechavi M . How confident can we be that orthologs are similar, but paralogs differ? . Trends in Genetics . 25 . 5 . 210–6 . May 2009 . 19368988 . 10.1016/j.tig.2009.03.004 .
  37. Nehrt NL, Clark WT, Radivojac P, Hahn MW . Testing the ortholog conjecture with comparative functional genomic data from mammals . PLOS Computational Biology . 7 . 6 . e1002073 . June 2011 . 21695233 . 3111532 . 10.1371/journal.pcbi.1002073 . 2011PLSCB...7E2073N . free .
  38. Web site: Eisen J . Special Guest Post & Discussion Invitation from Matthew Hahn on Ortholog Conjecture Paper . 20 September 2011 .
  39. Noda-Garcia L, Romero Romero ML, Longo LM, Kolodkin-Gal I, Tawfik DS . Bacilli glutamate dehydrogenases diverged via coevolution of transcription and enzyme regulation . EMBO Reports . 18 . 7 . 1139–1149 . July 2017 . 28468957 . 5494520 . 10.15252/embr.201743990 .
  40. Lundin LG . Evolution of the vertebrate genome as reflected in paralogous chromosomal regions in man and the house mouse . Genomics . 16 . 1 . 1–19 . April 1993 . 8486346 . 10.1006/geno.1993.1133 .
  41. Coulier F, Popovici C, Villet R, Birnbaum D . MetaHox gene clusters . The Journal of Experimental Zoology . 288 . 4 . 345–51 . December 2000 . 11144283 . 10.1002/1097-010X(20001215)288:4<345::AID-JEZ7>3.0.CO;2-Y .
  42. Ruddle FH, Bentley KL, Murtha MT, Risch N . Gene loss and gain in the evolution of the vertebrates . Development . 155–61 . 1994 . 1994 . 10.1242/dev.1994.Supplement.155 . 7579516 .
  43. Pébusque MJ, Coulier F, Birnbaum D, Pontarotti P . Ancient large-scale genome duplications: phylogenetic and linkage analyses shed light on chordate genome evolution . Molecular Biology and Evolution . 15 . 9 . 1145–59 . September 1998 . 9729879 . 10.1093/oxfordjournals.molbev.a026022 . free .
  44. Larsson TA, Olsson F, Sundstrom G, Lundin LG, Brenner S, Venkatesh B, Larhammar D . Early vertebrate chromosome duplications and the evolution of the neuropeptide Y receptor gene regions . BMC Evolutionary Biology . 8 . 184 . June 2008 . 1 . 18578868 . 2453138 . 10.1186/1471-2148-8-184 . free . 2008BMCEE...8..184L .
  45. Pollard SL, Holland PW . Evidence for 14 homeobox gene clusters in human genome ancestry . Current Biology . 10 . 17 . 1059–62 . September 2000 . 10996074 . 10.1016/S0960-9822(00)00676-X . 32135432 . free .
  46. Mulley JF, Chiu CH, Holland PW . Breakup of a homeobox cluster after genome duplication in teleosts . Proceedings of the National Academy of Sciences of the United States of America . 103 . 27 . 10369–10372 . July 2006 . 16801555 . 1502464 . 10.1073/pnas.0600341103 . 2006PNAS..10310369M . free .
  47. Flajnik MF, Kasahara M . Comparative genomics of the MHC: glimpses into the evolution of the adaptive immune system . Immunity . 15 . 3 . 351–62 . September 2001 . 11567626 . 10.1016/S1074-7613(01)00198-4 . free .
  48. McLysaght A, Hokamp K, Wolfe KH . Extensive genomic duplication during early chordate evolution . Nature Genetics . 31 . 2 . 200–4 . June 2002 . 12032567 . 10.1038/ng884 . 8263376 .
  49. Wolfe K . Robustness--it's not where you think it is . Nature Genetics . 25 . 1 . 3–4 . May 2000 . 10802639 . 10.1038/75560 . 85257685 .
  50. Singh PP, Affeldt S, Cascone I, Selimoglu R, Camonis J, Isambert H . On the expansion of "dangerous" gene repertoires by whole-genome duplications in early vertebrates . Cell Reports . 2 . 5 . 1387–98 . November 2012 . 23168259 . 10.1016/j.celrep.2012.09.034 . free .
  51. Malaguti G, Singh PP, Isambert H . On the retention of gene duplicates prone to dominant deleterious mutations . Theoretical Population Biology . 93 . 38–51 . May 2014 . 24530892 . 10.1016/j.tpb.2014.01.004 .
  52. Singh PP, Affeldt S, Malaguti G, Isambert H . Human dominant disease genes are enriched in paralogs originating from whole genome duplication . PLOS Computational Biology . 10 . 7 . e1003754 . July 2014 . 25080083 . 4117431 . 10.1371/journal.pcbi.1003754 . 2014PLSCB..10E3754S . free .
  53. McLysaght A, Makino T, Grayton HM, Tropeano M, Mitchell KJ, Vassos E, Collier DA . Ohnologs are overrepresented in pathogenic copy number mutations . Proceedings of the National Academy of Sciences of the United States of America . 111 . 1 . 361–6 . January 2014 . 24368850 . 3890797 . 10.1073/pnas.1309324111 . 2014PNAS..111..361M . free .
  54. Makino T, McLysaght A . Ohnologs in the human genome are dosage balanced and frequently associated with disease . Proceedings of the National Academy of Sciences of the United States of America . 107 . 20 . 9270–4 . May 2010 . 20439718 . 2889102 . 10.1073/pnas.0914697107 . 2010PNAS..107.9270M . free .
  55. Glover NM, Redestig H, Dessimoz C . Homoeologs: What Are They and How Do We Infer Them? . Trends in Plant Science . 21 . 7 . 609–621 . July 2016 . 27021699 . 4920642 . 10.1016/j.tplants.2016.02.005 . .
  56. García-Moreno J, Mindell DP . Rooting a phylogeny with homologous genes on opposite sex chromosomes (gametologs): a case study using avian CHD . Molecular Biology and Evolution . 17 . 12 . 1826–32 . December 2000 . 11110898 . 10.1093/oxfordjournals.molbev.a026283 . free .