Junk DNA explained
Junk DNA (non-functional DNA) is a DNA sequence that has no known biological function.[1] [2] [3] Most higher organisms have some junk DNA in their genomes, especially plants and animals, while bacteria and viruses have little or no junk DNA. That junk DNA consists mostly of pseudogenes, fragments of transposons and viruses, and repetitive DNA.[4]
Only about 1-2% of vertebrate genomes encode proteins. Additionally, non-protein coding regions such as genes for ribosomal RNA and transfer RNA, regulatory sequences such as promoters, origins of replication, centromeres, telomeres, and scaffold attachment regions are considered as functional elements, but they correspond only to a few percent of most eukaryotic genomes. Hence, a large fraction of eukaryotic genomes have no function.
An ongoing controversy over what constitutes function (and thus junk) centers on results from the ENCODE project, which defines function to include "transcription" or "biochemical activity", statements many scientists would disagree with.
The main evidence for junk DNA is that (1) many sequences can be deleted without consequence,[5] (2) large parts of genomes are conserved only in some species but not others,[6] [7] [8] and (3) repetitive sequences cannot carry much useful information.[9] The main objection to the concept of junk DNA is based on the observation that much of a genome is transcribed,[10] [11] [12] ignoring that transcription does not imply function.[13]
History
The idea that only a fraction of the human genome could be functional dates back to the late 1940s. The estimated mutation rate in humans suggested that if a large fraction of those mutations were deleterious then the human species could not survive such a mutation load (genetic load). This led to predictions in the late 1940s by one of the founders of population genetics, J.B.S. Haldane, and by Nobel laureate Hermann Muller, that only a small percentage of the human genome contains functional DNA elements (genes) that can be destroyed by mutation.[14] [15] (see Genetic load for more information)
In 1966 Muller reviewed these predictions and concluded that the human genome could only contain about 30,000 genes based on the number of deleterious mutations that the species could tolerate.[16] Similar predictions were made by other leading experts in molecular evolution who concluded that the human genome could not contain more than 40,000 genes and that less than 10% of the genome was functional.[17] [18] [6] [19]
The size of genomes in various species was known to vary considerably and there did not seem to be a correlation between genome size and the complexity of the species. Even closely related species could have very different genome sizes. This observation led to what came to be known as the C-value paradox.[20] The paradox was resolved with the discovery of repetitive DNA and the observation that most of the differences in genome size could be attributed to repetitive DNA.[21] Some scientists thought that most of the repetitive DNA was involved in regulating gene expression but many scientists thought that the excess repetitive DNA was nonfunctional.[22] [23] [24] [25]
At about the same time (late 1960s) the newly developed technique of C0t analysis was refined to include RNA:DNA hybridization leading to the discovery that considerably less than 10% of the human genome was complementary to mRNA and this DNA was in the unique (non-repetitive) fraction. This confirmed the predictions made from genetic load arguments and was consistent with the idea that much of the repetitive DNA is nonfunctional.[26] [27] [28]
The idea that large amounts of eukaryotic genomes could be nonfunctional conflicted with the prevailing view of evolution in 1968 since it seemed likely that nonfunctional DNA would be eliminated by natural selection. The development of the neutral theory and the nearly neutral theory provided a way out of this problem since it allowed for the preservation of slightly deleterious nonfunctional DNA in accordance with fundamental principles of population genetics.[29] [30] [31]
The term "junk DNA" began to be used in the late 1950s[32] but Susumu Ohno popularized the term in a 1972 paper titled "So much 'junk' DNA in our genome"[33] where he summarized the current evidence that had accumulated by then.[33] In a second paper that same year, he concluded that 90% of mammalian genomes consisted of nonfunctional DNA.[6] The case for junk DNA was summarized in a lengthy paper by David Comings in 1972 where he listed four reasons for proposing junk DNA:[34]
- some organisms have a lot more DNA than they seem to require (C-value paradox),
- current estimates of the number of genes (in 1972) are much less than the number that can be accommodated,
- the mutation load would be too large if all the DNA were functional, and
- some junk DNA clearly exists.
The discovery of introns in the 1970s seemed to confirm the views of junk DNA proponents because it meant that genes were very large and even huge genomes could not accommodate large numbers of genes. The proponents of junk DNA tended to dismiss intron sequences as mostly nonfunctional DNA (junk) but junk DNA opponents advanced a number of hypotheses attributing functions of various sort to intron sequences.[35] [36] [37] [38] [39]
By 1980 it was apparent that most of the repetitive DNA in the human genome was related to transposons. This prompted a series of papers and letters describing transposons as selfish DNA that acted as a parasite in genomes and produced no fitness advantage for the organism.[40] [41] [42] [43] [44]
Opponents of junk DNA interpreted these results as evidence that most of the genome is functional and they developed several hypotheses advocating that transposon sequences could benefit the organism or the species.[45] The most important opponent of junk DNA at this time was Thomas Cavalier-Smith who argued that the extra DNA was required to increase the volume of the nucleus in order to promote more efficient transport across the nuclear membrane.[46]
The positions of the two sides of the controversy hardened with one side believing that evolution was consistent with large amounts of junk DNA and the other side believing that natural selection should eliminate junk DNA. These differing views of evolution were highlighted in a letter from Thomas Jukes, a proponent of junk DNA, to Francis Crick on December 20, 1979:[47]
Dear Francis, I am sure that you realize how frightfully angry a lot of people will be if you say that much of the DNA is junk. The geneticists will be angry because they think that DNA is sacred. The Darwinian evolutionists will be outraged because they believe every change in DNA that is accepted in evolution is necessarily an adaptive change. To suggest anything else is an insult to the sacred memory of Darwin.
The other point of view was expressed by Roy John Britten and Kohne in their seminal paper on repetitive DNA.
A concept that is repugnant to us is that about half of the DNA of higher organisms is trivial or permanently inert (on an evolutionary timescale).
Functional vs non-functional
The main challenge of identifying junk DNA is to distinguish between "functional" and "non-functional " sequences. This is non-trivial, but there is some good evidence for both categories.
Functional
Protein-coding sequences are the most obvious functional sequences in genomes. However, they make up only 1-2% of most vertebrate genomes. However, there are also functional but non-coding DNA sequences such as regulatory sequences, origins of replication, and centromeres.[48] These sequences are usually conserved in evolution and make up another 3-8% of the human genome.[49]
The Encyclopedia of DNA Elements (ENCODE) project reported that detectable biochemical activity was observed in regions covering at least 80% of the human genome, with biochemical activity defined primarily as being transcribed.[50] While these findings were announced as the demise of junk DNA[51] [52] it is important to point out that transcription does not mean a sequence is "functional", analogous to some meaningless text that can be transcribed or copied without having any meaning.[53] [54] [55] [56] [57] [58] [59]
Non-functional
Non-functional DNA is rare in bacterial genomes which typically have an extremely high gene density, with only a few percent being not protein-coding.[60]
However, in most animal or plant genomes, a large fraction of DNA is non-functional, given that there is no obvious selective pressure on these sequences. More importantly, there is strong evidence that these sequences are not functional in other ways (using the human genome as example):
(1) Repetitive elements, especially mobile elements make up a large fraction of the human genome, such as LTR retrotransposons (8.3% of total genome), SINEs (13.1% of total genome) including Alu elements, LINEs (20.4% of total genome), SVAs (SINE-VNTR-Alu) and Class II DNA transposons (2.9% of total genome).[61] Many of these sequences are the descendents of ancient virus infections and are thus "non-functional" in terms of human genome function.
(2) Many sequences can be deleted as shown by comparing genomes. For instance, an analysis of 14,623 individuals identified 42,765 structural variants in the human genome of which 23.4% affected multiple genes (by deleting them or part of them). This study also found 47 deletions of >1 MB, showing that large chunks of the human genome can get deleted without obvious consequences.[62]
(3) Only a small fraction of the human genome is conserved, indicating that there is no strong (functional) selection pressure on these sequences, so they can rather freely mutate.[63] About 11% or less of the human genome is conserved[64] [65] and about 7% is under purifying selection.[66]
Opponents of junk DNA argue that biochemical activity detects functional regions of the genome that are not identified by sequence conservation or purifying selection.[67] [68] According to some scientists, until a region in question has been shown to have additional features, beyond what is expected of the null hypothesis, it should provisionally be labelled as non-functional.[69]
See also
Notes and References
- Eddy SR . The C-value paradox, junk DNA and ENCODE . Current Biology . 22 . 21 . R898–R899 . November 2012 . 23137679 . 10.1016/j.cub.2012.10.002 . 2012CBio...22.R898E . 28289437 . free . Sean Eddy .
- Palazzo AF, Gregory TR . The case for junk DNA . PLOS Genetics . 10 . 5 . e1004351 . May 2014 . 24809441 . 4014423 . 10.1371/journal.pgen.1004351 . free .
- Fagundes NJ, Bisso-Machado R, Figueiredo PI, Varal M, Zani AL . What We Talk About When We Talk About "Junk DNA" . Genome Biology and Evolution . 14 . 5 . evac055 . May 2022 . 35535669 . 9086759 . 10.1093/gbe/evac055 .
- Gil R, Latorre A . Factors behind junk DNA in bacteria . Genes . 3 . 4 . 634–650 . October 2012 . 24705080 . 3899985 . 10.3390/genes3040634 . free .
- Sudmant PH, Rausch T, Gardner EJ, Handsaker RE, Abyzov A, Huddleston J, Zhang Y, Ye K, Jun G, Fritz MH, Konkel MK, Malhotra A, Stütz AM, Shi X, Casale FP, Chen J, Hormozdiari F, Dayama G, Chen K, Malig M, Chaisson MJ, Walter K, Meiers S, Kashin S, Garrison E, Auton A, Lam HY, Mu XJ, Alkan C, Antaki D, Bae T, Cerveira E, Chines P, Chong Z, Clarke L, Dal E, Ding L, Emery S, Fan X, Gujral M, Kahveci F, Kidd JM, Kong Y, Lameijer EW, McCarthy S, Flicek P, Gibbs RA, Marth G, Mason CE, Menelaou A, Muzny DM, Nelson BJ, Noor A, Parrish NF, Pendleton M, Quitadamo A, Raeder B, Schadt EE, Romanovitch M, Schlattl A, Sebra R, Shabalin AA, Untergasser A, Walker JA, Wang M, Yu F, Zhang C, Zhang J, Zheng-Bradley X, Zhou W, Zichner T, Sebat J, Batzer MA, McCarroll SA, Mills RE, Gerstein MB, Bashir A, Stegle O, Devine SE, Lee C, Eichler EE, Korbel JO . An integrated map of structural variation in 2,504 human genomes . Nature . 526 . 7571 . 75–81 . October 2015 . 26432246 . 4617611 . 10.1038/nature15394 . 2015Natur.526...75. .
- Ohno S . 1972 . An argument for the genetic simplicity of man and other mammals . Journal of Human Evolution . 1 . 6 . 651–662 . 10.1016/0047-2484(72)90011-5 . 1972JHumE...1..651O .
- Morange M . Genome as a multipurpose structure built by evolution . Perspectives in Biology and Medicine . 57 . 1 . 162–171 . 2014 . 25345709 . 10.1353/pbm.2014.0008 . 27613442 .
- Palazzo AF, Kejiou NS . Non-Darwinian Molecular Biology . Frontiers in Genetics . 13 . 831068 . 2022 . 35251134 . 8888898 . 10.3389/fgene.2022.831068 . free .
- de Koning AP, Gu W, Castoe TA, Batzer MA, Pollock DD . Repetitive elements may comprise over two-thirds of the human genome . PLOS Genetics . 7 . 12 . e1002384 . December 2011 . 22144907 . 3228813 . 10.1371/journal.pgen.1002384 . Gregory P. . free . Copenhaver .
- Germain PL, Ratti E, and Boem F . 2014 . Junk or functional DNA? ENCODE and the function controversy . Biology & Philosophy . 29 . 6 . 807–821 . 10.1007/s10539-014-9441-3. 254277794 .
- Mattick JS . RNA out of the mist . Trends in Genetics . 39 . 3 . 187–207 . March 2023 . 36528415 . 10.1016/j.tig.2022.11.001 . 254768457 .
- Kellis M, Wold B, Snyder MP, Bernstein BE, Kundaje A, Marinov GK, Ward LD, Birney E, Crawford GE, Dekker J, Dunham I, Elnitski LL, Farnham PJ, Feingold EA, Gerstein M, Giddings MC, Gilbert DM, Gingeras TR, Green ED, Guigo R, Hubbard T, Kent J, Lieb JD, Myers RM, Pazin MJ, Ren B, Stamatoyannopoulos JA, Weng Z, White KP, Hardison RC . Defining functional DNA elements in the human genome . Proceedings of the National Academy of Sciences of the United States of America . 111 . 17 . 6131–6138 . April 2014 . 24753594 . 4035993 . 10.1073/pnas.1318948111 . 2014PNAS..111.6131K . free .
- Doolittle WF, Brunet TD . On causal roles and selected effects: our genome is mostly junk . BMC Biology . 15 . 1 . 116 . December 2017 . 29207982 . 5718017 . 10.1186/s12915-017-0460-9 . free .
- Muller HJ . Our load of mutations . American Journal of Human Genetics . 2 . 2 . 111–176 . June 1950 . 14771033 . 1716299 .
- Haldane . JBS . 1949 . The rate of mutation of human genes . Hereditas . 35 . 267–273 . 10.1111/j.1601-5223.1949.tb03339.x . free .
- Lin Y, Zhai T, Zhang X . Nanoscale heat transfer in direct nanopatterning into gold films by a nanosecond laser pulse . Optics Express . 22 . 7 . 8396–8404 . April 2014 . 24718213 . 10.1086/282445 . 84202145 . 2459205 .
- Kimura M . Evolutionary rate at the molecular level . Nature . 217 . 5129 . 624–626 . February 1968 . 5637732 . 10.1038/217624a0 . 4161261 . 1968Natur.217..624K .
- King JL, Jukes TH . Non-Darwinian evolution . Science . 164 . 3881 . 788–798 . May 1969 . 5767777 . 10.1126/science.164.3881.788 . 1969Sci...164..788L .
- Ohta T, Kimura M . Functional organization of genetic material as a product of molecular evolution . Nature . 233 . 5315 . 118–119 . September 1971 . 16063236 . 10.1038/233118a0 . 13344748 . 1971Natur.233..118O .
- Thomas CA . The genetic organization of chromosomes . Annual Review of Genetics . 5 . 237–256 . 1971 . 16097657 . 10.1146/annurev.ge.05.120171.001321 .
- Britten RJ, Kohne DE . Repeated sequences in DNA. Hundreds of thousands of copies of DNA sequences have been incorporated into the genomes of higher organisms . Science . 161 . 3841 . 529–540 . August 1968 . 4874239 . 10.1126/science.161.3841.529 . 1968Sci...161..529B .
- Britten RJ, Davidson EH . Gene regulation for higher cells: a theory . Science . 165 . 3891 . 349–357 . July 1969 . 5789433 . 10.1126/science.165.3891.349 . 1969Sci...165..349B .
- Book: Gregory, TR . 2005 . Genome Size Evolution in Animals . The Evolution of the Genome . 3–87 . Elsevier .
- Book: Lewin, Benjamin . 1974 . Chapter 4: Sequences of Eukaryotic DNA . Gene Expression-2: Eukaryotic Chromosomes . John Wiley & Sons .
- Lewin . Benjamin . 1974 . Sequence Organization of Eukaryotic DNA: Defining the Unit of Gene Expression . Cell . 1 . 3 . 107–111 . 10.1016/0092-8674(74)90125-1 .
- Book: Lewin, Benjamin . 1974 . Chapter 5: Transcription and Processing of RNA . Gene Expression-2: Eukaryotic Chromosomes . John Wiley & Sons .
- O'Brien SJ . On estimating functional gene number in eukaryotes . Nature . 242 . 115 . 52–54 . March 1973 . 4512011 . 10.1038/newbio242052a0 .
- Bishop JO . The gene numbers game . Cell . 2 . 2 . 81–86 . June 1974 . 4616752 . 10.1016/0092-8674(74)90095-6 .
- King JL, Jukes TH . Non-Darwinian evolution . Science . 164 . 3881 . 788–798 . May 1969 . 5767777 . 10.1126/science.164.3881.788 . 1969Sci...164..788L .
- Kimura M . Evolutionary rate at the molecular level . Nature . 217 . 5129 . 624–626 . February 1968 . 5637732 . 10.1038/217624a0 . 4161261 . 1968Natur.217..624K .
- Kimura M, Ohta T . Protein polymorphism as a phase of molecular evolution . Nature . 229 . 5285 . 467–469 . February 1971 . 4925204 . 10.1038/229467a0 . 4290427 . 1971Natur.229..467K .
- MA . Sweet . Amalia . 2022 . Requiem for a Gene: The Problem of Junk DNA for the Molecular Paradigm . University of Chicago.
- Ohno S . So much "junk" DNA in our genome . Brookhaven Symposia in Biology . 23 . 366–370 . 1972 . 5065367 .
- Book: Advances in human genetics . Comings DE . 1972 . Springer . 237–431 . The structure and function of chromatin.
- Book: Morange, Michel . 2020 . Chapter 17: Split Genes and Splicing . The Black Box of Biology: A History of the Molecular Revolution . Harvard University Press .
- Gilbert W . Why genes in pieces? . Nature . 271 . 5645 . 501 . February 1978 . 622185 . 10.1038/271501a0 . 4216649 . free . 1978Natur.271..501G .
- Gilbert W . Genes-in-pieces revisited . Science . 228 . 4701 . 823–824 . May 1985 . 4001923 . 10.1126/science.4001923 . 1985Sci...228..823G .
- Crick F . Split genes and RNA splicing . Science . 204 . 4390 . 264–271 . April 1979 . 373120 . 10.1126/science.373120 . 1979Sci...204..264C .
- Doolittle . W.F. . 1978 . Genes in pieces: were they ever together? . Nature . 272 . 5654 . 581–582 . 10.1038/272581a0 . 1978Natur.272..581D . 4162765 . free .
- Doolittle WF, Sapienza C . Selfish genes, the phenotype paradigm and genome evolution . Nature . 284 . 5757 . 601–603 . April 1980 . 6245369 . 10.1038/284601a0 . 4311366 . 1980Natur.284..601D .
- Orgel LE, Crick FH . Selfish DNA: the ultimate parasite . Nature . 284 . 5757 . 604–607 . April 1980 . 7366731 . 10.1038/284604a0 . 4233826 . 1980Natur.284..604O .
- Dover G . Ignorant DNA? . Nature . 285 . 5767 . 618–620 . June 1980 . 7393318 . 10.1038/285618a0 . 4261755 . free . 1980Natur.285..618D .
- Dover G, Doolittle WF . Modes of genome evolution . Nature . 288 . 5792 . 646–647 . December 1980 . 6256636 . 10.1038/288646a0 . 8938434 . free . 1980Natur.288..646D .
- Jain HK . Incidental DNA . Nature . 288 . 5792 . 647–648 . December 1980 . 7453799 . 10.1038/288647a0 . 31899622 . free . 1980Natur.288..647J .
- Cavalier-Smith T . How selfish is DNA? . Nature . 285 . 5767 . 617–618 . June 1980 . 7393317 . 10.1038/285617a0 . 27111068 . free . 1980Natur.285..617C .
- Cavalier-Smith T . Nuclear volume control by nucleoskeletal DNA, selection for cell volume and cell growth rate, and the solution of the DNA C-value paradox . Journal of Cell Science . 34 . 247–278 . December 1978 . 372199 . 10.1242/jcs.34.1.247 .
- Web site: letter to Francis Crick . Jukes . Thomas . December 29, 1979 . National Institutes of Health (USA) .
- Book: Watson J . Molecular Biology of the Gene . 1965 . W. A. Benjamin, Inc. . New York, New York, USA.
- Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S, Weinstock GM, Wilson RK, Gibbs RA, Kent WJ, Miller W, Haussler D . Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes . Genome Research . 15 . 8 . 1034–1050 . August 2005 . 16024819 . 1182216 . 10.1101/gr.3715005 .
- An integrated encyclopedia of DNA elements in the human genome . Nature . 489 . 7414 . 57–74 . September 2012 . 22955616 . 3439153 . 10.1038/nature11247 . 2012Natur.489...57T . The ENCODE Project Consortium . Dunham I, Kundaje A, Aldred SF, Collins PJ, Davis CA, Doyle F, etal .
- Pennisi E . Genomics. ENCODE project writes eulogy for junk DNA . Science . 337 . 6099 . 1159, 1161 . September 2012 . 22955811 . 10.1126/science.337.6099.1159 .
- Casane D, Fumey J, Laurenti P . [ENCODE apophenia or a panglossian analysis of the human genome] . Medecine Sciences . 31 . 6–7 . 680–686 . 2015 . 26152174 . 10.1051/medsci/20153106023 .
- News: February 24, 2013 . Scientists attacked over claim that 'junk DNA' is vital to life . The Observer . McKie R.
- Eddy SR . The ENCODE project: missteps overshadowing a success . Current Biology . 23 . 7 . R259–R261 . April 2013 . 23578867 . 10.1016/j.cub.2013.03.023 . free . 2013CBio...23.R259E .
- Doolittle WF . Is junk DNA bunk? A critique of ENCODE . Proceedings of the National Academy of Sciences of the United States of America . 110 . 14 . 5294–5300 . April 2013 . 23479647 . 3619371 . 10.1073/pnas.1221376110 . 2013PNAS..110.5294D . free . W. Ford Doolittle .
- Brunet TD, Doolittle WF . Getting "function" right . Proceedings of the National Academy of Sciences of the United States of America . 111 . 33 . E3365 . August 2014 . 25107292 . 4143013 . 10.1073/pnas.1409762111 . free . 2014PNAS..111E3365P .
- Graur D, Zheng Y, Price N, Azevedo RB, Zufall RA, Elhaik E . On the immortality of television sets: "function" in the human genome according to the evolution-free gospel of ENCODE . Genome Biology and Evolution . 5 . 3 . 578–590 . 2013 . 23431001 . 3622293 . 10.1093/gbe/evt028 .
- Doolittle WF, Brunet TD, Linquist S, Gregory TR . Distinguishing between "function" and "effect" in genome biology . Genome Biology and Evolution . 6 . 5 . 1234–1237 . May 2014 . 24814287 . 4041003 . 10.1093/gbe/evu098 .
- Niu DK, Jiang L . Can ENCODE tell us how much junk DNA we carry in our genome? . Biochemical and Biophysical Research Communications . 430 . 4 . 1340–1343 . January 2013 . 23268340 . 10.1016/j.bbrc.2012.12.074 .
- Zhao Z, Cristian A, Rosen G . Keeping up with the genomes: efficient learning of our increasing knowledge of the tree of life . BMC Bioinformatics . 21 . 1 . 412 . September 2020 . 32957925 . 7507296 . 10.1186/s12859-020-03744-7 . free .
- Treangen TJ, Salzberg SL . Repetitive DNA and next-generation sequencing: computational challenges and solutions . Nature Reviews. Genetics . 13 . 1 . 36–46 . November 2011 . 22124482 . 3324860 . 10.1038/nrg3117 .
- Abel HJ, Larson DE, Regier AA, Chiang C, Das I, Kanchi KL, Layer RM, Neale BM, Salerno WJ, Reeves C, Buyske S, Matise TC, Muzny DM, Zody MC, Lander ES, Dutcher SK, Stitziel NO, Hall IM . Mapping and characterization of structural variation in 17,795 human genomes . Nature . 583 . 7814 . 83–89 . July 2020 . 32460305 . 7547914 . 10.1038/s41586-020-2371-0 . 2020Natur.583...83A .
- Book: Graur D . 2016 . Molecular and Genome Evolution . Sinauer Associates, Inc. . Sunderland MA (USA) . 9781605354699.
- Rands CM, Meader S, Ponting CP, Lunter G . 8.2% of the Human genome is constrained: variation in rates of turnover across functional element classes in the human lineage . PLOS Genetics . 10 . 7 . e1004525 . July 2014 . 25057982 . 4109858 . 10.1371/journal.pgen.1004525 . free .
- Christmas MJ, Kaplow IM, Genereux DP, Dong MX, Hughes GM, Li X, Sullivan PF, Hindle AG, Andrews G, Armstrong JC, Bianchi M, Breit AM, Diekhans M, Fanter C, Foley NM, Goodman DB, Goodman L, Keough KC, Kirilenko B, Kowalczyk A, Lawless C, Lind AL, Meadows JR, Moreira LR, Redlich RW, Ryan L, Swofford R, Valenzuela A, Wagner F, Wallerman O, Brown AR, Damas J, Fan K, Gatesy J, Grimshaw J, Johnson J, Kozyrev SV, Lawler AJ, Marinescu VD, Morrill KM, Osmanski A, Paulat NS, Phan BN, Reilly SK, Schäffer DE, Steiner C, Supple MA, Wilder AP, Wirthlin ME, Xue JR, Birren BW, Gazal S, Hubley RM, Koepfli KP, Marques-Bonet T, Meyer WK, Nweeia M, Sabeti PC, Shapiro B, Smit AF, Springer MS, Teeling EC, Weng Z, Hiller M, Levesque DL, Lewin HA, Murphy WJ, Navarro A, Paten B, Pollard KS, Ray DA, Ruf I, Ryder OA, Pfenning AR, Lindblad-Toh K, Karlsson EK . Evolutionary constraint and innovation across hundreds of placental mammals . Science . 380 . 6643 . eabn3943 . April 2023 . 37104599 . 10250106 . 10.1126/science.abn3943 . free . 10230/59591 .
- Halldorsson BV, Eggertsson HP, Moore KH, Hauswedell H, Eiriksson O, Ulfarsson MO, Palsson G, Hardarson MT, Oddsson A, Jensson BO, Kristmundsdottir S, Sigurpalsdottir BD, Stefansson OA, Beyter D, Holley G, Tragante V, Gylfason A, Olason PI, Zink F, Asgeirsdottir M, Sverrisson ST, Sigurdsson B, Gudjonsson SA, Sigurdsson GT, Halldorsson GH, Sveinbjornsson G, Norland K, Styrkarsdottir U, Magnusdottir DN, Snorradottir S, Kristinsson K, Sobech E, Jonsson H, Geirsson AJ, Olafsson I, Jonsson P, Pedersen OB, Erikstrup C, Brunak S, Ostrowski SR, Thorleifsson G, Jonsson F, Melsted P, Jonsdottir I, Rafnar T, Holm H, Stefansson H, Saemundsdottir J, Gudbjartsson DF, Magnusson OT, Masson G, Thorsteinsdottir U, Helgason A, Jonsson H, Sulem P, Stefansson K . The sequences of 150,119 genomes in the UK Biobank . Nature . 607 . 7920 . 732–740 . July 2022 . 35859178 . 9329122 . 10.1038/s41586-022-04965-x . free . 2022Natur.607..732H . 20.500.11815/3726 .
- Liu G, Mattick JS, Taft RJ . A meta-analysis of the genomic and transcriptomic composition of complex life . Cell Cycle . 12 . 13 . 2061–2072 . July 2013 . 4685169 . 10.1186/1877-6566-7-2 . free . 23759593 .
- Mattick JS . A Kuhnian revolution in molecular biology: Most genes in complex organisms express regulatory RNAs . BioEssays . 45 . 9 . e2300080 . September 2023 . 37318305 . 10.1002/bies.202300080 . free .
- Palazzo AF, Lee ES . Non-coding RNA: what is functional and what is junk? . Frontiers in Genetics . 6 . 2 . 2015 . 25674102 . 4306305 . 10.3389/fgene.2015.00002 . free .