Population genetics explained

Population genetics is a subfield of genetics that deals with genetic differences within and among populations, and is a part of evolutionary biology. Studies in this branch of biology examine such phenomena as adaptation, speciation, and population structure.^[1]

Population genetics was a vital ingredient in the emergence of the modern evolutionary synthesis. Its primary founders were Sewall Wright, J. B. S. Haldane and Ronald Fisher, who also laid the foundations for the related discipline of quantitative genetics. Traditionally a highly mathematical discipline, modern population genetics encompasses theoretical, laboratory, and field work. Population genetic models are used both for statistical inference from DNA sequence data and for proof/disproof of concept.^[2]

What sets population genetics apart from newer, more phenotypic approaches to modelling evolution, such as evolutionary game theory and adaptive dynamics, is its emphasis on such genetic phenomena as dominance, epistasis, the degree to which genetic recombination breaks linkage disequilibrium, and the random phenomena of mutation and genetic drift. This makes it appropriate for comparison to population genomics data.

History

Population genetics began as a reconciliation of Mendelian inheritance and biostatistics models. Natural selection will only cause evolution if there is enough genetic variation in a population. Before the discovery of Mendelian genetics, one common hypothesis was blending inheritance. But with blending inheritance, genetic variance would be rapidly lost, making evolution by natural or sexual selection implausible. The Hardy–Weinberg principle provides the solution to how variation is maintained in a population with Mendelian inheritance. According to this principle, the frequencies of alleles (variations in a gene) will remain constant in the absence of selection, mutation, migration and genetic drift.^[3]

The next key step was the work of the British biologist and statistician Ronald Fisher. In a series of papers starting in 1918 and culminating in his 1930 book The Genetical Theory of Natural Selection, Fisher showed that the continuous variation measured by the biometricians could be produced by the combined action of many discrete genes, and that natural selection could change allele frequencies in a population, resulting in evolution. In a series of papers beginning in 1924, another British geneticist, J. B. S. Haldane, worked out the mathematics of allele frequency change at a single gene locus under a broad range of conditions. Haldane also applied statistical analysis to real-world examples of natural selection, such as peppered moth evolution and industrial melanism, and showed that selection coefficients could be larger than Fisher assumed, leading to more rapid adaptive evolution as a camouflage strategy following increased pollution.^[4] ^[5]

The American biologist Sewall Wright, who had a background in animal breeding experiments, focused on combinations of interacting genes, and the effects of inbreeding on small, relatively isolated populations that exhibited genetic drift. In 1932 Wright introduced the concept of an adaptive landscape and argued that genetic drift and inbreeding could drive a small, isolated sub-population away from an adaptive peak, allowing natural selection to drive it towards different adaptive peaks.

The work of Fisher, Haldane and Wright founded the discipline of population genetics. This integrated natural selection with Mendelian genetics, which was the critical first step in developing a unified theory of how evolution worked.^[4] ^[5] John Maynard Smith was Haldane's pupil, whilst W. D. Hamilton was influenced by the writings of Fisher. The American George R. Price worked with both Hamilton and Maynard Smith. American Richard Lewontin and Japanese Motoo Kimura were influenced by Wright and Haldane.

Modern synthesis

See main article: Modern synthesis (20th century).

The mathematics of population genetics were originally developed as the beginning of the modern synthesis. Authors such as Beatty^[6] have asserted that population genetics defines the core of the modern synthesis. For the first few decades of the 20th century, most field naturalists continued to believe that Lamarckism and orthogenesis provided the best explanation for the complexity they observed in the living world.^[7] During the modern synthesis, these ideas were purged, and only evolutionary causes that could be expressed in the mathematical framework of population genetics were retained.^[8] Consensus was reached as to which evolutionary factors might influence evolution, but not as to the relative importance of the various factors.

Theodosius Dobzhansky, a postdoctoral worker in T. H. Morgan's lab, had been influenced by the work on genetic diversity by Russian geneticists such as Sergei Chetverikov. He helped to bridge the divide between the foundations of microevolution developed by the population geneticists and the patterns of macroevolution observed by field biologists, with his 1937 book Genetics and the Origin of Species. Dobzhansky examined the genetic diversity of wild populations and showed that, contrary to the assumptions of the population geneticists, these populations had large amounts of genetic diversity, with marked differences between sub-populations. The book also took the highly mathematical work of the population geneticists and put it into a more accessible form. Many more biologists were influenced by population genetics via Dobzhansky than were able to read the highly mathematical works in the original.^[9]

In Great Britain E. B. Ford, the pioneer of ecological genetics,^[10] continued throughout the 1930s and 1940s to empirically demonstrate the power of selection due to ecological factors including the ability to maintain genetic diversity through genetic polymorphisms such as human blood types. Ford's work, in collaboration with Fisher, contributed to a shift in emphasis during the modern synthesis towards natural selection as the dominant force.^[4] ^[5] ^[11] ^[12]

Neutral theory and origin-fixation dynamics

The original, modern synthesis view of population genetics assumes that mutations provide ample raw material, and focuses only on the change in frequency of alleles within populations.^[13] The main processes influencing allele frequencies are natural selection, genetic drift, gene flow and recurrent mutation. Fisher and Wright had some fundamental disagreements about the relative roles of selection and drift.^[14] The availability of molecular data on all genetic differences led to the neutral theory of molecular evolution. In this view, many mutations are deleterious and so never observed, and most of the remainder are neutral, i.e. are not under selection. With the fate of each neutral mutation left to chance (genetic drift), the direction of evolutionary change is driven by which mutations occur, and so cannot be captured by models of change in the frequency of (existing) alleles alone.^[15]

The origin-fixation view of population genetics generalizes this approach beyond strictly neutral mutations, and sees the rate at which a particular change happens as the product of the mutation rate and the fixation probability.

Four processes

Selection

Natural selection, which includes sexual selection, is the fact that some traits make it more likely for an organism to survive and reproduce. Population genetics describes natural selection by defining fitness as a propensity or probability of survival and reproduction in a particular environment. The fitness is normally given by the symbol w=1-s where s is the selection coefficient. Natural selection acts on phenotypes, so population genetic models assume relatively simple relationships to predict the phenotype and hence fitness from the allele at one or a small number of loci. In this way, natural selection converts differences in the fitness of individuals with different phenotypes into changes in allele frequency in a population over successive generations.

Before the advent of population genetics, many biologists doubted that small differences in fitness were sufficient to make a large difference to evolution.^[9] Population geneticists addressed this concern in part by comparing selection to genetic drift. Selection can overcome genetic drift when s is greater than 1 divided by the effective population size. When this criterion is met, the probability that a new advantageous mutant becomes fixed is approximately equal to 2s.^[16] ^[17] The time until fixation of such an allele is approximately

(2log(sN)+\gamma)/s

.^[18]

Dominance

Dominance means that the phenotypic and/or fitness effect of one allele at a locus depends on which allele is present in the second copy for that locus. Consider three genotypes at one locus, with the following fitness values^[19]

-	Genotype:	A₁A₁	A₁A₂	A₂A₂	-	Relative fitness:	1	1-hs	1-s

s is the selection coefficient and h is the dominance coefficient. The value of h yields the following information:

-	h=0	A₁ dominant, A₂ recessive	-	h=1	A₂ dominant, A₁ recessive	-	0	incomplete dominance	-	h<0		-	h>1

Epistasis

Epistasis means that the phenotypic and/or fitness effect of an allele at one locus depends on which alleles are present at other loci. Selection does not act on a single locus, but on a phenotype that arises through development from a complete genotype.^[20] However, many population genetics models of sexual species are "single locus" models, where the fitness of an individual is calculated as the product of the contributions from each of its loci—effectively assuming no epistasis.

In fact, the genotype to fitness landscape is more complex. Population genetics must either model this complexity in detail, or capture it by some simpler average rule. Empirically, beneficial mutations tend to have a smaller fitness benefit when added to a genetic background that already has high fitness: this is known as diminishing returns epistasis.^[21] When deleterious mutations also have a smaller fitness effect on high fitness backgrounds, this is known as "synergistic epistasis". However, the effect of deleterious mutations tends on average to be very close to multiplicative, or can even show the opposite pattern, known as "antagonistic epistasis".^[22]

Synergistic epistasis is central to some theories of the purging of mutation load^[23] and to the evolution of sexual reproduction.

Mutation

The genetic process of mutation takes place within an individual, resulting in heritable changes to the genetic material. This process is often characterized by a description of the starting and ending states, or the kind of change that has happened at the level of DNA (e.g,. a T-to-C mutation, a 1-bp deletion), of genes or proteins (e.g., a null mutation, a loss-of-function mutation), or at a higher phenotypic level (e.g., red-eye mutation). Single-nucleotide changes are frequently the most common type of mutation, but many other types of mutation are possible, and they occur at widely varying rates that may show systematic asymmetries or biases (mutation bias).

Mutations can involve large sections of DNA becoming duplicated, usually through genetic recombination.^[24] This leads to copy-number variation within a population. Duplications are a major source of raw material for evolving new genes.^[25] Other types of mutation occasionally create new genes from previously noncoding DNA.^[26] ^[27]

In the distribution of fitness effects (DFE) for new mutations, only a minority of mutations are beneficial. Mutations with gross effects are typically deleterious. Studies in the fly Drosophila melanogaster suggest that if a mutation changes a protein produced by a gene, this will probably be harmful, with about 70 percent of these mutations having damaging effects, and the remainder being either neutral or weakly beneficial.^[28]

This biological process of mutation is represented in population-genetic models in one of two ways, either as a deterministic pressure of recurrent mutation on allele frequencies, or a source of variation. In deterministic theory, evolution begins with a predetermined set of alleles and proceeds by shifts in continuous frequencies, as if the population is infinite. The occurrence of mutations in individuals is represented by a population-level "force" or "pressure" of mutation, i.e., the force of innumerable events of mutation with a scaled magnitude u applied to shifting frequencies f(A1) to f(A2). For instance, in the classic mutation–selection balance model,^[29] the force of mutation pressure pushes the frequency of an allele upward, and selection against its deleterious effects pushes the frequency downward, so that a balance is reached at equilibrium, given (in the simplest case) by f = u/s.

This concept of mutation pressure is mostly useful for considering the implications of deleterious mutation, such as the mutation load and its implications for the evolution of the mutation rate.^[30] Transformation of populations by mutation pressure is unlikely. Haldane^[31] argued that it would require high mutation rates unopposed by selection, and Kimura^[32] concluded even more pessimistically that even this was unlikely, as the process would take too long (see evolution by mutation pressure).

However, evolution by mutation pressure is possible under some circumstances and has long been suggested as a possible cause for the loss of unused traits.^[33] For example, pigments are no longer useful when animals live in the darkness of caves, and tend to be lost.^[34] An experimental example involves the loss of sporulation in experimental populations of B. subtilis. Sporulation is a complex trait encoded by many loci, such that the mutation rate for loss of the trait was estimated as an unusually high value,

\mu=0.003

.^[35] Loss of sporulation in this case can occur by recurrent mutation, without requiring selection for the loss of sporulation ability. When there is no selection for loss of function, the speed at which loss evolves depends more on the mutation rate than it does on the effective population size,^[36] indicating that it is driven more by mutation than by genetic drift.

The role of mutation as a source of novelty is different from these classical models of mutation pressure. When population-genetic models include a rate-dependent process of mutational introduction or origination, i.e., a process that introduces new alleles including neutral and beneficial ones, then the properties of mutation may have a more direct impact on the rate and direction of evolution, even if the rate of mutation is very low.^[37] ^[38] That is, the spectrum of mutation may become very important, particularly mutation biases, predictable differences in the rates of occurrence for different types of mutations, because bias in the introduction of variation can impose biases on the course of evolution.^[39]

Mutation plays a key role in other classical and recent theories including Muller%27s ratchet, subfunctionalization, Eigen's concept of an error catastrophe and Lynch's mutational hazard hypothesis.

Genetic drift

See main article: Genetic drift.

Genetic drift is a change in allele frequencies caused by random sampling.^[40] That is, the alleles in the offspring are a random sample of those in the parents.^[41] Genetic drift may cause gene variants to disappear completely, and thereby reduce genetic variability. In contrast to natural selection, which makes gene variants more common or less common depending on their reproductive success,^[42] the changes due to genetic drift are not driven by environmental or adaptive pressures, and are equally likely to make an allele more common as less common.

The effect of genetic drift is larger for alleles present in few copies than when an allele is present in many copies. The population genetics of genetic drift are described using either branching processes or a diffusion equation describing changes in allele frequency.^[43] These approaches are usually applied to the Wright-Fisher and Moran models of population genetics. Assuming genetic drift is the only evolutionary force acting on an allele, after t generations in many replicated populations, starting with allele frequencies of p and q, the variance in allele frequency across those populations is

V_t ≈ pq\left(1-\exp\left\{-

	t
	2N_e

\right\}\right).

^[44]

Ronald Fisher held the view that genetic drift plays at the most a minor role in evolution, and this remained the dominant view for several decades. No population genetics perspective have ever given genetic drift a central role by itself, but some have made genetic drift important in combination with another non-selective force. The shifting balance theory of Sewall Wright held that the combination of population structure and genetic drift was important. Motoo Kimura's neutral theory of molecular evolution claims that most genetic differences within and between populations are caused by the combination of neutral mutations and genetic drift.^[45]

The role of genetic drift by means of sampling error in evolution has been criticized by John H Gillespie^[46] and Will Provine,^[47] who argue that selection on linked sites is a more important stochastic force, doing the work traditionally ascribed to genetic drift by means of sampling error. The mathematical properties of genetic draft are different from those of genetic drift.^[48] The direction of the random change in allele frequency is autocorrelated across generations.

Gene flow

See main article: Gene flow.

Because of physical barriers to migration, along with the limited tendency for individuals to move or spread (vagility), and tendency to remain or come back to natal place (philopatry), natural populations rarely all interbreed as may be assumed in theoretical random models (panmixy).^[49] There is usually a geographic range within which individuals are more closely related to one another than those randomly selected from the general population. This is described as the extent to which a population is genetically structured.^[50]

Genetic structuring can be caused by migration due to historical climate change, species range expansion or current availability of habitat. Gene flow is hindered by mountain ranges, oceans and deserts or even human-made structures such as the Great Wall of China, which has hindered the flow of plant genes.^[51]

Gene flow is the exchange of genes between populations or species, breaking down the structure. Examples of gene flow within a species include the migration and then breeding of organisms, or the exchange of pollen. Gene transfer between species includes the formation of hybrid organisms and horizontal gene transfer. Population genetic models can be used to identify which populations show significant genetic isolation from one another, and to reconstruct their history.^[52]

Subjecting a population to isolation leads to inbreeding depression. Migration into a population can introduce new genetic variants,^[53] potentially contributing to evolutionary rescue. If a significant proportion of individuals or gametes migrate, it can also change allele frequencies, e.g. giving rise to migration load.^[54]

In the presence of gene flow, other barriers to hybridization between two diverging populations of an outcrossing species are required for the populations to become new species.

Horizontal gene transfer

See main article: Horizontal gene transfer.

Horizontal gene transfer is the transfer of genetic material from one organism to another organism that is not its offspring; this is most common among prokaryotes.^[55] In medicine, this contributes to the spread of antibiotic resistance, as when one bacteria acquires resistance genes it can rapidly transfer them to other species.^[56] Horizontal transfer of genes from bacteria to eukaryotes such as the yeast Saccharomyces cerevisiae and the adzuki bean beetle Callosobruchus chinensis may also have occurred.^[57] ^[58] An example of larger-scale transfers are the eukaryotic bdelloid rotifers, which appear to have received a range of genes from bacteria, fungi, and plants.^[59] Viruses can also carry DNA between organisms, allowing transfer of genes even across biological domains.^[60] Large-scale gene transfer has also occurred between the ancestors of eukaryotic cells and prokaryotes, during the acquisition of chloroplasts and mitochondria.^[61]

Linkage

If all genes are in linkage equilibrium, the effect of an allele at one locus can be averaged across the gene pool at other loci. In reality, one allele is frequently found in linkage disequilibrium with genes at other loci, especially with genes located nearby on the same chromosome. Recombination breaks up this linkage disequilibrium too slowly to avoid genetic hitchhiking, where an allele at one locus rises to high frequency because it is linked to an allele under selection at a nearby locus. Linkage also slows down the rate of adaptation, even in sexual populations.^[62] ^[63] ^[64] The effect of linkage disequilibrium in slowing down the rate of adaptive evolution arises from a combination of the Hill–Robertson effect (delays in bringing beneficial mutations together) and background selection (delays in separating beneficial mutations from deleterious hitchhikers).

Linkage is a problem for population genetic models that treat one gene locus at a time. It can, however, be exploited as a method for detecting the action of natural selection via selective sweeps.

In the extreme case of an asexual population, linkage is complete, and population genetic equations can be derived and solved in terms of a travelling wave of genotype frequencies along a simple fitness landscape.^[65] Most microbes, such as bacteria, are asexual. The population genetics of their adaptation have two contrasting regimes. When the product of the beneficial mutation rate and population size is small, asexual populations follow a "successional regime" of origin-fixation dynamics, with adaptation rate strongly dependent on this product. When the product is much larger, asexual populations follow a "concurrent mutations" regime with adaptation rate less dependent on the product, characterized by clonal interference and the appearance of a new beneficial mutation before the last one has fixed.

Applications

Explaining levels of genetic variation

Neutral theory predicts that the level of nucleotide diversity in a population will be proportional to the product of the population size and the neutral mutation rate. The fact that levels of genetic diversity vary much less than population sizes do is known as the "paradox of variation".^[66] While high levels of genetic diversity were one of the original arguments in favor of neutral theory, the paradox of variation has been one of the strongest arguments against neutral theory.

It is clear that levels of genetic diversity vary greatly within a species as a function of local recombination rate, due to both genetic hitchhiking and background selection. Most current solutions to the paradox of variation invoke some level of selection at linked sites.^[67] For example, one analysis suggests that larger populations have more selective sweeps, which remove more neutral genetic diversity.^[68] A negative correlation between mutation rate and population size may also contribute.^[69]

Life history affects genetic diversity more than population history does, e.g. r-strategists have more genetic diversity.

Detecting selection

Population genetics models are used to infer which genes are undergoing selection. One common approach is to look for regions of high linkage disequilibrium and low genetic variance along the chromosome, to detect recent selective sweeps.

A second common approach is the McDonald–Kreitman test which compares the amount of variation within a species (polymorphism) to the divergence between species (substitutions) at two types of sites; one assumed to be neutral. Typically, synonymous sites are assumed to be neutral.^[70] Genes undergoing positive selection have an excess of divergent sites relative to polymorphic sites. The test can also be used to obtain a genome-wide estimate of the proportion of substitutions that are fixed by positive selection, α.^[71] ^[72] According to the neutral theory of molecular evolution, this number should be near zero. High numbers have therefore been interpreted as a genome-wide falsification of neutral theory.^[73]

Demographic inference

The simplest test for population structure in a sexually reproducing, diploid species, is to see whether genotype frequencies follow Hardy-Weinberg proportions as a function of allele frequencies. For example, in the simplest case of a single locus with two alleles denoted A and a at frequencies p and q, random mating predicts freq(AA) = p² for the AA homozygotes, freq(aa) = q² for the aa homozygotes, and freq(Aa) = 2pq for the heterozygotes. In the absence of population structure, Hardy-Weinberg proportions are reached within 1–2 generations of random mating. More typically, there is an excess of homozygotes, indicative of population structure. The extent of this excess can be quantified as the inbreeding coefficient, F.

Individuals can be clustered into K subpopulations.^[74] ^[75] The degree of population structure can then be calculated using F_ST, which is a measure of the proportion of genetic variance that can be explained by population structure. Genetic population structure can then be related to geographic structure, and genetic admixture can be detected.

Coalescent theory relates genetic diversity in a sample to demographic history of the population from which it was taken. It normally assumes neutrality, and so sequences from more neutrally evolving portions of genomes are therefore selected for such analyses. It can be used to infer the relationships between species (phylogenetics), as well as the population structure, demographic history (e.g. population bottlenecks, population growth), biological dispersal, source–sink dynamics^[76] and introgression within a species.

Another approach to demographic inference relies on the allele frequency spectrum.^[77]

Evolution of genetic systems

By assuming that there are loci that control the genetic system itself, population genetic models are created to describe the evolution of dominance and other forms of robustness, the evolution of sexual reproduction and recombination rates, the evolution of mutation rates, the evolution of evolutionary capacitors, the evolution of costly signalling traits, the evolution of ageing, and the evolution of co-operation. For example, most mutations are deleterious, so the optimal mutation rate for a species may be a trade-off between the damage from a high deleterious mutation rate and the metabolic costs of maintaining systems to reduce the mutation rate, such as DNA repair enzymes.^[78]

One important aspect of such models is that selection is only strong enough to purge deleterious mutations and hence overpower mutational bias towards degradation if the selection coefficient s is greater than the inverse of the effective population size. This is known as the drift barrier and is related to the nearly neutral theory of molecular evolution. Drift barrier theory predicts that species with large effective population sizes will have highly streamlined, efficient genetic systems, while those with small population sizes will have bloated and complex genomes containing for example introns and transposable elements.^[79] However, somewhat paradoxically, species with large population sizes might be so tolerant to the consequences of certain types of errors that they evolve higher error rates, e.g. in transcription and translation, than small populations.^[80]

External links

Population Genetics Tutorials (archived 23 January 2015)
Molecular population genetics
The ALlele FREquency Database at Yale University
EHSTRAFD.org – Earth Human STR Allele Frequencies Database (archived 13 July 2009)
History of population genetics
How Selection Changes the Genetic Composition of Population, video of lecture by Stephen C. Stearns (Yale University)
National Geographic: Atlas of the Human Journey (Haplogroup-based human migration maps)

Notes and References

Web site: Population genetics - Latest research and news . www.nature.com . 2018-01-29.
Servedio . Maria R. . Maria Servedio . Brandvain . Yaniv . Dhole . Sumit . Fitzpatrick . Courtney L. . Goldberg . Emma E. . Stern . Caitlin A. . Van Cleve . Jeremy . Yeh . D. Justin . Not Just a Theory—The Utility of Mathematical Models in Evolutionary Biology . PLOS Biology . 9 December 2014 . 12 . 12 . e1002017 . 10.1371/journal.pbio.1002017 . 25489940 . 4260780 . free .
Book: Ewens, W.J. . 2004 . Mathematical Population Genetics . Springer . New York . 978-0-387-20191-7 . 2nd .
Book: Bowler, Peter J. . Peter J. Bowler . Evolution : the history of an idea . 2003 . University of California Press . Berkeley . 978-0-520-23693-6 . 3rd . 325–339 .
Book: Larson, Edward J. . Evolution : the remarkable history of a scientific theory . 2004 . Modern Library . New York . 978-0-679-64288-6 . Modern Library . 221–243 .
Book: Beatty, John . Integrating Scientific Disciplines . 2 . Springer Netherlands . 9789024733422 . 125–135 . The Synthesis and the Synthetic Theory. 10.1007/978-94-010-9435-1_7 . Science and Philosophy . 1986 .
Book: Mayr . Ernst . Ernst Mayr . William B. . Provine . The Evolutionary synthesis : perspectives on the unification of biology . 1998 . Harvard University Press . Cambridge, Massachusetts . 9780674272262 . 295–298 . [New ed]..
Book: Provine, W. B. . 1988 . Evolutionary progress . Progress in evolution and meaning in life . 49–79 . University of Chicago Press.
Provine . William B. . 1978 . The role of mathematical population geneticists in the evolutionary synthesis of the 1930s and 1940s . Studies of the History of Biology . 2 . 167–192. 11610409 .
Book: Ford, E. B. . E.B. Ford . 1964 . 4th . 1975 . Ecological genetics . Chapman and Hall . London . 1ff.
Book: Mayr, Ernst . Ernst Mayr . 1988 . Toward a New Philosophy of Biology: Observations of an Evolutionist . Cambridge, Massachusetts . . 402 . 978-0-674-89665-9 . Toward a New Philosophy of Biology .
Book: Mayr . Ernst . Ernst Mayr . Provine . William B. . The Evolutionary Synthesis : perspectives on the unification of biology . 1998 . Harvard University Press . Cambridge, Massachusetts . 9780674272262 . 338–341 . [New ed]..
McCandlish . David M. . Stoltzfus . Arlin . Modeling Evolution Using the Probability of Fixation: History and Implications . The Quarterly Review of Biology . September 2014 . 89 . 3 . 225–252 . 10.1086/677571. 25195318 . 19619966 .
Crow . James F. . Wright and Fisher on Inbreeding and Random Drift . Genetics . 184 . 3 . 2010 . 609–611 . 0016-6731 . 10.1534/genetics.109.110023. 20332416 . 2845331 . free .
Casillas . Sònia . Barbadilla . Antonio . Molecular Population Genetics . Genetics . 2017 . 205 . 3 . 1003–1035 . 10.1534/genetics.116.196493. 28270526 . 5340319 .
Haldane . J. B. S. . J. B. S. Haldane . A Mathematical Theory of Natural and Artificial Selection, Part V: Selection and Mutation . Mathematical Proceedings of the Cambridge Philosophical Society . 1927 . 23 . 838–844 . 1927PCPS...23..838H . 10.1017/S0305004100015644 . 7. 86716613 .
Orr . H. A. . The population genetics of beneficial mutations . 10.1098/rstb.2009.0282 . Philosophical Transactions of the Royal Society B: Biological Sciences . 365 . 1544 . 1195–1201 . 2010 . 20308094. 2871816.
Hermisson . J. . Pennings . P. S. . Soft sweeps: molecular population genetics of adaptation from standing genetic variation . Genetics . 2005 . 169 . 2335–2352 . 10.1534/genetics.104.036947 . 15716498 . 4 . 1449620 .
Book: Gillespie, John . John H. Gillespie . Population Genetics: A Concise Guide . 2nd . Johns Hopkins University Press . 2004 . 978-0-8018-8008-7.
Miko . I. . 2008 . Epistasis: Gene interaction and phenotype effects . Nature Education . 1 . 1 . 197 .
Berger . D. . Postma . E. . Biased Estimates of Diminishing-Returns Epistasis? Empirical Evidence Revisited . Genetics . 13 October 2014 . 198 . 4 . 1417–1420 . 10.1534/genetics.114.169870 . 25313131 . 4256761.
Kouyos . Roger D. . Silander . Olin K. . Bonhoeffer . Sebastian . Epistasis between deleterious mutations and the evolution of recombination . Trends in Ecology & Evolution . June 2007 . 22 . 6 . 308–315 . 10.1016/j.tree.2007.02.014. 17337087 . 2007TEcoE..22..308K .
Crow . J. F. . The high spontaneous mutation rate: is it a health risk? . Proceedings of the National Academy of Sciences of the United States of America . 5 August 1997 . 94 . 16 . 8380–8386 . 9237985 . 10.1073/pnas.94.16.8380 . 33757. 1997PNAS...94.8380C . free .
10.1038/nrg2593 . 19597530 . 10 . 8 . 551–564 . Hastings . P. J. . Mechanisms of change in gene copy number . Nature Reviews Genetics . 2009 . Lupski . J. R. . Rosenberg . S. M. . Ira . G. . 2864001 .
Long . M. . Betrán . E. . Thornton . K. . Wang . W. . The origin of new genes: glimpses from the young and old . Nat. Rev. Genet. . 4 . 11 . 865–75 . November 2003 . 14634634 . 10.1038/nrg1204 . 33999892 .
Liu . N. . Okamura . K. . Tyler . D. M. . Phillips . Chung . Lai . The evolution and functional diversification of animal microRNA genes . Cell Research . 2008 . 18 . 985–996 . 10.1038/cr.2008.278 . 18711447 . 10 . 2712117 .
McLysaght . Aoife . Hurst . Laurence D. . Open questions in the study of de novo genes: what, how and why . Nature Reviews Genetics . 25 July 2016 . 17 . 9 . 567–578 . 10.1038/nrg.2016.78 . 27452112. 6033249 .
Sawyer . S. A. . Parsch . J. . Zhang . Z.. Hartl . D. L. . Prevalence of positive selection among nearly neutral amino acid replacements in Drosophila . Proceedings of the National Academy of Sciences . 104 . 16 . 2007 . 6504–6510 . 0027-8424 . 10.1073/pnas.0701572104. 17409186 . 1871816 . 2007PNAS..104.6504S . free .
Book: James F.. Crow. Motoo. Kimura. An Introduction to Population Genetics Theory. 1970. Blackburn Press. New Jersey. 9781932846126. [Reprint].
Lynch. Michael. Evolution of the mutation rate. Trends in Genetics. August 2010. 26. 8. 345–352. 10.1016/j.tig.2010.05.003. 20594608. 2910838.
Book: J. B. S. Haldane . 1932 . The Causes of Evolution . Longmans, Green and Co., New York .
M. Kimura . 1980 . Average time until fixation of a mutant allele in a finite population under continued mutation pressure: Studies by analytical, numerical, and pseudo-sampling methods . Proc Natl Acad Sci U S A . 77 . 1 . 522–526 . 10.1073/pnas.77.1.522 . 16592764 . 348304 . 1980PNAS...77..522K . free .
Haldane . J. B. S. . J. B. S. Haldane . 1933 . The Part Played by Recurrent Mutation in Evolution . American Naturalist . 67 . 5–19 . 2457127 . 10.1086/280465 . 708. 84059440 .
10.1016/j.cub.2007.01.051 . 17306543 . 17 . 5 . 452–454 . Protas . Meredith . Regressive evolution in the Mexican cave tetra, Astyanax mexicanus . Current Biology . 2007 . Conrad . M. . Gross . J. B. . Tabin . C. . Borowsky . R . 2570642 . 2007CBio...17..452P .
H. Maughan, J. Masel, C. W. Birky, Jr. and W. L. Nicholson . 2007 . The roles of mutation accumulation and selection in loss of sporulation in experimental populations of Bacillus subtilis . Genetics . 177 . 2 . 937–48 . 10.1534/genetics.107.075663. 17720926 . 2034656 .
Masel . J. . Joanna Masel . King . O. D. . Maughan . H. . The loss of adaptive plasticity during long periods of environmental stasis . 10.1086/510212 . American Naturalist . 169 . 1 . 38–46 . 2007 . 17206583 . 1766558 .
K. Gomez, J. Bertram and J. Masel . 2020 . Mutation bias can shape adaptation in large asexual populations experiencing clonal interference . Proc. R. Soc. B . 287 . 1937 . 20201503 . 10.1098/rspb.2020.1503. 33081612 . 7661309 .
A. V. Cano, H. Rozhonova, A. Stoltzfus, D. M. McCandlish and J. L. Payne . 2022-02-10 . Mutation bias shapes the spectrum of adaptive substitutions . Proc Natl Acad Sci U S A . 119 . 7 . 10.1073/pnas.2119720119. free . 35145034 . 8851560 . 2022PNAS..11919720C .
Stoltzfus . A. . Yampolsky . L. Y. . 2009 . Climbing Mount Probable: Mutation as a Cause of Nonrandomness in Evolution . Journal of Heredity . 100 . 637–647 . 10.1093/jhered/esp048 . 19625453 . 5 . free .
21 . R837–R838 . Masel . J. . Joanna Masel . Genetic drift . Current Biology . 2011 . 10.1016/j.cub.2011.08.007 . 20 . 22032182. free . 2011CBio...21.R837M .
Book: Futuyma, Douglas . Evolutionary Biology . . 1998 . 978-0-87893-189-7 . Glossary.
Book: Avers, Charlotte . 1989 . Process and Pattern in Evolution . Oxford University Press .
188 . 783–785 . Wahl . L. M. . Fixation when N and s Vary: Classic Approaches Give Elegant New Results . Genetics . 2011 . 10.1534/genetics.111.131748 . 4 . 21828279 . 3176088.
Book: Barton . Nicholas H. . Briggs . Derek E. G. . Eisen . Jonathan A. . Goldstein . David B. . Patel . Nipam H. . Evolution . Cold Spring Harbor Laboratory Press . 2007 . 978-0-87969-684-9 . 417.
Book: Futuyma, Douglas . Evolutionary Biology . . 1998 . 978-0-87893-189-7 . 320.
Gillespie . J. H. . Genetic Drift in an Infinite Population: The Pseudohitchhiking Model . Genetics . 155 . 2 . 909–919 . 2000 . 10.1093/genetics/155.2.909 . 10835409 . 1461093.
Book: Provine, William B. . The "Random Genetic Drift" Fallacy . CreateSpace.
Neher . Richard A. . Shraiman . Boris I. . August 2011 . Genetic Draft and Quasi-Neutrality in Large Facultatively Sexual Populations . Genetics . 188 . 4 . 975–996 . 10.1534/genetics.111.128876 . 0016-6731 . 3176096 . 21625002. 1108.1635 .
Buston . P. M. . Pilkington . J. G. . 2007 . Are clownfish groups composed of close relatives? An analysis of microsatellite DNA vraiation in Amphiprion percula . Molecular Ecology . 12 . 733–742 . 12675828 . 3 . 10.1046/j.1365-294X.2003.01762.x . 35546810 . etal.
Repaci . V. . Stow . A. J. . Briscoe . D. A. . 2007 . Fine-scale genetic structure, co-founding and multiple mating in the Australian allodapine bee (Ramphocinclus brachyurus) . Journal of Zoology . 270 . 4 . 687–691 . 10.1111/j.1469-7998.2006.00191.x.
Su . H. . Qu . L.-J. . He . K. . Zhang . Z. . Wang . J. Chen . Z. . Gu . H. . The Great Wall of China: a physical barrier to gene flow? . Heredity . 90 . 3 . 2003 . 212–219 . 0018-067X . 10.1038/sj.hdy.6800237. 12634804 . 13367320 .
Gravel . S. . Population Genetics Models of Local Ancestry . 2012 . 2012arXiv1202.4811G . 1202 . 607–619 . 1202.4811 . 10.1534/genetics.112.139808 . 22491189 . 3374321 . Genetics . 2.
Morjan . C. . Rieseberg . L. . How species evolve collectively: implications of gene flow and selection for the spread of advantageous alleles . Molecular Ecology . 13 . 6 . 1341–56 . 2004 . 15140081 . 10.1111/j.1365-294X.2004.02164.x . 2600545 . 2004MolEc..13.1341M .
Bolnick . Daniel I. . Nosil . Patrik . Natural Selection in Populations Subject to a Migration Load . Evolution . September 2007 . 61 . 9 . 2229–2243 . 10.1111/j.1558-5646.2007.00179.x . 17767592 . 25685919 . free.
Boucher . Yan . Douady . Christophe J. . Papke . R. Thane . Walsh . David A. . Boudreau . Mary Ellen R. . Nesbø. Camilla L. . Case . Rebecca J. . Doolittle . W. Ford. Lateral Gene Transfer and the Origins of Prokaryotic Groups . Annual Review of Genetics . 37 . 1 . 2003 . 283–328 . 0066-4197 . 10.1146/annurev.genet.37.050503.084247. 14616063 .
Walsh . T. . Combinatorial genetic evolution of multiresistance . Current Opinion in Microbiology . 9 . 5 . 476–82 . 2006 . 16942901 . 10.1016/j.mib.2006.08.009.
Kondo . N. . Nikoh . N. . Ijichi . N. . Shimada . M. . Fukatsu . T. . Genome fragment of Wolbachia endosymbiont transferred to X chromosome of host insect . Proceedings of the National Academy of Sciences . 99 . 22 . 2002 . 14280–14285 . 0027-8424 . 10.1073/pnas.222228199. 12386340 . 137875 . 2002PNAS...9914280K . free .
Sprague . G. . Genetic exchange between kingdoms . Current Opinion in Genetics & Development . 1 . 4 . 530–533 . 1991 . 1822285 . 10.1016/S0959-437X(05)80203-5.
Gladyshev . E. A. . Meselson . M. . Arkhipova . I. R. . Massive Horizontal Gene Transfer in Bdelloid Rotifers . Science . 320 . 5880 . 2008 . 1210–1213 . 0036-8075 . 10.1126/science.1156407. 18511688 . 2008Sci...320.1210G . 11862013 .
Baldo . A. . McClure . M. . Evolution and horizontal transfer of dUTPase-encoding genes in viruses and their hosts . Journal of Virology . 73 . 9 . 7710–7721 . 1 September 1999 . 10438861 . 104298 . 10.1128/JVI.73.9.7710-7721.1999 .
Poole . A. . Penny . D. . Evaluating hypotheses for the origin of eukaryotes . BioEssays . 29 . 1 . 74–84 . 2007 . 17187354 . 10.1002/bies.20516.
Weissman . D. B. . Hallatschek . O. . The Rate of Adaptation in Large Sexual Populations with Linear Chromosomes . Genetics . 15 January 2014 . 196 . 4 . 1167–1183 . 10.1534/genetics.113.160705 . 24429280 . 3982688.
Weissman . Daniel B. . Barton . Nicholas H. . McVean . Gil . Limits to the Rate of Adaptive Substitution in Sexual Populations . PLOS Genetics . 7 June 2012 . 8 . 6 . e1002740 . 10.1371/journal.pgen.1002740 . 22685419 . 3369949 . free .
Neher . R. A. . Shraiman . B. I. . Fisher . D. S. . Rate of Adaptation in Large Sexual Populations . Genetics . 30 November 2009 . 184 . 2 . 467–481 . 10.1534/genetics.109.109009. 19948891 . 2828726 . 1108.3464 .
Desai . Michael M. . Fisher . Daniel S. . Beneficial Mutation Selection Balance and the Effect of Linkage on Positive Selection . Genetics . 176 . 3 . 1759–1798 . 2007 . 10.1534/genetics.106.067678 . 17483432 . 1931526.
Book: Lewontin, R. C. . The genetic basis of evolutionary change . 1973 . Columbia University Press . New York . 978-0231033923 . [4th printing.] .
Ellegren . Hans . Galtier . Nicolas . Determinants of genetic diversity . Nature Reviews Genetics . 6 June 2016 . 17 . 7 . 422–433 . 10.1038/nrg.2016.58 . 27265362 . 23531428 .
Corbett-Detig . Russell B. . Hartl . Daniel L. . Sackton . Timothy B. . Barton . Nick H. . Natural Selection Constrains Neutral Diversity across A Wide Range of Species . PLOS Biology . 10 April 2015 . 13 . 4 . e1002112 . 10.1371/journal.pbio.1002112. 25859758 . 4393120 . free .
Sung . W. . Ackerman . M. S. . Miller . S. F. . Doak . T. G. . Lynch . M. . Drift-barrier hypothesis and mutation-rate evolution . Proceedings of the National Academy of Sciences . 17 October 2012 . 109 . 45 . 18488–18492 . 10.1073/pnas.1216223109. 3494944 . 23077252 . 2012PNAS..10918488S . free .
Charlesworth . J. Eyre-Walker . 2008 . The McDonald–Kreitman Test and Slightly Deleterious Mutations . Molecular Biology and Evolution . 25 . 6 . 1007–1015 . 10.1093/molbev/msn005. 18195052 . free .
Eyre-Walker . A. . 2006 . The genomic rate of adaptive evolution . Trends in Ecology and Evolution . 21 . 10 . 569–575 . 10.1016/j.tree.2006.06.015. 16820244 . 2006TEcoE..21..569E .
Smith . N. G. C. . Eyre-Walker . A. . 10.1038/4151022a . Adaptive protein evolution in Drosophila . Nature . 415 . 6875 . 1022–1024 . 2002 . 11875568 . 2002Natur.415.1022S . 4426258 .
Hahn . M. W. . 2008 . Toward a selection theory of molecular evolution . Evolution . 255–265 . 62 . 2 . 10.1111/j.1558-5646.2007.00308.x . 18302709. 5986211 . free .
Pritchard . J. K. . Stephens . M. . Donnelly . P. . June 2000 . Inference of population structure using multilocus genotype data . Genetics . 155 . 2 . 945–959 . 10.1093/genetics/155.2.945 . 0016-6731 . 1461096 . 10835412.
Verity . Robert . Nichols . Richard A. . August 2016 . Estimating the Number of Subpopulations (K) in Structured Populations . Genetics . 203 . 4 . 1827–1839 . 10.1534/genetics.115.180992 . 0016-6731 . 4981280 . 27317680.
Manlik . Oliver . Chabanne . Delphine . Daniel . Claire . Bejder . Lars . Allen . Simon J. . Sherwin . William B. . Demography and genetics suggest reversal of dolphin source–sink dynamics, with implications for conservation. . Marine Mammal Science . 35 . 3 . 732–759 . 13 November 2018. 10.1111/mms.12555. 92108810 .
Gutenkunst . Ryan N. . Hernandez . Ryan D. . Williamson . Scott H. . Bustamante . Carlos D. . McVean . Gil . Inferring the Joint Demographic History of Multiple Populations from Multidimensional SNP Frequency Data . PLOS Genetics . 23 October 2009 . 5 . 10 . e1000695 . 10.1371/journal.pgen.1000695 . 19851460 . 2760211. 0909.0925 . free .
Sniegowski . P. . Gerrish P.; Johnson T;. Shaver A. . The evolution of mutation rates: separating causes from consequences . BioEssays . 22 . 12 . 1057–1066 . 2000 . 11084621 . 10.1002/1521-1878(200012)22:12<1057::AID-BIES3>3.0.CO;2-W. 36771934 .
Lynch . Michael . Conery . John S. . 2003 . The origins of genome complexity . . 302 . 1401–1404 . 14631042 . 10.1126/science.1089370 . 5649 . 2003Sci...302.1401L . 10.1.1.135.974 . 11246091 .
Rajon . E. . Masel . J. . Joanna Masel . Evolution of molecular error rates and the consequences for evolvability . Proceedings of the National Academy of Sciences . 3 January 2011 . 108 . 3 . 1082–1087 . 10.1073/pnas.1012918108 . 21199946 . 3024668. 2011PNAS..108.1082R . free .