Effective population size explained

The effective population size (N_e) is the size of an idealised population that would experience the same rate of genetic drift as the real population. The effective population size is normally smaller than the census population size N, partly because chance events prevent some individuals from breeding, and partly due to background selection and genetic hitchhiking. Idealised populations are based on unrealistic but convenient assumptions including random mating, rarity of natural selection such that each gene evolves independently, and constant population size.^[1]

The coalescent effective population size is commonly estimated as within-species genetic diversity divided by four times the mutation rate \mu, because in an idealised population the heterozygosity is equal to 4N\mu. In a population with selection at many loci and abundant linkage disequilibrium, the coalescent effective population size may not reflect the census population size at all, or may reflect its logarithm.
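For the idealised case, the relation between heterozygosity and 4N\mu can be inverted to give a rough estimate of N_e from measured diversity. The Python sketch below illustrates that arithmetic only. The function name and the input values are hypothetical and were chosen for illustration; they are not taken from any study cited here.

```python
def coalescent_effective_size(diversity, mutation_rate):
    """Invert heterozygosity = 4 * N * mu to get N_e = diversity / (4 * mu).

    Assumes an idealised diploid population with no selection.
    """
    if mutation_rate <= 0:
        raise ValueError("mutation_rate must be positive")
    return diversity / (4 * mutation_rate)


# Hypothetical inputs, for illustration only.
diversity = 0.001        # per-site heterozygosity (nucleotide diversity)
mutation_rate = 1.25e-8  # per-site mutation rate per generation

n_e = coalescent_effective_size(diversity, mutation_rate)
print(round(n_e))  # on the order of 20,000 for these inputs
```

Because the estimate rests on the idealised relation, it inherits the caveats above: with selection at many linked loci, the resulting number may bear little relation to the census size.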

The concept of effective population size was introduced in the field of population genetics in 1931 by the American geneticist Sewall Wright.^[2] ^[3] Some versions of the effective population size are used in wildlife conservation.

Overview: Types of effective population size

Depending on the quantity of interest, effective population size can be defined in several ways. Ronald Fisher and Sewall Wright originally defined it as "the number of breeding individuals in an idealised population that would show the same amount of dispersion of allele frequencies under random genetic drift or the same amount of inbreeding as the population under consideration". More generally, an effective population size may be defined as the number of individuals in an idealised population that has a value of any given population genetic quantity that is equal to the value of that quantity in the population of interest. The two population genetic quantities identified by Wright were the one-generation increase in variance across replicate populations (variance effective population size) and the one-generation change in the inbreeding coefficient (inbreeding effective population size). These two are closely linked, and derived from F-statistics, but they are not identical.^[4]

Today, the effective population size is usually estimated empirically with respect to the sojourn or coalescence time, estimated as the within-species genetic diversity divided by the mutation rate, yielding a coalescent effective population size.^[5] Another important effective population size is the selection effective population size 1/s_critical, where s_critical is the critical value of the selection coefficient at which selection becomes more important than genetic drift.^[6]

Empirical measurements

In Drosophila populations of census size 16, the variance effective population size has been measured as equal to 11.5.^[7] This measurement was achieved through studying changes in the frequency of a neutral allele from one generation to another in over 100 replicate populations.

For coalescent effective population sizes, a survey of publications on 102 mostly wildlife animal and plant species yielded 192 N_e/N ratios. Seven different estimation methods were used in the surveyed studies, and the ratios accordingly ranged widely, from 10^-6 for Pacific oysters to 0.994 for humans, with an average of 0.34 across the examined species. Based on these data, the survey's authors then estimated more comprehensive ratios that account for fluctuations in population size, variance in family size and unequal sex ratio. These ratios average only 0.10-0.11.^[8]

A genealogical analysis of human hunter-gatherers (Eskimos) determined the effective-to-census population size ratio for haploid (mitochondrial DNA, Y chromosomal DNA), and diploid (autosomal DNA) loci separately: the ratio of the effective to the census population size was estimated as 0.6–0.7 for autosomal and X-chromosomal DNA, 0.7–0.9 for mitochondrial DNA and 0.5 for Y-chromosomal DNA.^[9]

Variance effective size

In the Wright-Fisher idealized population model, the conditional variance of the allele frequency p', given the allele frequency p in the previous generation, is

\operatorname{var}(p' \mid p) = {p(1-p) \over 2N}.

Let \widehat{\operatorname{var}}(p' \mid p) denote the same, typically larger, variance in the actual population under consideration. The variance effective population size N_e^{(v)} is defined as the size of an idealized population with the same variance. This is found by substituting \widehat{\operatorname{var}}(p' \mid p) for \operatorname{var}(p' \mid p) and solving for N, which gives

N_e^{(v)} = {p(1-p) \over 2\,\widehat{\operatorname{var}}(p' \mid p)}.
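As a minimal sketch of how this definition could be applied, the Python snippet below estimates the conditional variance of p' from many replicate populations that all start at the same allele frequency p, then plugs it into the formula for N_e^{(v)}. The replicates are simulated with idealised Wright-Fisher binomial sampling, so the estimate should land near the census size N. The function names, seed, replicate count and parameter values are all assumptions made for illustration.

```python
import random


def variance_effective_size(p, next_freqs):
    """Estimate N_e^(v) = p(1-p) / (2 * var(p' | p)) from replicate populations."""
    # Conditional variance of p' measured about the known starting frequency p.
    var_hat = sum((q - p) ** 2 for q in next_freqs) / len(next_freqs)
    return p * (1 - p) / (2 * var_hat)


def wright_fisher_step(p, n_diploid, rng):
    """One generation of drift: binomial sampling of 2N gene copies."""
    copies = 2 * n_diploid
    count = sum(rng.random() < p for _ in range(copies))
    return count / copies


rng = random.Random(1)   # arbitrary fixed seed, for reproducibility only
p0, census_n = 0.5, 16   # illustrative starting frequency and census size
freqs = [wright_fisher_step(p0, census_n, rng) for _ in range(5000)]

estimate = variance_effective_size(p0, freqs)
print(round(estimate, 1))  # close to 16 for an idealised population
```

In a real population the observed variance is typically larger than the idealised value, so the same calculation would return an estimate below the census size.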

Theoretical examples

In the following examples, one or more of the assumptions of a strictly idealised population are relaxed, while other assumptions are retained. The variance effective population size of the more relaxed population model is then calculated with respect to the strict model.

Variations in population size

Population size varies over time. If there are t non-overlapping generations, the effective population size is given by the harmonic mean of the population sizes:^[10]

{1 \over N_e} = {1 \over t} \sum_{i=1}^{t} {1 \over N_i}
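The harmonic mean is dominated by the smallest values, so a single bottleneck generation can pull the effective size far below the arithmetic mean. The Python sketch below shows this with a hypothetical series of four generations; the function name and the population sizes are assumptions chosen for illustration.

```python
def harmonic_mean_effective_size(sizes):
    """Return N_e with 1/N_e = (1/t) * sum(1/N_i) over t generations."""
    if not sizes or any(n <= 0 for n in sizes):
        raise ValueError("sizes must be a non-empty list of positive numbers")
    t = len(sizes)
    return t / sum(1 / n for n in sizes)


# Hypothetical sizes: three large generations and one bottleneck of 10.
sizes = [1000, 1000, 10, 1000]

n_e = harmonic_mean_effective_size(sizes)
arithmetic_mean = sum(sizes) / len(sizes)

print(round(n_e, 1))    # harmonic mean, far below the arithmetic mean
print(arithmetic_mean)  # 752.5
```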

For example, say the population size was N = 10, 100, 50, 80, 20, 500 for six generations (t = 6). Then the effective population size is the harmonic mean of these, giving:

{1\overN_e}

={\begin{matrix}

	1
	10