Genetic map function explained

In genetics, mapping functions are used to model the relationship between map distance (measured in map units or centimorgans) between markers and recombination frequency between markers. One utility of this is that it allows values to be obtained for genetic distances, which is typically not estimable, from recombination fractions, which typically are.[1]

The simplest mapping function is the Morgan Mapping Function, eponymously devised by Thomas Hunt Morgan. Other well-known mapping functions include the Haldane Mapping Function introduced by J. B. S. Haldane in 1919,[2] and the Kosambi Mapping Function introduced by Damodar Dharmananda Kosambi in 1944.[3] [4] Few mapping functions are used in practice other than Haldane and Kosambi. The main difference between them is in how crossover interference is incorporated.[5]

Morgan Mapping Function

Where d is the distance in map units, the Morgan Mapping Function states that the recombination frequency r can be expressed as

r=d

. This assumes that one crossover occurs, at most, in an interval between two loci, and that the probability of the occurrence of this crossover is proportional to the map length of the interval.

Where d is the distance in map units, the recombination frequency r can be expressed as:

r=

1
2

[1-(1-2d)]=d

The equation only holds when

1
2

\geqd\geq0

as, otherwise, recombination frequency would exceed 50%. Therefore, the function cannot approximate recombination frequencies beyond short distances.

Haldane Mapping Function

Overview

Two properties of the Haldane Mapping Function is that it limits recombination frequency up to, but not beyond 50%, and that it represents a linear relationship between the frequency of recombination and map distance up to recombination frequencies of 10%.[6] It also assumes that crossovers occur at random positions and that they do so independent of one another. This assumption therefore also assumes no crossover interference takes place;[7] but using this assumption allows Haldane to model the mapping function using a Poisson distribution.

Definitions

Formula

r=

1
2

(1-e-2d)

Inverse

d=-

1
2

ln(1-2r)

Kosambi Mapping Function

Overview

The Kosambi mapping function was introduced to account for the impact played by crossover interference on recombination frequency. It introduces a parameter C, representing the coefficient of coincidence, and sets it equal to 2r. For loci which are strongly linked, interference is strong; otherwise, interference decreases towards zero. Interference declines according to the linear function i = 1 - 2r.[8]

Formula

r=

1
2

\tanh(2d)=

1
2
e4d-1
e4d+1

Inverse

d=

1
2

\tanh-1(2r)=

1ln(
4
1+2r
1-2r

)

Comparison and application

Below 10% recombination frequency, there is little mathematical difference between different mapping functions and the relationship between map distance and recombination frequency is linear (that is, 1 map unit = 1% recombination frequency). When genome-wide SNP sampling and mapping data is present, the difference between the functions is negligible outside of regions of high recombination, such as recombination hotspots or ends of chromosomes.

While many mapping functions now exist,[9] [10] [11] in practice functions other than Haldane and Kosambi are rarely used. More specifically, the Haldane function is preferred when distance between markers is relatively small, whereas the Kosambi function is preferred when distances between markers is larger and crossovers need to be accounted for.[12]

Further reading

Notes and References

  1. Book: Broman . Karl W. . A guide to QTL mapping with R/qtl . Sen . Saunak . 2009 . Springer . 978-0-387-92124-2 . Statistics for biology and health . Dordrecht . 14 . 669122118.
  2. Haldane . J.B.S. . 1919 . The combination of linkage values, and the calculation of distances between the loci of linked factors . Journal of Genetics . 8 . 29 . 299–309.
  3. Kosambi . D. D. . 1943 . The Estimation of Map Distances from Recombination Values . Annals of Eugenics . en . 12 . 1 . 172–175 . 10.1111/j.1469-1809.1943.tb02321.x . 2050-1420.
  4. Book: Wu . Rongling . Statistical genetics of quantitative traits: linkage, maps, and QTL . Ma . Chang-Xing . Casella . George . 2007 . Springer . 978-0-387-20334-8 . New York . 65 . 141385359.
  5. Peñalba . Joshua V. . Wolf . Jochen B. W. . 2020 . From molecules to populations: appreciating and estimating recombination rate variation . Nature Reviews Genetics . en . 21 . 8 . 476–492 . 10.1038/s41576-020-0240-1 . 1471-0064.
  6. Web site: mapping function . 2024-04-29 . Oxford Reference . en . 10.1093/oi/authority.20110803100132641?rskey=srzx3w&result=6. 2024-04-30 .
  7. Book: Mammalian genomics . 2005 . CABI Pub . 978-0-85199-910-4 . Ruvinsky . Anatoly . Wallingford, Oxfordshire, UK ; Cambridge, MA, USA . 15 . Graves . Jennifer A. Marshall.
  8. Book: Hartl . Daniel L. . Genetics: analysis of genes and genomes . Jones . Elizabeth W. . 2005 . Jones and Bartlett . 978-0-7637-1511-3 . 7th . Sudbury, Mass. . 168.
  9. Crow . J F . 1990 . Mapping functions. . Genetics . en . 125 . 4 . 669–671 . 10.1093/genetics/125.4.669 . 1943-2631 . 1204092 . 2204577.
  10. Felsenstein . Joseph . 1979 . A Mathematically Tractable Family of Genetic Mapping Functions with Different Amounts of Interference . Genetics . en . 91 . 4 . 769–775 . 10.1093/genetics/91.4.769 . 1216865 . 17248911.
  11. Pascoe . L. . Morton . N.E. . 1987 . The use of map functions in multipoint mapping . American Journal of Human Genetics . 40 . 2 . 174–183. 3565379 . 1684067 .
  12. Book: Handbook of computational molecular biology . 2006 . CRC Press . 978-1-58488-406-4 . Aluru . Srinivas . 17-10–17-11 .