Genetic studies on Gujarati people explained

The study of the genetics and archaeogenetics of the Gujarati people of India aims at uncovering these people's genetic history. According to the 1000 Genomes Project, "Gujarati" is a general term used to describe people who trace their ancestry to the region of Gujarat, located in the northwestern part of the Indian subcontinent, and who speak the Gujarati language, an Indo-European language.[1] They have some genetic commonalities as well as differences with other ethnic groups of India.

Autosomal DNA

DNA studies

Our analysis of haplotype similarity at the SLC24A5 region [which affects skin pigmentation] between the Gujarati Indians also indicated greater degree of sharing with the southern Europeans than with the south Tamil Indians.

One of the GIH subgroups fall outside the main gradient of Indian groups, suggesting that they harbor substantial ancestry that is not a simple mixture of ASI and ANI. A speculative hypothesisis that some Gujarati groups descend from the founders of the “Gurjara Pratihara” empire, which is thought to have been founded by Central Asian invaders in the 7th century A.D. and to have ruled parts of northwest India from the 7-12th centuries. I. Karve noted that endogamous groups with names like “Gurjar” are now distributed throughout the northwest of the subcontinent, and hypothesized that that they likely trace their names to this invading group.

Autosomal DNA components

The 1000 Genomes Project collected 117 samples of unrelated Gujarati people living in Houston, Texas (abbreviated GIH). These samples were analyzed in a 2016 study by South African and Indian geneticists (Sengupta et al., 2016), who divided them into two subgroups, and used admixture to estimate the proportion of inferred ancestral component for each subgroup. Based on genetic and geographical affinities, Gujaratis and Gujarati Brahmins were placed in the Northwest Indian subcontinent in supplementary table 1 of this study.[12]

Populations Ancestral North Indian (ANI) Ancestral South Indian (ASI) Ancestral Austroasiatic (AAA) Ancestral Tibeto-Burman (ATB)Andaman archipelago (A&N)
Gujaratis (1)76.2% 9.4% 13.8% 0.3% 0.2%
Gujaratis (2)80.7% 7.1% 8.5% 3.1% 0.5%

A separate study by geneticists from West Bengal (Basu et al., 2016) estimated the following ancestral proportions for Gujarati Brahmins, and classified them in northwest India:[13]

Most South Asians carry both the Ancestral North Indian (ANI) component, which is closely related to those in Central Asia, West Asia and Europe, and the Ancestral South Indian (ASI) component, which is restricted to South Asia.[14]

mtDNA

Haplogroup R

Sub-haplogroup U

Over 33% of all mitochondrial genetic markers of the population of Gujarat originate from West Eurasia. The mitochondrial DNA (mtDNA) sub-haplogroup U7 is common in Gujarat. It is found in over 12% of the population, higher than in Punjab (9%), Pakistan, Iran, Afghanistan, or anywhere else. U7 is only found in frequencies between 0% and 0.9% in other populations of India. The study by Quintana-Murci et al. (2004) of 34 Gujaratis found the presence of U2a and U7 sub-haplogroups (8.8%) followed by U2b (5.9%) and U2c (2.9%).[15] [16]

Other sub-haplogroups of R

The study by Quintana-Murci et al. (2004) of Gujaratis found the presence of R* (8.8%), H (5.9%), J1(2.9%).

Other minor haplogroups of N

The study by Quintana-Murci et al. (2004) found presence of haplogroup W (8.8%) in Gujaratis as well as other northwestern states like Punjab and Kashmir. Haplogroup W is descended from the haplogroup N2. They also found N* (2.9%) in Gujaratis.[17]

The mtDNA haplogroup HV is found in at least one Patel Gujarati family which is also found in some Tamil Brahmins.

Y chromosome

The Y chromosome DNA (Y-DNA) of some Gujarati males has haplogroup R1a, specifically R-L657 (the L657 subclade). Other has R-Y874 (the Y874 subclade of R1a1a1) which also found in Telugu people. Other common finding is haplogroup J2b2 (J-M241). The SNP mutation referred to as Y951 is also found in Gujarati people as well as Punjabis. The haplogroup C-M356 (C1b, previously called C5a, within which C-P92 is a subclade defined by the P92 mutation) is also found among some Gujarati people.[17] [18]

A set of 48 bi-allelic markers on the non-recombining region of Y chromosome (NRY) were analysed in 284 males; representing nine Indo-European speaking tribal populations of South Gujarat. The genetic structure of the populations revealed that none of these groups was overtly admixed or completely isolated. However, elevated haplogroup diversity and FST value point towards greater diversity and differentiation which suggests the possibility of early demographic expansion of the study groups. The phylogenetic analysis revealed 13 paternal lineages, of which six haplogroups: C5, H1a*, H2, J2, R1a1* and R2 accounted for a major portion of the Y chromosome diversity. The higher frequency of the six haplogroups and the pattern of clustering in the populations indicated overlapping of haplogroups with West and Central Asian populations. Other analyses undertaken on the population affiliations revealed that the Indo-European speaking populations along with the Dravidian speaking groups of southern India have an influence on the tribal groups of Gujarat. The vital role of geography in determining the distribution of Y lineages was also noticed. This implies that although language plays a vital role in determining the distribution of Y lineages, the present day linguistic affiliation of any population in India for reconstructing the demographic history of the country should be considered with caution.[19]

R1a origins in Gujarat

Following the 2010 discovery that the oldest strain of the R1a1a branch was concentrated in the Gujarat-Sindh-Western Rajasthan region, it has been suggested that this location may have been the origin of this genetic group. The genetic similarities between North Indians and Eastern Europeans are possibly due to having a common ancestor. R1b, the most common haplogroup in Western Europe, has a relatively low concentration in India, and it possibly originated in the Persian Gulf. It is possible that these haplogroups originated from two major genetic dispersals from the Persian Gulf-Makran-Gujarat region.[20]

Genetics and archaeogenetics of Gujarati groups

Genetics

Inverted Y chromosome polymorphism in Gujarati Muslims of South AfricaThe pericentric inversion of the Y-chromosome (inv(Y)) is a rare chromosomal heteromorphism which is hereditary. It has no clinical significance or reproductive disadvantage.[21] It is studied in Gujarati Muslims of South Africa, 8 inv(Y) men opposed to 9 normal men. The p49a/TaqI and p49a/PvuII haplotypes were determined and found that 8 inv(Y) possessed identical TaqI and PvuII where as 7 different TaqI and 8 different PvuII haplotypes observed in the 9 normal men. So It is concluded that inv(Y) has common genetic origin in Gujarati Muslims of South Africa. The origin is traced to Kholvad, a small village near Surat, and some neighbouring villages. It is probably originated through random genetic drift in reproductively isolated community, maintained by strict endogamy based on religious and linguistic affiliations.[22] [23]
Vitiligo in GujaratisVitiligo is a long term skin condition characterized by patches of the skin losing their pigment. It affects 1—2% of the population of world while 0.5—2.5% population of India. But Gujarat and Rajasthan has the highest prevalence (~8.8%).[24] Rasheedunnisa Begum et al. conducted genetic studies of over 1500 patients from Gujarat and found an SNP variation in the autosomal DNA of Gujaratis which make them more prone to Vitiligo.[25] [26]
Deletion β° thalassaemia

A study suggested the Indian deletion β° type (600 bp deletion involving the 3’ end) thalassaemia has single origin as all 23 patients of it in study were from either Sindh or Gujarat who had identical haplotypes.[27]

Meckel syndromeThe Leicestershire Perinatal Mortality Survey for the years 1976 to 1982 found high incidences of Meckel syndrome in Gujarati immigrants.[28]

Archeogenetics

Impact of cultural marriage practices on genetic variationIn India, caste is a form of social stratification characterized by endogamy. Within caste, there are endogamous groups known as gol (marriage circle) by Gujarati people. In it, there are small number of exogamous lineages known as gotra. A person can not marry within gotra as well as outside gol. Gujarati Patels practice this "exogamic endogamy" which is also found elsewhere in India. One such gol known as Chha Gaam Patels (CGP) include people from six villages of Charotar region of Gujarat. A study found their genetic similarities as well as confirms their patrilocal and patrilineal practices within the group. It also confirms low-level of female gene inflow within the group. The study illustrates impact of culture marriage practice on genetic variation in Indian population.[29]
ParsisThe Parsis have sharp contrast between genetic data obtained from mitochondrial DNA (mtDNA), a maternal component, and Y-chromosome DNA (Y-DNA), a paternal component. According to Y-DNA, they resemble the Iranian population, which supports their historical origin from Iran. But about 60% of their maternal gene pool originates from South Asian haplogroups, which is just 7% in Iranians. Parsis have high frequency of haplogroup M of mtDNA (55%), similar to Indians, which is just 1.7% in combined Iranian sample. Due to high diversity in Y-DNA and mtDNA lineages, the strong drift effect is unlikely even though they had small population. The studies suggest a male-mediated migration of Parsi ancestors from Iran to Gujarat where they admixed with local female population which ultimately resulting in loss of Iranian mtDNA.[30]
Dawoodi BohrasDawoodi Bohras of Gujarat show 47% genetic contribution from West Asia, especially Iran; followed by 30% from Arabia and from 23% closest Hindu parental populations. This shows considerable genetic flow from West Asia in them.[31]
Gujarati tribalFour tribal populations (Chaudhari, Vasava, Kotwalia and Gamit) of the Surat district in South Gujarat were studied for the distribution of 22 polymorphic systems of the blood. The studies suggested that they are considerably hetero-genetic and have small genetic difference among them which is due to genetic variation. It also showed that Vasava and Kotwalia are closely related genetically, and Chaudhari and Gamit are different from them.[32]
Genetic distance(Fst) between Gujaratis and other South AsiansAccording to a 2016 study on "Population Stratification and Underrepresentation of Indian Subcontinent Genetic Diversity in the 1000 Genomes Project Dataset", It shows us that, Gujaratis (GIH Samples in 1000 genomes project) are closer to PJL (Punjabis from Lahore) samples (Fst=0.0036) and ITU (Telugu) samples (Fst=0.0039) than they are to STU (Sri Lankan Tamil) samples (Fst=0.0044) and BEB (Bengali) samples (Fst=0.0045).[12]

See also

Notes and References

  1. Web site: Gujarati Indians in Houston, Texas, USA [GIH]]. Coriell Institute for Medical Research.
  2. Silva. Marina. Oliveira. Marisa. Vieira. Daniel. Brandão. Andreia. Rito. Teresa. Pereira. Joana B.. Fraser. Ross M.. Hudson. Bob. Gandini. Francesca. Edwards. Ceiridwen. Pala. Maria. Koch. John. Wilson. James F.. Pereira. Luísa. Richards. Martin B.. Soares. Pedro. A genetic chronology for the Indian Subcontinent points to heavily sex-biased dispersals. BMC Evolutionary Biology. 17. 1. 88. 2017. 1471-2148. 10.1186/s12862-017-0936-9. 28335724. 5364613 . free .
  3. Pandit. Lekha. D'Cunha. Anitha. Malli. Chaithra. Genetic variations in the Dravidian population of South West coast of India: Implications in designing case-control studies. Indian Journal of Medical Research. 145. 6. 2017. 753–757. 0971-5916. 10.4103/ijmr.IJMR_1435_15. 29067977. 5674545 . free .
  4. Ayub. Qasim. Mezzavilla. Massimo. Pagani. Luca. Haber. Marc. Mohyuddin. Aisha. Khaliq. Shagufta. Mehdi. Syed Qasim. Tyler-Smith. Chris. The Kalash Genetic Isolate: Ancient Divergence, Drift, and Selection. The American Journal of Human Genetics. 96. 5. 2015. 775–783. 0002-9297. 10.1016/j.ajhg.2015.03.012. 25937445. 4570283.
  5. Ali. Mohammad. Liu. Xuanyao. Pillai. Esakimuthu. Chen. Peng. Khor. Chiea-Chuen. Ong. Rick Twee-Hee. Teo. Yik-Ying. Characterizing the genetic differences between two distinct migrant groups from Indo-European and Dravidian speaking populations in India. BMC Genetics. 15. 1. 2014. 86. 1471-2156. 10.1186/1471-2156-15-86. 25053360. 4120727 . free .
  6. Pemberton . Trevor J. . DeGiorgio . Michael . Rosenberg . Noah A. . Population Structure in a Comprehensive Genomic Data Set on Human Microsatellite Variation . G3: Genes, Genomes, Genetics. 3 . 5 . 2013 . 891–907 . 2160-1836 . 10.1534/g3.113.005728 . 23550135 . 3656735 .
  7. Ellegren . Hans . Magalhães . Tiago R. . Casey . Jillian P. . Conroy . Judith . Regan . Regina . Fitzpatrick . Darren J. . Shah . Naisha . Sobral . João . Ennis . Sean . HGDP and HapMap Analysis by Ancestry Mapper Reveals Local and Global Population Relationships . PLOS ONE . 7 . 11 . 2012 . e49438 . 1932-6203 . 10.1371/journal.pone.0049438. 23189146 . 3506643 . 2012PLoSO...749438M . free .
  8. Metspalu. Mait. Romero. Irene Gallego. Yunusbayev. Bayazit. Chaubey. Gyaneshwer. Mallick. Chandana Basu. Hudjashov. Georgi. Nelis. Mari. Mägi. Reedik. Metspalu. Ene. Remm. Maido. Pitchappan. Ramasamy. Singh. Lalji. Thangaraj. Kumarasamy. Villems. Richard. Kivisild. Toomas . Shared and Unique Components of Human Population Structure and Genome-Wide Signals of Positive Selection in South Asia . The American Journal of Human Genetics . 89 . 6 . 2011 . 731–744 . 0002-9297 . 10.1016/j.ajhg.2011.11.010. 22152676. 3234374.
  9. Xing . Jinchuan . Watkins . W Scott . Hu . Ya . Huff . Chad D . Sabo . Aniko . Muzny . Donna M . Bamshad . Michael J. Gibbs . Richard A . Jorde . Lynn B . Yu . Fuli . Genetic diversity in India and the inference of Eurasian population expansion . Genome Biology . 11 . 11 . 2010 . R113 . 1465-6906 . 10.1186/gb-2010-11-11-r113. 21106085 . 3156952 . free .
  10. Paschou. P.. Lewis. J.. Javed. A.. Drineas. P.. Ancestry informative markers for fine-scale individual assignment to worldwide populations. Journal of Medical Genetics. 47. 12. 2010. 835–847. 0022-2593. 10.1136/jmg.2010.078212. 20921023. 10.1.1.846.8202. 6432430.
  11. Reich. David. Thangaraj. Kumarasamy. Patterson. Nick. Price. Alkes L.. Singh. Lalji. Reconstructing Indian population history. Nature. 461. 7263. 2009. 489–494. 0028-0836. 10.1038/nature08365. 19779445. 2842210. 2009Natur.461..489R.
  12. Sengupta. Dhriti. Choudhury. Ananyo. Basu. Analabha. Ramsay. Michèle. Population Stratification and Underrepresentation of Indian Subcontinent Genetic Diversity in the 1000 Genomes Project Dataset. Genome Biology and Evolution. 8. 11. 2016. 3460–3470. 1759-6653. 10.1093/gbe/evw244. 27797945. 5203783.
  13. Basu. Analabha. Sarkar-Roy. Neeta. Majumder. Partha P.. Genomic reconstruction of the history of extant populations of India reveals five distinct ancestral components and a complex structure. Proceedings of the National Academy of Sciences. 113. 6. 2016. 1594–1599. 0027-8424. 10.1073/pnas.1513197113. 26811443. 4760789. 2016PNAS..113.1594B. free.
  14. 10.1016/j.ajhg.2013.07.006 . Genetic Evidence for Recent Population Mixture in India . 2013 . Moorjani . Priya . Thangaraj . Kumarasamy . Patterson . Nick . Lipson . Mark . Loh . Po-Ru . Govindaraj . Periyasamy . Berger . Bonnie. Bonnie Berger . Reich . David . Singh . Lalji . The American Journal of Human Genetics . 93 . 3 . 422–438 . 1 . 23932107 . 3769933.
  15. Metspalu. Mait. Kivisild. Toomas. Metspalu. Ene. Parik. Jüri. Hudjashov. Georgi. Kaldma. Katrin. Serk. Piia. Karmin. Monika. Behar. Doron M.. 2004-01-01. Most of the extant mtDNA boundaries in South and Southwest Asia were likely shaped during the initial settlement of Eurasia by anatomically modern humans. BMC Genetics. 5. 26. 10.1186/1471-2156-5-26. 1471-2156. 516768. 15339343 . free .
  16. Maji. Suvendu. Krithika. S.. Vasulu. T. S.. 2008. Distribution of mitochondrial DNA macrohaplogroup N in India with special reference to haplogroup R and its sub-haplogroup U.. International Journal of Human Genetics. 8. 1–2. 85–96. 10.1080/09723757.2008.11886022. 14231815.
  17. Web site: Gress . Priti Chitnis . Gujarati Genetics - DNA of Gujaratis of India . Khazaria.com . 31 August 2004 . 10 March 2017.
  18. Rootsi. Siiri. Behar. Doron M.. Järve. Mari. Lin. Alice A.. Myres. Natalie M.. Passarelli. Ben. Poznik. G. David. Tzur. Shay. Sahakyan. Hovhannes. 2013-12-17. Phylogenetic applications of whole Y-chromosome sequences and the Near Eastern origin of Ashkenazi Levites. Nature Communications. en. 4. 2928. 10.1038/ncomms3928. 2041-1723. 3905698. 24346185. 2013NatCo...4.2928R.
  19. Khurana. Priyanka. Aggarwal. Aastha. Mitra. Siuli. Italia. Yazdi M.. Saraswathy. Kallur N.. Chandrasekar. Adimoolam. Kshatriya. Gautam K.. 2014-03-10. Y Chromosome Haplogroup Distribution in Indo-European Speaking Tribes of Gujarat, Western India. PLOS ONE. 9. 3. e90414. 10.1371/journal.pone.0090414. 1932-6203. 3948632. 24614885. 2014PLoSO...990414K. free.
  20. Book: Sanjeev Sanyal. Sowmya Rajendran. The Incredible History of India's Geography. 28 November 2017. Penguin Random House India Private Limited. 978-93-5118-932-9. 24.
  21. Verma. Ram S.. Rodriguez. Jorge. Dosik. Harvey. 1982-05-01. The clinical significance of pericentric inversion of the human Y chromosome: a rare "third" type of heteromorphism. Journal of Heredity. en. 73. 3. 236–238. 10.1093/oxfordjournals.jhered.a109627. 6212612. 0022-1503.
  22. Spurdle. Amanda. Jenkins. Trefor. The Inverted Y-Chromosome Polymorphism in the Gujarati Muslim Indian Population of South Africa Has a Single Origin . Human Heredity . 42 . 5 . 330–332 . 10.1159/000154093. 1459580. 1992.
  23. Bernstein . R. . Wadee . A. . Rosendorff . J. . Wessels . A. . Jenkins . T. . Inverted Y chromosome polymorphism in the Gujerati Muslim Indian population of South Africa . Human Genetics . 74 . 3 . 1986 . 223–9 . 10.1007/bf00282538 . 3781558 . 38020547 .
  24. Laddha . Naresh C. . Dwivedi . Mitesh . Shajil . E.M. . Prajapati . Hiral . Marfatia . Y.S. . Begum . Rasheedunnisa . Association of PTPN22 1858C/T polymorphism with vitiligo susceptibility in Gujarat population . Journal of Dermatological Science . 49 . 3 . 2008 . 260–262 . 10.1016/j.jdermsci.2007.10.002 . 18037273 .
  25. Web site: DNA variation that makes Gujaratis susceptible to vitiligo identified . The Times of India . 5 December 2014 . 10 March 2017.
  26. EM . Shajil . Laddha . Naresh C. . Chatterjee . Sreejatha . Gani . Amina R. . Malek . Reem A. . Shah . Bela J. . Begum . Rasheedunnisa . Association of catalase T/C exon 9 and glutathione peroxidase codon 200 polymorphisms in relation to their activities and oxidative stress with vitiligo susceptibility in Gujarat population . Pigment Cell Research . 20 . 5 . 2007 . 405–407 . 10.1111/j.1600-0749.2007.00406.x . 17850515 . 29779269 .
  27. Thein. S. L.. Old. J. M.. Wainscoat. J. S.. Petrou. M.. Modell. B.. Weatherall. D. J.. 1984-06-01. Population and genetic studies suggest a single origin for the Indian deletion β° thalassaemia. British Journal of Haematology. en. 57. 2. 271–278. 10.1111/j.1365-2141.1984.tb02896.x. 6329257. 13519331. 1365-2141.
  28. Young. I. D.. Rickett. A. B.. Clarke. M.. 1985-08-01. High incidence of Meckel's syndrome in Gujarati Indians.. Journal of Medical Genetics. en. 22. 4. 301–304. 10.1136/jmg.22.4.301. 0022-2593. 4045959. 1049454.
  29. Pemberton . Trevor J. . Li . Fang-Yuan . Hanson . Erin K. . Mehta . Niyati U. . Choi . Sunju . Ballantyne . Jack . Belmont . John W. . Rosenberg . Noah A. . Tyler-Smith . Chris . Patel . Pragna I. . Impact of restricted marital practices on genetic variation in an endogamous Gujarati group . American Journal of Physical Anthropology . 149 . 1 . 2012 . 92–103 . 10.1002/ajpa.22101 . 22729696 . 3436606 .
  30. Quintana-Murci. Lluís. Chaix. Raphaëlle. Wells. R. Spencer. Behar. Doron M.. Sayar. Hamid. Scozzari. Rosaria. Rengo. Chiara. Al-Zahery. Nadia. Semino. Ornella. 2004. Where West Meets East: The Complex mtDNA Landscape of the Southwest and Central Asian Corridor. The American Journal of Human Genetics. 74. 5. 827–845. 10.1086/383236. 1181978. 15077202.
  31. Eaaswarkhanth . Muthukrishnan . Dubey . Bhawna . Meganathan . Poorlin Ramakodi . Ravesh . Zeinab . Khan . Faizan Ahmed . Singh . Lalji . Thangaraj . Kumarasamy . Haque . Ikramul . Diverse genetic origin of Indian Muslims: evidence from autosomal STR loci . Journal of Human Genetics . 54 . 6 . 2009 . 340–348 . 10.1038/jhg.2009.38 . 19424286 . 153224 . free .
  32. Bhasin . M.K. . Singh . Indera P. . Indera Paul Singh . Sudhakar . K. . Bhardwaj . Veena . Chahal . S.M.S. . Walter . H. . Dannewitz . Angela . Genetic studies in four tribal populations of the Surat District, Gujarat (India) . Annals of Human Biology . 12 . 1 . 1985 . 27–39 . 10.1080/03014468500007541 . 3977282 .