There is considerable ethno-linguistic and genetic variation among human populations in Asia, although tracing the origins of this diversity is complicated by migration events. Thailand is at the center of Mainland Southeast Asia (MSEA), a region within Asia that has not been extensively studied. Genetic substructure may exist in the Thai population, since waves of migration from southern China throughout its recent history may have contributed to substantial gene flow. Autosomal SNP data were collated for 438,503 markers from 992 Thai individuals. Using the available self-reported regional origin, four Thai subpopulations genetically distinct from each other and from other Asian populations were resolved by Neighbor-Joining analysis using a 41,569 marker subset. Using an independent Principal Components-based unsupervised clustering approach, four major MSEA subpopulations were resolved in which regional bias was apparent. A major ancestry component was common to these MSEA subpopulations and distinguishes them from other Asian subpopulations. On the other hand, these MSEA subpopulations were admixed with other ancestries, in particular one shared with Chinese. Subpopulation clustering using only Thai individuals and the complete marker set resolved four subpopulations, which are distributed differently across Thailand. A Sino-Thai subpopulation was concentrated in the Central region of Thailand, although this constituted a minority in an otherwise diverse region. Among the most highly differentiated markers which distinguish the Thai subpopulations, several map to regions known to affect phenotypic traits such as skin pigmentation and susceptibility to common diseases. The subpopulation patterns elucidated have important implications for evolutionary and medical genetics. The subpopulation structure within Thailand may reflect the contributions of different migrants throughout the history of MSEA. The information will also be important for genetic association studies to account for population-structure confounding effects.
Disciplines :
Life sciences: Multidisciplinary, general & others
Author, co-author :
Wangkumhang, Pongsakorn; National Center for Genetic Engineering and Biotechnology, Thailand > Genome Institute > Biostatistics and informatics Laboratory
Shaw, Philip James; National Center for Genetic Engineering and Biotechnology, Thailand > Genome Institute > Biostatistics and informatics Laboratory
Chaichoompu, Kridsadakorn; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Bioinformatique
Ngamphiw, Chumpol; National Center for Genetic Engineering and Biotechnology, Thailand > Genome Institute > Biostatistics and informatics Laboratory
Assawamakin, Anunchai
Nuinoon, Manit
Sripichai, Orapan
Svasti, Saovaros
Fucharoen, Suthat
Praphanphoj, Verayuth
Tongsima, Sissades
Language :
English
Title :
Insight into the peopling of Mainland Southeast Asia from Thai population genetic structure.
Publication date :
2013
Journal title :
PLoS ONE
eISSN :
1932-6203
Publisher :
Public Library of Science, United States - California