Need Help?

Population Structure and Genetic Diversity in Argentinean populations

Argentina was one of the last places peopled when humans entered the Americas. Its earliest archeological sites are dated from 13,000 years BP, and throughout its history the region has exhibited broad cultural diversity, involving both hunter-gatherer and agro-pastoral societies, different linguistic families, the Southern expansion of the Inca Empire, the arrival of the Spanish conquistadors and slave trade from Africa. More recently, Argentina has had the highest proportion of influx of transoceanic migration between 1850 and 1940 of any Latin American country. Furthermore, the geographical distribution of immigrants was oriented towards the Pampa region of central-eastern Argentina and the Northeast, so different patterns of mixed ancestry throughout the country are expected. In this present work we aim to describe the history and structure of the Argentinean population. Thus far, we have collected DNA samples from 2904 participants from 15 dispersed regions in Argentina, from the Andean populations in the Northwest to the river lands of the Northeast. Our preliminary data includes genome-scale Illumina Exome Array 250K genotypes, plus next-generation sequencing data from uniformly spaced genomic regions targeted with a genotype-by- sequencing strategy (which covers 1.5% of the genome), for 375 participants from 9 of the 15 populations. We will identify proportions and patterns of European, African and Indigenous American ancestry in the Argentinean cohort via local ancestry inference. The first series of analyses will focus on genomic tracts of Indigenous American ancestry for fine-scale examination of sub-continental structure by comparing to a panel of 493 samples from 52 Indigenous American populations from throughout the Americas. Furthermore, this work will serve as a test of different low-cost strategies (exome genotype array vs. genotype-by-sequencing) for accurate inference of population structure in extant populations.

Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data

Dataset ID Description Technology Samples
EGAD00010001075 Illumina Exome 250K 391
Publications Citations
GBStools: A Statistical Method for Estimating Allelic Dropout in Reduced Representation Sequencing Data.
PLoS Genet 12: 2016 e1005631
Population structure in Argentina.
PLoS One 13: 2018 e0196325