Need Help?

The Simons Genome Diversity Project: 300 genomes from 142 diverse populations

Here we report the Simons Genome Diversity Project data set: high quality genomes from 300 individuals from 142 diverse populations. These genomes include at least 5.8 million base pairs that are not present in the human reference genome. Our analysis reveals key features of the landscape of human genome variation, including that the rate of accumulation of mutations has accelerated by about 5% in non-Africans compared to Africans since divergence. We show that the ancestors of some pairs of present-day human populations were substantially separated by 100,000 years ago, well before the archaeologically attested onset of behavioural modernity. We also demonstrate that indigenous Australians, New Guineans and Andamanese do not derive substantial ancestry from an early dispersal of modern humans; instead, their modern human ancestry is consistent with coming from the same source as that in other non- Africans.

Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data

Dataset ID Description Technology Samples
EGAD00001002721 Illumina HiSeq 2000 21
EGAD00001005019 Illumina HiSeq 2000 43
Publications Citations
Worldwide genetic variation of the IGHV and TRBV immune receptor gene families in humans.
Life Sci Alliance 2: 2019 e201800221
African evolutionary history inferred from whole genome sequence data of 44 indigenous African populations.
Genome Biol 20: 2019 82
G6PD distribution in sub-Saharan Africa and potential risks of using chloroquine/hydroxychloroquine based treatments for COVID-19.
Pharmacogenomics J 21: 2021 649-656
Chronology of natural selection in Oceanian genomes.
iScience 25: 2022 104583