The Simons Genome Diversity Project: 300 genomes from 142 diverse populations

Study ID Alternative Stable ID Type
EGAS00001001959 Other

Study Description

Here we report the Simons Genome Diversity Project data set: high quality genomes from 300 individuals from 142 diverse populations. These genomes include at least 5.8 million base pairs that are not present in the human reference genome. Our analysis reveals key features of the landscape of human genome variation, including that the rate of accumulation of mutations has accelerated by about 5% in non-Africans compared to Africans since divergence. We show that the ancestors of some pairs of present-day human populations were substantially separated by 100,000 years ago, well before the archaeologically attested onset of behavioural modernity. We also demonstrate that indigenous Australians, New Guineans and Andamanese do not derive substantial ancestry from an early dispersal of modern humans; instead, their modern human ancestry is consistent with coming from the same source as that in other non- Africans.

Study Datasets 2 datasets.

Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data

Dataset ID Description Technology Samples
Whole genome sequencing of 300 individuals from 142 diverse populations
Illumina HiSeq 2000 21
Whole genome sequencing of 92 individuals from 44 African indigenous populations. Sequences made with Illumina HiSeq 2000 sequencing system; data uploaded in BAM format.
Illumina HiSeq 2000 43

Who archives the data?