Whole genome sequencing of six ethnic groups from Burkina Faso, Cameroon, and Tanzania

We conducted low-coverage (~10x) sequencing of six ethnic groups from three malaria-endemic countries in sub-Saharan Africa, with the aim of characterising genetic diversity and improving imputation performance for genotyped samples in these populations. These data are part of a pre-publication release and are available under managed access. For more information, please see the MalariaGEN data access policies.

The purpose of the project was to support the discovery and understanding of genetic variants that influence human disease. The specific goals were: (a) the discovery of single nucleotide variants at frequencies of 1% or higher in diverse populations; (b) more comprehensive discovery, down to variant frequencies of 0.1% to 0.5%, in functional gene regions; and (c) the discovery of structural variants, such as copy number variants, other insertions and deletions, and inversions, including sequence-level understanding of breakpoints.

Click on a Dataset ID in the table below to learn more and to find out whom to contact about access to these data.

Dataset ID        Description   Technology   Samples
EGAD00001005011                              111
EGAD00001005012                              76
EGAD00001005013                              63
EGAD00001005014                              174
EGAD00001005015                              156
EGAD00001005016                              148
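
As a quick check on the overall size of the release, the per-dataset sample counts listed above can be tallied. The following is a minimal sketch in Python: the dictionary simply copies the counts from the table, the helper name `total_samples` is hypothetical, and the total assumes that no sample appears in more than one dataset, which this page does not state.

```python
# Sample counts copied from the dataset table above.
# Assumption: each dataset holds distinct samples (not confirmed by the page).
samples_per_dataset = {
    "EGAD00001005011": 111,
    "EGAD00001005012": 76,
    "EGAD00001005013": 63,
    "EGAD00001005014": 174,
    "EGAD00001005015": 156,
    "EGAD00001005016": 148,
}


def total_samples(counts):
    """Return the sum of all per-dataset sample counts."""
    return sum(counts.values())


total = total_samples(samples_per_dataset)

# Dataset ID with the largest number of samples.
largest = max(samples_per_dataset, key=samples_per_dataset.get)

print(f"Datasets: {len(samples_per_dataset)}")
print(f"Total samples: {total}")
print(f"Largest dataset: {largest} ({samples_per_dataset[largest]} samples)")
```

Under that assumption, the six datasets together contain 728 samples.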
