H3Africa - Genomic and Environmental Risk Factors for Cardiometabolic Disease in Africans
The long-term vision of the AWI-Gen Collaborative Centre (CC) is to build sustainable capacity in Africa for research that leads to an understanding of the interplay between genetic, epigenetic and environmental risk factors for obesity and related cardiometabolic diseases (CMD) in sub-Saharan Africa. The CC will be consolidated under the auspices of the University of the Witwatersrand (Wits) and the International Network for the Demographic Evaluation of Populations and Their Health in Developing Countries (INDEPTH). It will capitalize on the unique strengths of existing longitudinal cohorts, including the urban Soweto and rural Agincourt studies in South Africa (Wits based), and the well established INDEPTH demographic health and surveillance centers in Kenya, Ghana, Burkina Faso and South Africa. The centers offer established infrastructure, trained fieldworkers, long-standing community engagement, and detailed longitudinal phenotypic data, focusing on obesity and cardiometabolic health. Key strengths are harmonized phenotyping across sites, building on strong existing cohorts, and representation of the geographic and social variability of African populations. We aim to: 1. Build sustainable infrastructure (biobanks and laboratories) and capabilities (well characterized population cohorts, genotyping and bioinformatics) for genomic research on the African continent; 2. Understand the genomic architecture of sub-Saharan populations from west, east and south Africa to guide genomic studies (genome sequencing and high throughput SNP and CNV arrays using unrelated individuals and family trios to improve the accuracy of haplotype analyses) and; 3. Investigate the independent and synergistic genomic contributions to body fat distribution (BMI, hip/waist circumference, subcutaneous and visceral fat) in these populations considering the relevant environmental and social contexts (rural/urban communities, quickly transitioning obesity prevalence, differential HIV, TB, and malaria infection histories). We will investigate the effect of obesity and fat distribution on the risk for CMD in the longitudinal cohorts.
Study
EGAS00001002482
Neuropsychiatric Genetics of African Populations - Psychosis (NeuroGAP-Psychosis)
The NeuroGAP-Psychosis study was conducted in South Africa, Kenya, Uganda and Ethiopia as a response to the lack of genetic diversity in large genomic studies of psychiatric disorders. The study aims to expand knowledge of the genetic and environmental risk factors for neuropsychiatric disorders in Africa through large-scale sample collection and analysis. The initial design is a case-control study in which cases are individuals with schizophrenia or bipolar disorder and controls are age, sex and ancestry matched with individuals from the same geographic location. Cases are patients who are recruited from clinical facilities, both inpatient and outpatient. Controls are people who present for treatment for general medical conditions at health facilities. Data on phenotype, mental disorders, history of physical health problems, substance use and history of past traumatic events was collected from all participants. Additionally, a sample of saliva is collected for DNA extraction. DNA samples were sent to the Broad Institute for genotyping using Illumina Infinium Global Screening Array.
Study
phs002528
APCDR Uganda GWAS: Genome-wide sequence variation and susceptibility loci for cardiometabolic traits in a sub-Saharan African population (UG2G component)
Genomic studies in African populations provide unique opportunities to understand disease aetiology, human genetic diversity and population history in a regional and a global context. To leverage the relative benefits of different strategies, we undertook a combined approach of genotyping and whole-genome sequencing (WGS) in a population-based study of 6,400 individuals from a geographically defined rural community in South-West Uganda. We present data from 4,778 individuals with genotypes for ~2.2 million SNPs from the Uganda GWAS resource (UGWAS), and sequence data on up to 1,978 individuals spanning 41.5M SNPs and 4.5M indels (UG2G); 343 individuals overlap between the two datasets.
We highlight the value of the largest sequence panel from Africa to date as a global resource for variant discovery, imputation and understanding the mutational spectrum and its clinical relevance in African populations. Alongside phenotype data, we provide a rich new genomic resource for researchers in Africa and globally.
Study
EGAS00001000545
H3Africa - Kidney Disease Research Network
Chronic kidney disease (CKD) affects an estimated 14% of adults in sub-Saharan Africa but very little research has been done on the etiology, progression and prevention of CKD on the continent. As part of the Human Heredity and Health in Africa (H3A) Consortium, the H3Africa Kidney Disease Research Network was established to study prevalent forms of kidney disease in sub-Saharan Africa and increase the capacity for genetics and genomics research. The study is performing comprehensive phenotypic characterization with analysis of environmental and genetic factors from seven clinical centers in two African countries (Ghana and Nigeria) over a five-year period. Approximately 4,000 participants with specified kidney disease diagnoses and 4000 control participants will be enrolled in the four African countries. In addition, approximately 50 families with hereditary glomerular disease will be enrolled. The study includes both pediatric and adult participants ages <1 to 74yrs across a broad spectrum of kidney diseases secondary to hypertension, diabetes, human immunodeficiency virus, sickle cell disease, biopsy-proven glomerular disease and CKD of unknown origin. Clinical and demographic data with biospecimens are collected for assessment of clinical, biochemical and genetic markers of kidney disease.
Study
EGAS00001006558
APCDR Uganda GWAS: Genome-wide sequence variation and susceptibility loci for cardiometabolic traits in a sub-Saharan African population (UGWAS component)
Genomic studies in African populations provide unique opportunities to understand disease aetiology, human genetic diversity and population history in a regional and a global context. To leverage the relative benefits of different strategies, we undertook a combined approach of genotyping and whole-genome sequencing (WGS) in a population-based study of 6,400 individuals from a geographically defined rural community in South-West Uganda. We present data from 4,778 individuals with genotypes for ~2.2 million SNPs from the Uganda GWAS resource (UGWAS), and sequence data on up to 1,978 individuals spanning 41.5M SNPs and 4.5M indels (UG2G); 343 individuals overlap between the two datasets.
We highlight the value of the largest sequence panel from Africa to date as a global resource for variant discovery, imputation and understanding the mutational spectrum and its clinical relevance in African populations. Alongside phenotype data, we provide a rich new genomic resource for researchers in Africa and globally.
Study
EGAS00001001558
Ghana Breast Health Study
The Ghana Breast Health Study (GBHS) was a multi-disciplinary, population-based case-control study conducted in Accra and Kumasi, Ghana that was designed to investigate breast cancer etiology in the West Africa region. The study enrolled 1,126 invasive breast cancer cases and 2,106 frequency matched population-based controls using census-based sampling. GHBS participants were between the ages of 18 to 74 years of age. Breast cancer cases were recruited at the time they presented with lesions suspicious of breast cancer at three hospitals: Korle Bu Teaching Hospital (Accra), Komfo Anoyke Teaching Hospital (Kumasi) and Peace and Love Hospital (Kumasi). These hospitals are the primary providers of treatment for breast cancer in Ghana, allowing for cases in our study to represent most breast cancers diagnosed in these areas during the study period. The following publications describe the GBHS in detail:
1) Brinton LA et al. Design considerations for identifying breast cancer risk factors in a population-based study in Africa. Int J Cancer 2017;140:2667-2677 (PMID:28295287).
2) Nyante SJ et al. Recruiting population controls for case-control studies in sub-Saharan Africa: The Ghana Breast Health Study. PloS one 2019;14:e0215347 (PMID:30990841)Among subjects with an available source of germline DNA genotyping was performed using the Infinium Global Screening Array-24 per the manufacturer’s guidelines using the Infinium automated protocol. We excluded probable duplicates and close relatives, and samples with a call rate < 95% or samples with extreme heterozygosity. Female sex was verified by Identifiler using X chromosome heterozygosity and the X chromosome inbreeding coefficient F statistics. The GLU genetics’ ld.tagzilla module was used to confirm African ancestry. To resolve more detailed population substructure, principal components analysis was conducted among all unrelated subjects using the EIGENSOFT v7.2.1. Imputation was performed using the Michigan imputation server Minimac3 with 1K genomes phase3 as reference panel.
Study
phs002387
RODAM cohort
The RODAM study assesses the health and wellbeing of Ghanaian residents in Ghana and Europe and follows them up over time. With this unique approach the RODAM study attempts to unravel the causes of cardiovascular disease and its risk factors among African migrants in Europe and non-migrants sub-Saharan Africa. Fecal samples from RODAM participants in rural, urban Ghana and Amsterdam, the Netherlands were sequenced using 16S rRNA sequencing to assess gut microbiome differences between research sites.
Study
EGAS50000000805
National Cancer Institute - Population Structure and Natural Selection in the Epidemiology of Burkitt Lymphoma in East African Children and Minors (EMBLEM) study in Uganda
A genome-wide analysis of genetic structure, gene flow, and natural selection was conducted in populations in the endemic Burkitt lymphoma (eBL) belt in Ghana and Northern Uganda, both subject to a high incidence of falciparum malaria and eBL. These populations have different ethnolinguistic ancestries and are located 2400 miles apart in sub-Saharan Africa. We characterized genetic composition of these populations in the context of 22 additional African populations and present evidence for gene flow events that occurred in the last 3000 years, possibly related to regional migrations in Western Africa and major migrations involving Nilotic, Cushitic, and Bantu groups. The Ugandan population was comprised of 758 children (mean age ~7 years) from 17 Western Nilo-Saharan tribes. The current dataset includes 561 healthy children and 197 children with eBL from northern Uganda.
Study
phs001705
H3Africa AWI-GEN Phase 1 Phenotype
The phenotypic data for ~12500 samples of the AWI-Gen Phase 1 Population cross-sectional study of older adults (mostly between 40 and 60 years), men and women. Six study sites in four sub-Saharan African counties including Ghana, Burkina Faso, Kenya and South Africa. Some groups are missing data for specific variables. Data includes questionnaire data (demography, health history, family health history, behaviour and infection data); anthropometry; and laboratory assays on blood and urine.
Dataset
EGAD00001006425
APCDR AGV Project: Whole genome sequencing of 3 African populations (curated data)
The African Genome Variation Project (AGVP) provides a resource to help design, implement and interpret genomic studies in sub-Saharan Africa (SSA) and worldwide. The AGVP represents dense genotypes from 1,481 and whole genome sequences (WGS) from 320 individuals across SSA.
As part of the AGVP we sequenced 3 African populations; Baganda from Uganda (4x), Zulu from South Africa (4x), Gumuz, Wolayta, Oromo, Somali and Amhara from Ethiopia (8x).
Study
EGAS00001000960