SNP6 data for seminoma samples
For our BAP1 manuscript
To identify protein altering variants (PAVs) for glioma we analysed Illumina HumanExome BeadChip exome array data on 1,882 glioma cases and 8,079 controls from three independent European populations. In addition to single variant tests we incorporated information on the predicted functional consequences of PAVs and analysed sets of genes with a higher likelihood of having a role in glioma on the basis of the profile of somatic mutations documented by large-scale sequencing initiatives. Globally tThere was a strong relationship between effect size and SNPs predicted to be damaging (P=2.29x10-49); however, these variants which are most likely to impact on risk, are rare (MAF<5%). While no single variant showed an association which was statistically significant at the genomewide threshold a number of represented promising Notable associations - were seen with the BRCA2:c.9976A>T, p.(Lys3326Ter)BRCA2 p.Lys332X variant, which has been shown documented to influence breast and lung cancer risks (OR=2.3, P=4.00x10-4 for glioblastoma [GBM]) and IDH2:c.782G>A, p.(Arg261His)IDH2 p.Arg261His (OR=3.21, P=7.67x10-3, for non-GBM). Additionally, gene burden tests revealed a statistically significant association for HARS2 and risk of GBM (P=2.20x10-6). Genome scans of low frequency PAVs represent a complementary strategy to identify disease-causing variants compared with scans based on tagSNPs. Strategies to lessen the multiple testing burden by restricting analysis to PAVs with higher priors affords an opportunity to maximise study power.
The CentralAfricanCMC_Pemberton dataset encompasses 153,798 SNPs from the Illumina Cardio-MetaboChip (Voight et al. 2012) genotyped in 406 individuals from 19 Central African Populations from Gabon, Cameroon, Centralafrican Republic and Uganda). Individual phenotypic and cultural information at the individual level for this data set encompass gender, lifestyle (hunter-gatherer or farmer), and, when available, stature phenotype (standing height in cm, sitting-height in cm, and weight in kg). Other cultural, linguistic, and geographical location information about the sampled populations can be found in Pemberton et al. , Human Genetics, 2018 (https://doi.org/10.1007/s00439-018-1902-3).This dataset can only be accessed and used for non-commercial research purposes with a finality complying with the informed consent provided by Central African donors for the study of human evolutionary history only.