Need Help?

Assessing the impact of low frequency coding variants on disease risk using the Exomechip

Following several rounds of Genome Wide Association (GWA) scans and subsequent meta-analyses of results multiple risk loci have been identified for many common diseases. However, in most instances the implicated common variants identified so far explain only a modest fraction of the genetic risk. In parallel to our efforts to identify the causative variants in the known loci it is necessary to assess the full spectrum of sequence variants for disease risk and in particular low frequency and rare variants that have not been tested in a comprehensive way so far. To this end an international effort by investigators who have performed whole exome sequencing in circa 12,000 individuals have assembled a set of ~250,000 exonic variants of low frequency – defined as seen in at least two studies and a minimum of three individuals (non-sysnonymous) or two individuals for SNPs altering splice sites / stop codons. The SNP content from the exome studies was complemented with additional interesting SNP sets totalling 25,000 markers (GWA tag SNPs, grid of common variants, HLA, Mitochondrial, Y-chromosome etc) and were used to generate a custom iSELECT array, the exome chip. Adequately powered association studies to test low frequency variants for association to disease risk are still very expensive if conducted by whole exome sequencing. The exome chip provides a cost efficient way to undertake such an experiment and the Wellcome Trust Cases Control Consortium will be applying this approach to eight diseases, type 1 and type 2 diabetes, coronary artery disease, hypertension, multiple sclerosis, rheumatoid arthritis, bipolar, and ankylosing spondylitis. The first objective will be to generate a large set of at least 10,000 common UK controls as analysis of lower frequency variants will require large sample sizes. This includes ~6000 samples from the 1958 Birth Cohort.

Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data

Dataset ID Description Technology Samples
EGAD00010000234 Illumina HumanExome-12v1_A-GenCall, zCall 12241