Exome Sequencing to Identify Medically Relevant Associations in Finnish Sub-Isolate Samples from the FINRISK Cohort
We propose to undertake, at the McDonnell Genome Institute of Washington University, a collaborative exome sequencing project of ~10,000 samples from a large Finnish population cohort, FINRISK, a participating component of the Sequencing Initiative Suomi (SISu) consortium. Together these sequenced samples will provide a unique resource for identification of infrequent or rare variants with a large impact on cardiovascular and metabolic disorders as well as on a wide range of heritable, disease-related quantitative phenotypes. Our collaborative group has already shown, in exome sequencing studies of smaller Finnish samples, that numerous loss of function variants are dramatically more frequent in Finland compared to other European populations, and that such variants demonstrate strong associations to several disease-related traits. Finland is the largest population isolate in Europe and the enrichment of these variants reflects its rapid growth from a severe population bottleneck about 100 generations ago.
Northern and Eastern parts of Finland experienced additional, more recent bottlenecks, and therefore the enrichment of particular high-impact variants in these regions is even more extreme (Stoll et al., 2013, PMID: 23912948). We therefore propose to focus exome sequencing efforts on about 10,000 members of FINRISK who are specifically drawn from these sub-isolate regions. As a very wide range of phenotypes are available from this cohort (including cardiovascular and metabolic disease outcomes obtained from national registries) we hypothesize that the proposed exome sequencing will identify numerous new disease-related associations. Within the National Biobanks of Finland (www.nationalbiobanks.fi) we have DNA and tissue samples from over 200,000 individuals of whom 50,000 have GWAS data; these samples provide an exceptional resource for both imputation from the sequencing studies proposed here, as well as for replication of associations.
- Type: Longitudinal
- Archiver: The database of Genotypes and Phenotypes (dbGaP)