The Northern Finland Birth Cohorts program (NFBC) was initiated in the 1960s in the two northernmost provinces of Finland to study risk factors involved in pre-term birth and intrauterine growth retardation, and the consequences of these early adverse events on subsequent morbidity and mortality. The uniqueness of NBFCs is that the data of the cohorts were obtained from early fetal life (including maternal health during pregnancy) to adulthood. The NFBC1966 includes 12,058 live births to mothers in the two northern-most provinces of Finland. Two decades later, a second cohort of 9432 births was obtained (NFBC1986). In NFBC1966 pregnancies were followed prospectively from the first antenatal contact (10-16th week). After birth, the offspring were examined and then again underwent clinical evaluation at ages 1y, 7y, 14-16y and 31y. At each visit, a wide range of phenotypic, lifestyle and demographic data were gathered by questionnaires and clinical examinations. For the most part, NFBC1986 has undergone similar evaluations to NFBC1966. Linkage to national registries includes hospitalization, deaths, education, medication, pensions, and provides up-to-date demographic and clinical information for members of both cohorts. DNA samples were obtained from 5,923 subjects from NFBC1966 and 6688 subjects from NFBC1986. Data coverage, 96% of all births in 1966 and 99% in 1986, is highly representative for the whole population. The NFBC program comprises more than 20 different projects coordinated by the Center of Lifecourse Disease studies in Northern Finland (COLD) at Oulu University. The prospective data collected from the NFBCs form a unique resource, allowing the study of disease emergence, and of the importance of genetic, biological, social and behavioral risk factors. The genome-wide association (GWA) study sponsored through the STAMPEED program of NHLBI employed genomic DNA samples previously collected by the NFBC1966 study and stored in the DNA repository of the National Institute for Health and Welfare, Finland. This NHLBI sponsored RO1 project aimed to identify genetic variants contributing to metabolic and cardiovascular diseases (CVD). In addition to de-identified genome wide genotypic data, a selected list of phenotypic data related to CVD including weight, height, BMI, HDL, LDL, total cholesterol, triglyceride, glucose, insulin and fasting status, are also available in dbGaP. A summary of the GWAS for the NFBC1966 cardiovascular risk traits can be found in Sabatti et al., Nature Genetics 41: 35-46, 2009, PMID: 19060910. The version 2 release of this study contains sequence data from seventeen loci associated with levels of triglyceride, HDL-C, LDL-C, total cholesterol, fasting plasma glucose, and fasting plasma insulin (Kathiresan et al. 2008, Willer et al. 2008, Sabatti et al. 2009, Dupuis et al. 2010, Teslovich et al. 2010). At each locus, protein-coding regions and 5' and 3' untranslated regions of genes nearest to single nucleotide polymorphisms showing genome-wide significant association with metabolic syndrome-related traits, were sequenced. Targeted Illumina sequencing of 78 genes (~270kb) using 150bp probes was performed on 4943 subjects of the Northern Finland Birth Cohort 1966 (NFBC1966). Whole exome sequencing on the Illumina platform was carried out on 586 of those participants. The sequencing study is part of a larger project that is funded by the National Human Genome Research Institute's Allelic Spectrum in Common Disease Initiative, and comprises sequence data from more than 7000 individuals in two Finnish cohorts: NFBC1966 and the Finland-United States Investigation of NIDDM Genetics (FUSION) study.
The ~52,000 sample Type 2 Diabetes Exome Sequencing project is a collaboration of six consortia with various funding mechanisms that have joined together to investigate genetic variants for type 2 diabetes (T2D) using the largest T2D case/control sample set compiled to date. This includes samples from: Type 2 Diabetes Genetic Exploration by Next-generation sequencing in multi-Ethnic Samples (T2D-GENES) Genetics of Type 2 Diabetes (GoT2D) Exome Sequencing Project (ESP) Slim Initiative in Genomic Medicine for the Americas: Type 2 Diabetes (SIGMA T2D) Lundbeck Foundation Centre for Applied Medical Genomics in Personalised Disease Prediction, Prevention, and Care (LuCAMP) Progress in Diabetes Genetics in Youth (ProDIGY) This data generated from the SIGMA Mexico City Diabetes Study was part of the Slim Initiative in Genomic Medicine for the Americas: Type 2 Diabetes (SIGMA T2D), which is an international research consortium funded by the Carlos Slim Foundation that seeks to identify the genetic risk factors for type 2 diabetes (T2D) in Mexico and Latin America and translate those findings into improved treatment and prevention of diabetes. The SIGMA T2D project has sequenced and genotyped more than 13,000 samples from Mexican and Mexican Americans. The individuals were obtained from over 20 cohorts across the 6 consortia that are listed in Table 1. The strategy was to perform deep exome sequencing of individuals, 24,991 with T2D and 24,953 controls, divided among five ancestry groups: Europeans, East Asians, South Asians, American Hispanics, and African Americans. The T2D-GENES, ProDIGY and SIGMA studies, sequencing was performed at the Broad Institute using the Agilent v2 capture reagent or Illumina Rapid Capture on Illumina HiSeq machines. Please note that while we summarize the full sample list in publications and below, two of the cohorts below are not in dbGaP, due to the samples not being consented for deposition. This includes the Kooperative Gesundheitsforschung in der Region Augsburg (KORA) study and Lundbeck Foundation Centre for Applied Medical Genomics in Personalised Disease Prediction, Prevention, and Care (LuCamp) study. The Exome Sequencing Project (ESP) was deposited in dbGaP as part of their initial study and the phs numbers for that project can be found here: https://esp.gs.washington.edu/drupal/dbGaP_Releases Table 1. 52,000 sample T2D Case/Control Whole Exome Sequencing Studies Ancestry Consortia Study Countries of Origin # Cases # Controls African American T2D-GENES Project 1 Jackson Heart Study US 500 526 African American T2D-GENES Project 1 Wake Forest School of Medicine Study US 518 530 African American ESP Exome Sequencing Project (ESP) US 467 1374 African American T2D-GENES Follow up study BioMe Biobank Program (BioMe) US 1297 1256 East Asian T2D-GENES Project 1 Korea Association Research Project Korea 526 561 East Asian T2D-GENES Project 1& Follow up Study Singapore Diabetes Cohort Study; Singapore Prospective Study Program Singapore (Chinese) 1486 1568 East Asian T2D-GENES Follow up study Korea SNUH South Korea 450 475 East Asian T2D-GENES Follow up study Research Studies in Hong Kong (Hong Kong) Hong Kong 493 485 European T2D-GENES Project 1 Ashkenazi US, Israel 506 355 European T2D-GENES Project 1 Metabolic Syndrome in Men Study (METSIM) Finland 484 498 European GoT2D Finland-United States Investigation of NIDDM Genetics (FUSION) Study Finland 472 476 European GoT2D Kooperative Gesundheitsforschung in der Region Augsburg (KORA) Germany 97 90 European GoT2D UK Type 2 Diabetes Genetics Consortium (UKT2D) UK 322 320 European GoT2D Malmo-Botnia Study Finland, Sweden 478 443 European LuCamp Lundbeck Foundation Centre for Applied Medical Genomics in Personalised Disease Prediction, Prevention, and Care (LuCamp) Denmark 997 997 European ESP Exome Sequencing Project (ESP) US 390 2843 European T2D-GENES Follow up study Genetics of Diabetes and Audit Research Tayside Study (GoDARTS) Scotland, UK 960 966 European T2D-GENES Follow up study Framingham Heart Study (FHS) US 396 596 Hispanic T2D-GENES Project 1 San Antonio Family Heart Study, San Antonio Family Diabetes/ Gallbladder Study, Veterans Administration Genetic Epidemiology Study, and the Investigation of Nephropathy and Diabetes Study Family Component US 272 218 Hispanic T2D-GENES Project 1 & SIGMAv2 Starr County, Texas US 1762 1738 Hispanic SIGMAv1 Mexico City Diabetes Study Mexico 281 549 Hispanic SIGMAv1 & v2 Multiethnic Cohort (MEC) US 1476 1443 Hispanic SIGMAv1 & v2 UNAM/INCMNSZ Diabetes Study (UIDS) Mexico 1998 1977 Hispanic SIGMAv1 & v2 Diabetes in Mexico Study (DMS) Mexico 1522 1546 Multi ethnic ProDIGY Treatment Options for Type 2 Diabetes in Adolescents and Youth (TODAY) US 3097 0 Multi ethnic ProDIGY SEARCH for Diabetes in Youth (SEARCH) US 553 0 South Asian T2D-GENES Project 1 London Life Sciences Population Study (LOLIPOP) UK (Indian Asian) 531 538 South Asian T2D-GENES Project 1 & Follow up study Singapore Indian Eye Study Singapore (Indian Asian) 1640 1478 South Asian T2D-GENES Follow up study Pakistan Genomic Resource (PGR) Pakistan 914 932
Burden of Disease in Sarcoma
Whole Genome Methylation in CLL
Gene Discovery in Age-Related Hearing Loss
Screening for abnormal CGI methylation in primary colorectal tumours
Diagnostic yield of affymetrix optima microarray in patients with non-syndromic autism spectrum disorders in India.
RNA-seq datasets in human insulinoma samples
Extension analysis to pursue candidate genes of interest in chordoma