This batch is a subset of the full DETECT-A dataset, containing 25210 fastq files generated from 3040 subjects. All sequencing was conducted using Illumina HiSeq 4000 and Illumina MiSeq platforms. Note that the division into batches follows no specific criteria and that the sequencing data for each subject has multiple files which may span multiple datasets. Thus, for a comprehensive analysis, it is recommended to request access to all datasets that comprise this study.
This batch is a subset of the full DETECT-A dataset, containing 9 fastq files generated from 8 subjects. All sequencing was conducted using Illumina HiSeq 4000 and Illumina MiSeq platforms. Note that the division into batches follows no specific criteria and that the sequencing data for each subject has multiple files which may span multiple datasets. Thus, for a comprehensive analysis, it is recommended to request access to all datasets that comprise this study.
We report a significant advance in Moyamoya disease (MMD) research through our unique access to a large MMD patient population combined with high depth exome sequencing and bioinformatics. We confirm a reported RNF213 founder mutation (FM), provide evidence for its ethnic specificity and identify novel variants in other genes that associate with Caucasian and non-RNF213 FM MMD. This work has a broader impact in vascular research by highlighting novel genetic aspects of cerebrovascular disease. Moyamoya Disease (MMD) is a rare cerebrovascular disorder characterized by unilateral or bilateral progressive stenosis or occlusion of the internal carotid artery, with frequent involvement of the anterior cerebral artery and the middle cerebral artery (MCA). Fragile collateral vessels subsequently develop and are particularly prone to hemorrhage. MMD patients are usually diagnosed angiographically following presentation of developmental delays, seizures, migraines, ischemia, and/or hemorrhage. The most common approach to re-establish normal blood flow is bypass surgery. While some progress has been made in identifying disease-associated genes, the underlying disease biology is not well understood. Moyamoya disease (MMD) is a rare cerebrovascular disease. Our unique access to a large MMD patient population combined with high depth exome sequencing and bioinformatics has led to a significant advance in the field. We confirm a reported RNF213 founder mutation (FM), provide evidence for its ethnic specificity and identify novel variants in other genes that associate with Caucasian and non-RNF213 FM MMD. This work has a broader impact in vascular research by highlighting novel genetic aspects of cerebrovascular disease. The 125 ethnically diverse, unrelated MMD patients were matched based on sex and broad ethnic category to 125 controls. Control DNA from the 1000 Genomes project was used to design a control Library by Personalis. 125 controls were selected from this Personalis Control DNA Library for our study. (The 1000 Genomes Project Consortium 2012). Genomic DNA was extracted using the Gentra Puregene kit (Qiagen, Valencia, CA). Libraries were prepared from approximately 3 µg of high quality genomic DNA (50-200 ng/µl) using Illumina TruSeq Genomic DNA High throughput Sample Prep Kits (Illumina, San Diego, CA) and exome enrichment (targeting 62Mb) was accomplished using the TruSeq Exome Target Enrichment kit (Illumina, San Diego, CA), all according to manufacturer's protocols. Target enrichment validation was confirmed by determining the concentration of the library by PicoGreen-based quantitation. Library yields ranged from 100-1000ng of DNA, a portion of which was run on the Bioanalyzer HS DNA chip (Agilent, Santa Clara, CA), with an average size of 300-550nt for DNA fragments. Exome Sequencing. Sequencing was performed using Illumina Hiseq2000 or HiSeq2500 sequencers with single lane, paired-end 2X100bp reads. DNA fragments were generated and amplified using Clonal Single Molecule Array technology (Illumina, San Diego, CA). The sequences were determined using the Clonal Single Molecule Array and Sequencing-by-Synthesis using Illumina's instrumentation and Reversible Terminator Chemistry. Each sequencing lane interrogated the DNA sequences of a pool of 3 individual sample libraries each carrying a unique index. Sequencing reads of at least 2x100bp in length for a total of approximately 8 Gb of sequence data were generated for each sample.
This study funded by the National Cancer Institute (NCI) involves conducting a genome-wide association study of common genetic variants to identify markers of susceptibility to bladder cancer. This bladder GWAS has led to the discovery of three novel regions in the genome associated with bladder cancer risk. Cases were defined as individuals having histologically confirmed primary carcinoma of the urinary bladder, including carcinoma in situ (ICD-0-2 topography codes C67.0-C67.9 or ICD9 codes 188.1-188.9). Scan data were obtained from two case-control studies carried out in Spain and the United States (specifically, those in the Maine and Vermont components of the New England Bladder Cancer Study) and three prospective cohort studies in Finland and the United States (specifically Alpha-Tocopherol, Beta-Carotene Cancer Prevention Study, Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial, and The American Cancer Society Cancer Prevention Study II Nutrition Cohort). We used data from 591,637 single nucleotide polymorphisms 3,532 affected individuals (cases) and 5,119 controls of European descent and replication including 8382 cases and 48275 controls from 16 studies. In a combined analysis, we identified three new regions associated with bladder cancer on chromosomes 22q13.1, 19q12 and 2q37.1: rs1014971, (P ≈ 8 x 10-12) maps to a non-genic region of chromosome 22q13.1, rs8102137 (P = 2 x 10-11) on maps to CCNE1 and rs11892031 (P = 1 x 10-7) maps to the UGT1A cluster on 2q37.1. We confirmed four previously identified genome-wide associations on chromosomes 3q28, 4p16.3, 8q24.21 and 8q24.3 (Rothman N et al., Nature Genetics, 2010, PMID: 20972438). Through meta-analysis with the MD Anderson Texas Bladder Cancer Study (TXBCS), we also identified a novel susceptibility locus that mapped to a region of 18q12.3, marked by rs7238033 (P = 8.7 x 10(-9); allelic odds ratio 1.20 with 95% CI: 1.13-1.28) and two highly correlated SNPs, rs10775480/rs10853535 (r(2)= 1.00; P = 8.9 x 10(-9); allelic odds ratio 1.16 with 95% CI: 1.10-1.22) (Garcia-Closas M et al, Human Molecular Genetics, 2011) For NCI-GWAS2, we performed genotyping on cases and controls for the New Hampshire component of the New England Bladder Cancer Study (NEBCS-NH). For the majority of new bladder cancer cases, we genotyped only cases from four case-control studies, the Los Angeles Bladder Cancer Study (LABCS), the French Center for Research on Prostate Diseases (CeRePP), the French Bladder Study (FBCS) and the Brescia Bladder Cancer Study (BBCS). We used existing control data from four cohort studies already genotyped and subjected to rigorous quality control metrics: the European Prospective Investigation Into Cancer and Nutrition Study (EPIC), Womens Health Initiative (WHI), Health Professionals Follow-up Study (HPFS), Nurses Health Study (NHS), which have been a part of Cancer Genetic Markers of Susceptibility (CGEMS). Meta-analysis of NCI-GWAS1, NCI-GWAS2 and a previously reported GWAS TXBCS-GWAS along with taqman replication, identified two new loci: rs10936599 on 3q26.2 (P = 4.53 x 10(-9)) and rs907611 on 11p15.5 (P = 4.11 x 10(-8)). Two notable loci were also identified that approached genome-wide statistical significance: rs6104690 on 20p12.2 (P = 7.13 x 10(-7)) and rs4510656 on 6p22.3 (P = 6.98 X 10(-7)); these require further studies for confirmation (Figueroa J et al, Human Molecular Genetics, 2013).
Genetics of thinness compared to obesity - summary statistics