A challenge in clinical genomics is to predict whether copy number variation (CNV) affecting a gene or multiple genes will manifest as disease. Increasing recognition of gene dosage effects in neurodevelopmental disorders prompted us to develop a computational approach based on critical-exon (highly expressed in brain, highly conserved) examination for potential etiologic effects. Using a large CNV dataset, our updated analyses revealed significant (P < 1.64 x 10-15) enrichment of critical-exons within rare CNVs in cases compared to controls. Separately, we used a weighted gene co-expression network analysis (WGCNA) to construct an unbiased protein module from prenatal and adult tissues and found it significantly enriched for critical exons in prenatal (P < 1.15 x 10-50, OR = 2.11) and adult (P < 6.03 x 10-18, OR = 1.55) development. WGCNA yielded 1,206 proteins for which we prioritized the corresponding genes as likely to have a role in neurodevelopmental disorders. We compared the gene lists obtained from critical-exon and WGCNA analysis and found 438 candidate genes associated with CNVs annotated as pathogenic, or variants of uncertain significance (VOUS), from among 10,619 developmental delay cases. We identified genes containing CNVs previously considered to be VOUS, to be new candidate genes for neurodevelopmental disorders (GIT1, MVB12B and PPP1R9A) demonstrating the utility of this strategy to index the clinical effects of CNVs. Reprinted from Uddin et al, 2016, with permission from Scientific Reports.
The SNP Typing for Association with Multiple Phenotypes from Existing Epidemiologic (STAMPEED) asthma project includes subjects with asthma and controls from the Chicago Asthma Genetics Study (CAG), NHLBI multicenter Severe Asthma Research Program (SARP) and NHLBI Collaborative Studies on the Genetics of Asthma CSGA (Wake Forest). All studies included European American and African American children and adults with asthma ranging from mild to severe and adult controls. CAG participants were collected at the University of Chicago. SARP participants were recruited at the NHLBI SARP sites with an emphasis on recruiting severe asthmatics (Moore et al, Am J Respir Crit Care Med, 2010, PMID: 19892860). CSGA cases and controls collected by the Wake Forest investigators were also included. Asthma status was based on both a physician's diagnosis and either bronchodilator reversibility or hyper-responsiveness to methacholine as well as less than 5 pack years of smoking. Genotyping was performed on the Illumina 1Mv1 platform, with individual genotypes called using clustering algorithms as implemented in the BeadStudio software by Illumina. The total number of markers following standard QC was 1,025,129. Imputation was performed using the HapMap phase 2, release 21 SNPs using MACH with the phased HapMap CEU and YRI haplotypes as a reference. Case/control association tests for asthma status were performed using logistic regression in R (http://CRAN.R-project.org/) on genotype dosages, and adjusting for the first principal component from EIGENSTRAT.
Purpose: The purpose of this study was to assess the somatic mutation landscape of breast tumors before and after neoadjuvant chemotherapy. The data includes whole exome sequencing of pre- and post- tissues that were collected in the context of an already published clinical trial SWOG S0800 (ZA Nahleh et al. Breast Cancer Research and Treatment. 158:485-495, 2016). The trial compared the efficacy of sequential nab-paclitaxel followed or preceded by doxorubicin/cyclophosphamide anthracycline chemotherapy with or without bevacizumab as preoperative therapy for stage II-III breast cancer. Experimental Design: 29 pre-treatment biopsies and 9 matching post-treatment surgically resected cancer tissues were available for analysis and were subjected to whole exome sequencing to identify mutational patterns associated with response to neoadjuvant chemotherapy. The 9 paired samples with residual invasive cancer after therapy were also analyzed to assess changes in mutational patterns in response to therapy. No matching normal tissues were available for germline sequencing and therefore we used n=7 post-treatment breast tissues without residual cancer (i.e. complete eradication of cancer by therapy) as a normal cohort to facilitate somatic variant calling.Conclusion: These results suggest that genomic disturbances in BRCA-related DNA repair mechanisms, reflected by a dominant mutational signature 3, confer increased chemotherapy sensitivity. Cancers that survive neoadjuvant chemotherapy, frequently have alterations in cell cycle regulating genes but different genes are affected in different patients.
This study is part 2 of 2 for Schumacher et al, Nat Communication. 2015 Jul 7;6:7138. Part 2 is the pooled meta-analysis of 29 different studies of 37,955 European (18,299 Cases/19,656 Controls) participants using eight different genotyping platforms/variations. table, th, td { border: 1px solid black; border-collapse: collapse; } Study Name Participants Platform CFR-1 2976 Illumina 1M, 1M Duo, Omni1 CFR-2 3053 Affymetrix Axiom MECC 1 982 Illumina Omni 2.5 MECC2 1940 Affymetrix Axiom Kentucky 2172 Affymetrix Axiom ACS/CAPII 1086 Affymetrix Axiom Melbourne 1008 Affymetrix Axiom Newfoundland 672 Affymetrix Axiom ASTERKSI 1895 Illumina 300K COLO23 212 Illumina 300K DACHS1 3417 Illumina 300K DACHS2 1173 Illumina OmniExpress DALS1 1416 Illumina 550k/610K DALS2 874 Illumina 300K HPFS1 457 Illumina OmniExpress HPFS2 348 Illumina OmniExpress HPFS AA 658 Illumina OmniExpress MEC 674 Illumina 300K NHS1 1168 Illumina OmniExpress NHS2 340 Illumina OmniExpress NHSAA 1091 Illumina OmniExpress OFCCR 1172 Affymetrix 100k/500K PHS 771 Illumina OmniExpress PMH 402 Illumina 300K PLCO1 2509 Illumina 550k/610K PLCO2 901 Illumina 300K VITAL 573 Illumina 300K WHI1 1999 Illumina 550k/610K WHI2 573 Illumina 300K Since this study is a meta-analysis no corresponding phenotype, pedigree, sample, subjects, or primary molecular data are available. Each contributing study/platform will have fulfilled its own DBGAP submission commitment. This DBGAP entry is for European participants in the the pooled meta-analysis exclusively.
The Genetics of Type 2 Diabetes Consortium (GoT2D) is a collaboration between the University of Michigan, the Broad Institute and the Wellcome Trust Centre for Human Genetics. The overall aim is to extend upon recent efforts, such as genome-wide association studies (GWAS) and large scale meta-analyses. While they have proved successful at mapping genomic loci that influence human diseases, like type 2 diabetes, much of the heritability remains unexplained. In this study, we use next generation sequencing and genotyping technologies to query for lower frequency variants in the human genome. Thereby, allowing a deeper characterization of the spectrum of alleles associated with type 2 diabetes risk, and a better assessment of the genes that play a role in the etiology of type 2 diabetes development. We studied 1,326 T2D cases and 1,331 normoglycemic controls from Northern and Central Europe (Sweden, Finland, UK, and Germany). To efficiently characterize the entire genome sequence of each individual, we performed low-coverage (~5x) whole-genome sequencing, augmented by deep coverage (~100x) sequencing of the exome (Fuchsberger et al, 2016), and dense (2.5M) single nucleotide polymorphism (SNP) genotyping using the HumanOmni2.5 array. The data deposited in dbGaP will include all the Swedish, Finnish, and UK samples, but the German data will be deposited in the European Genome-phenome Archive (EGA), by virtue of the project specific funding requirements.
Excess gene dosage from human chromosome 21 (HSA21), due to trisomy or translocation of HSA21 material, causes Down syndrome (DS). We derived trisomy 21 (T21) induced pluripotent stem cells (iPSCs) alongside otherwise isogenic euploid controls from mosaic DS fibroblasts (GM00260, first described in Lubiniecki et al., PMID: 225951) of the NIGMS genetic cell repository, to enable analysis of whole-chromosomal and individual gene dosage imbalance in the context of in vitro models of DS development. Virtual karyotyping (by Infinium CytoSNP-850k), and phased genotyping (by linked-read whole-genome sequencing) of the resulting iPSC lines provide a deeply-characterized genomic platform for dissection of HSA21 gene dosage. We apply this highly unique resource to allelic analysis of mRNA sequencing of iPSCs and differentiated neural lineages, and demonstrate its utility in the context of inducible HSA21 dosage compensation. To this end, we equipped one distinct HSA21 copy with a doxycycline-inducible XIST transgene, which endogenously silences one X in 46,XX females. Standard and allele-specific RNA-seq analysis confirms mono-allelic T21 silencing to be virtually complete and irreversible, further supported by DNA methylation data (Infinium methylEPIC). This full-length HSA21 map of phased single-nucleotide and insertion/deletion variants enables allele-specific applications for gene-level dissection T21 dosage effects. Because our XIST remains inducible in post-mitotic T21 neurons and astrocytes, we perform single-nuclei RNA-seq to demonstrate XIST efficiently represses genes even after terminal differentiation.
HeLa is the most widely used model cell line for studying human cellular and molecular biology. To date, no genomic reference for this cell line has been released, and experiments have relied on the human reference genome. Effective design and interpretation of molecular genetic studies performed using HeLa cells require accurate genomic information. Here we present a detailed genomic and transcriptomic characterization of a HeLa cell line. We performed DNA and RNA sequencing of a HeLa Kyoto cell line and analyzed its mutational portfolio and gene expression profile. Segmentation of the genome according to copy number revealed a remarkably high level of aneuploidy and numerous large structural variants at unprecedented resolution. Some of the extensive genomic rearrangements are indicative of catastrophic chromosome shattering, known as chromothripsis. Our analysis of the HeLa gene expression profile revealed that several pathways, including cell cycle and DNA repair, exhibit significantly different expression patterns from those in normal human tissues. Our results provide the first detailed account of genomic variants in the HeLa genome, yielding insight into their impact on gene expression and cellular function as well as their origins. This study underscores the importance of accounting for the strikingly aberrant characteristics of HeLa cells when designing and interpreting experiments, and has implications for the use of HeLa as a model of human biology. Copyright Landry et al., "The Genomic and Transcriptomic Landscape of a HeLa Cell Line," G3: Genes | Genomes | Genetics (2013).
Cryptosporidiosis causes severe diarrhea in infants in the developing world. There is no vaccine to prevent it, and little in the way of treatment. This study on Bangladeshi urban slum children aims to support the design of a vaccine, both by determining how the immune system protects from infection and by identifying the genotypes of the parasite that should be included in a vaccine, as well as aid in development of therapies by identifying human genes that control infection. The primary objective of the study was to determine the incidence and contribution to disease of the different species and genotypes of cryptosporidia. Secondary objectives were designed to determine acquired immune response to cryptosporidiosis and identify human genes that influence susceptibility to cryptosporidiosis. This is an observational study. Children were recruited from Mirpur Dhaka slum (Cohort 1) and rural Mirzapur (Cohort 2) and followed for cryptosporidium infection longitudinally from birth through age 4 years in Cohort 1 and birth through age 2 years in Cohort 2. Biweekly household visits for diarrheal surveillance were made in addition to anthropometric measurements of mother and child, blood samples collected two times each year from the child, a work-up of diarrheal stools and non-diarrheal surveillance stools for cryptosporidium and other enteropathogens, and blood and breast milk samples from the mother. A detailed description of the study design and procedures can be obtained from publication: Kevin L Steiner, et al., 2018, PMID: 29897482.
This dataset contains three sets of samples. The first sample set contains euploid fetus pregnancies reported by NIPTIFY screening test and postnatal evaluation. Dataset was processed similarly to previously published guidelines from KU Leuven, with modifications [1]. Briefly, peripheral blood samples were collected in cell-free DNA BCT tubes (Streck, USA), and plasma was separated with standard dual centrifugation. Cell-free DNA was extracted from 3 ml plasma using MagMAX Cell-Free DNA Isolation Kit (ThermoFisher Scientific). Whole-genome libraries were prepared using the FOCUS (Fragmented DNA Compact Sequencing Assay, Competence Centre on Health Technologies, Estonia) NIPT method protocol with 12 cycles for the final PCR enrichment step. In the following quantification, equal amounts of 36 samples were pooled, and the quality and quantity of the pool were assessed on Agilent 2200 TapeStation (Agilent Technologies, USA). Whole genome sequencing was performed on the NextSeq 550 instrument (Illumina Inc.) with an average coverage of 0.32× (minimum 0.08 and maximum 0.42) and producing 85 bp single-end reads. The second sample set contains a single NIPT sample postnatally diagnosed with Prader-Willi syndrome. The sample was sequenced with Illumina NextSeq 500 platform, producing 85 bp single-end reads with an average per-sample coverage of 0.32× at the University of Tartu, Institute of Genomics Core Facility, according to the manufacturer’s standard protocols, as described previously [2]. The third sample set contains samples SC005 (SeraCare Life Sciences Inc lot #10446565), SC0042 (#10571706), and SC016 (#10560229). These are SeraCare Life Sciences Inc circulating cell-free DNA (ccfDNA) like mixture of human genomic DNA that consists of matched maternal and fetus. SC005 and SC0042 consist of matched DNA of maternal and fetus with DiGeorge Syndrome. SC016 is a custom-ordered DNA Mix with fetus DNA having a pathogenic loss of the terminal region of 20p13 and a pathogenic 3q29 duplication. SC016 was processed as the first sample set was processed, and SC0042 was processed as the second sample set was processed. Sample SC005 was processed once as was sample set 1 and once as was sample set 2 processed. This study was performed with the approval of the Research Ethics Committee of the University of Tartu (#352/M-12). 1. Bayindir B, Dehaspe L, Brison N, Brady P, Ardui S, Kammoun M, et al. Noninvasive prenatal testing using a novel analysis pipeline to screen for all autosomal fetal aneuploidies improves pregnancy management. Eur J Hum Genet. 2015;23: 1286– 1293. doi:10.1038/ejhg.2014.282 2. Žilina O, Rekker K, Kaplinski L, Sauk M, Paluoja P, Teder H, et al. Creating basis for introducing noninvasive prenatal testing in the Estonian public health setting. Prenat Diagn. 2019;39: 1262–1268. doi:10.1002/pd.5578
The data provided here was critical in establishing that human long-term hematopoietic stem cells (LT-HSC), previously described as the most primitive HSC population, is actually composed of distinct subsets that can be prospectively isolated. Via mechanistic studies centering around the Rho-GTPase effector kinase PAK4 and its inhibitor INKA1, we identified the immune checkpoint ligand CD112 as a marker for hematopoietic stem and progenitor cells, that is highest expressed on LT-HSC. More importantly, CD112 can be used to stratify functionally distinct subsets within LT-HSC: In response to regeneration-mediated stress, the CD112low subset exhibits a transient restraint (termed latency) before contributing to hematopoietic reconstitution, while the CD112high subset is primed to respond rapidly. High resolution RNA-seq of the CD112 surface expression spectrum within rare LT-HSC subsets (human umbilical cord blood) demonstrated that more genes are differentially upregulated in the deeper quiescent and less metabolic active subset. Genes enriched in this subset centre around cell adhesion and Rho-GTPase signaling. This is in agreement with the scRNAseq data from human G-CSF mobilized peripheral blood (mPB) generated here that was used as an model of in vivo activation/priming revealing via RNA-velocity and pseudo-time analysis that INKA1high versus PAK4high, CDK6high and CD112high enrichment are either detected early or late in diffusion pseudotime indicative of quiescent versus primed cell status, respectively. RNAseq following INKA1 overexpression in LT-HSC and ST-HSC revealed by GSEA an overall stemness preserving phenotype and particularly in LT-HSC, but not in short-term HSC (ST-HSC), suppression of transcriptional programs linked to activation. Collectively, our data decipher the molecular intricacies underlying HSC heterogeneity and self-renewal regulation and point to latency as an orchestrated physiological response that integrates quiescence control with HSC fate choices to preserve a stem cell reservoir.
We report genome-wide association study results for the levels of A1, A2 and fetal hemoglobins, analyzed for the first time concurrently. Integrating high-density array genotyping and whole-genome sequencing in a large general population cohort from Sardinia, we detected 23 associations at 10 loci. Five signals are due to variants at previously undetected loci: MPHOSPH9, PLTP-PCIF1, ZFPM1 (FOG1), NFIX and CCND3. Among the signals at known loci, ten are new lead variants and four are new independent signals. Half of all variants also showed pleiotropic associations with different hemoglobins, which further corroborated some of the detected associations and identified features of coordinated hemoglobin species production.
Early-stage Luminal B breast cancer is frequent and is a major cause of breast cancer death due to its poor prognosis. Our proposal aims to study the biology behind the sensitivity and resistance of Luminal B breast cancer to chemotherapy (CHT) or a non-CHT regimen composed of hormone therapy in combination with ribociclib, a CDK4/6 inhibitor. To accomplish this, we first completed the SOLTI-1402 CORALLEEN phase II trial, a study where 106 patients with early-stage Luminal B breast cancer were randomized to standard neoadjuvant CHT for 6 months, or neoadjuvant letrozole and ribociclib for 6 months. After treatment, patients underwent surgery. The primary results of the study, which showed that the response rate to letrozole+ribociclib was similar to CHT, was reported (Prat et al; Lancet Oncol). Tumor biopsies were available at baseline, week 3 and surgery. A total of 257 samples were analyzed using the Illumina TruSeq Stranded Total RNA w/Ribo Zero Gold with MiSeq in TGL (Sequencer NovaSeq S4/PE/100x)
Whole exome sequencing of 10 metastatic biopsies from four TRACERx100 patients (see EGA dataset EGAS00001002247), collected either after relapse or death. The data from these samples are initially published with Abbosh, C. et al. Phylogenetic ctDNA analysis depicts early stage lung cancer evolution. Nature, http://dx.doi.org/10.1038/nature22364 (2017). Abstract: Earlier detection of relapse following primary surgery for non-small cell lung cancer and the characterization of emerging subclones seeding metastatic sites might offer new therapeutic approaches to limit tumor recurrence. The potential to non-invasively track tumor evolutionary dynamics in ctDNA of early-stage lung cancer is not established. Here we conduct a patient-specific approach to ctDNA profiling in the first 100 lung TRACERx (TRAcking Cancer Evolution through therapy (Rx)) study participants, including one patient co-recruited to the PEACE (Posthumous Evaluation of Advanced Cancer Environment) post-mortem study. We identify independent predictors of ctDNA release in early-stage non-small cell lung cancer and perform tumor volume limit of detection analyses. Through blinded profiling of post-operative plasma, we observe evidence of adjuvant chemotherapy resistance and identify patients destined to experience recurrence of their lung cancer. Finally, we show that phylogenetic ctDNA profiling tracks the subclonal nature of lung cancer relapse and metastases, providing a new approach for ctDNA driven therapeutic studies.
The Environmental Determinants of Diabetes in the Young (TEDDY) Study investigates genetic and genetic-environmental interactions, including gestational infection or other gestational events, childhood infections and other environmental factors after birth, in relation to the development of pre-diabetic islet autoimmunity and type 1 diabetes (T1D). Beginning in 2002, a consortium of six centers assembled to participate in the development and implementation of studies to identify environmental factors that trigger the development of islet autoimmunity and T1D in genetically susceptible individuals. The TEDDY study screened around 400,000 newborns and recruited 7,749 neonates from the general population with a pre-determined T1D risk of 3% and 919 neonates with first degree relatives who have T1D and who have a pre-determined T1D risk of 10%. Thus, TEDDY proposes to follow over 8,000 participants across six clinical centers worldwide (Finland, Germany, Sweden and three in the United States) until the age of 15. Participants are followed every three months for islet autoantibody measurements with blood sampling until four years of age and then at least every six months until the age of 15. After the age of four, autoantibody positive subjects continue to be followed at 3 month intervals and autoantibody negative subjects are followed at six month intervals. In addition to the analysis of autoantibodies, additional data and sample collection is performed at each visit. The parents collect monthly stool samples in early childhood. The parents also fill out questionnaires at regular intervals in connection with study visits and record information about diet and health status in the child's TEDDY Book between visits. Continued long-term follow-up of the currently active TEDDY participants will provide important scientific information on early childhood diet, reported and measured infections, vaccinations, and psychosocial stressors that may contribute to the development of type 1 diabetes and islet autoimmunity. DNA extracted from the 9-month TEDDY blood sample was used for SNP genotyping. Samples were available for 7,082 participants. SNPs were genotyped by the Center for Public Health Genomics at the University of Virginia, using the Illumina ImmunoChip SNP microarray of around 196,000 SNPs selected from 186 regions associated with 12 autoimmune diseases (including T1D) (Hadley et al., 2015). "Data quality control (QC) steps included eliminating subjects with a low call rate (> 5% SNPs missing) and discrepancies between reported gender and prior genotyping at the HLA laboratory. Secondly, SNPs were removed from analysis due to low call rate, Hardy-Weinberg equilibrium P value < 10-6, except for chromosome 6 due to HLA eligibility requirements)" (Törn et al., 2015). QC resulted in a total of 7,023 subjects with data on 176,586 SNPs. Additional information on the TEDDY study are available in the following articles: The Environmental Determinants of Diabetes in the Young (TEDDY) Study. TEDDY Study Group. Annals of the New York Academy of Science, 2008 and TEDDY - The Environmental Determinants of Diabetes in the Young - An Observational Clinical Trial. Annals of the New York Academy of Science, 2006 Details of the TEDDY protocol can be found in The Environmental Determinants of Diabetes in the Young (TEDDY): Genetic Criteria and International Diabetes Risk Screening of 421,000 infants. Pediatric Diabetes, 2011
This study includes samples from two projects: Collaborative Genetic Study of Nicotine Dependence (COGEND; PI: Laura Bierut) and Genetic Study of Nicotine Dependence in African Americans (AAND; PI: Laura Bierut and Eric Johnson). The majority of the COGEND subjects included in the current study overlap with the two datasets already available on dbGaP. GWAS data are available for COGEND subjects through the Study of Addiction: Genetics and Environment (SAGE), dbGaP study accession phs000092. It should be noted that the case definition in the SAGE study is DSM-IV alcohol dependence. GWAS data are available for additional COGEND subjects through The Genetic Architecture of Smoking and Smoking Cessation, dbGaP study accession phs000404. The overall goal of this project is to apply deep sequencing to key genomic regions associated with nicotine dependence in order to accelerate the discovery of variation in molecular pathways that govern the development of nicotine dependence. The sample includes unrelated cases and controls of European American and African American descent. Cases are defined by a commonly used definition of nicotine dependence, a current score of 4 or more (maximum score of 10) on the Fagerstrom Test for Nicotine Dependence (FTND). Control status is defined as an individual who smoked at least 100 cigarettes during their lifetime, yet never became dependent (lifetime FTND<2). By selecting controls who smoked cigarettes, we focus on those genetic effects that are specific to the development of nicotine dependence. COGEND: COGEND was initiated in 2001 as a three-part program project grant funded through the National Cancer Institute (NCI; PI: Laura Bierut). The three projects included a study of the familial transmission of nicotine dependence, a genetic study of nicotine dependence, and a study of the relationship of nicotine dependence with nicotine metabolism. The primary goal was to detect, localize, and characterize genes that predispose or protect an individual with respect to heavy tobacco consumption, nicotine dependence, and related phenotypes and to integrate these findings with the family transmission and nicotine metabolism findings. The primary design was a community based case-control study. Nicotine dependent cases and non-dependent, smoking controls were identified and recruited from Detroit and St. Louis. More than 54,000 subjects aged 25-44 years were screened by telephone; more than 3,100 subjects were personally interviewed; and more than 2,900 subjects donated blood samples for genetic studies. AAND: AAND was initiated in 2009 to identify and characterize genetic determinants of nicotine dependence in a large African American population. Community-based recruitment of approximately 100,000 people was conducted to ascertain 1,000 African American nicotine dependent cases and 1,000 African American non-dependent, smoking controls. All subjects were between the ages of 25-44. Subjects were screened by telephone; if they qualified as a case or control, they completed the same interview that was used in COGEND and donated a blood sample. Both studies (COGEND and AAND) included measures of basic socio-demographic variables, including age, sex, race/ethnicity, family income, and educational attainment using the Semi-Structured Assessment for the Genetics of Nicotine Dependence. Information on nicotine dependence, as assessed by the Fagerstrom Test for Nicotine Dependence (FTND) is available for all subjects. In addition, participants also completed the Nicotine Dependence Syndrome Scale (NDSS; Shiffman et al., 2004) and the Wisconsin Inventory of Smoking Dependence Motives (WISDM-68; Piper et al, 2004). All subjects were assessed in person by trained research assistants.
A case-control study was designed with 80 participants diagnosed with Barrett's Esophagus (BE) selected from a larger case-cohort study (Li et al., 2014, PMID: 24253313) within the Seattle Barrett's Esophagus Program (SBEP) at the Fred Hutchinson Cancer Research Center. The study included 40 cases with BE with non-cancer outcomes, "NCO", who did not progress to esophageal adenocarcinoma (EA), and 40 controls who progressed to an endoscopically detected, incident EA (cancer outcome, "CO"). For each patient, two timepoints were evaluated (T1 and T2). NCO were matched to CO based on age at T1 (T1=first endoscopy with sufficient sample availability), and time between T1 and T2 (T2 in CO=time of diagnosis of incident EA). In 10 NCO, a third time point (T3), a mean of 13.2 years after T1, was also sequenced. Mapped endoscopic biopsies, at 1/3 and 2/3 annotated distances from the gastroesophageal junction within the Barrett's segment, were sampled per patient at each timepoint. Each biopsy was purified to separate BE epithelium from stroma, and DNA was extracted from purified epithelium for 60X WGS and 2.5M SNP array; normal control blood (N=62) or normal gastric sample (N=18) were analyzed by 30X WGS and 2.5M SNP array. An additional seven normal gastric biopsy samples in seven patients with normal control blood were also sequenced at 60X as controls. All research participants contributing clinical data and esophageal samples to this study provided written informed consent, subject to oversight by the Fred Hutchinson Cancer Research Center IRB Committee D (Reg ID 5619).
Methotrexate plasma concentration is related to its clinical effects. To identify the genetic basis of interindividual variability in methotrexate pharmacokinetics in children with newly diagnosed acute lymphoblastic leukemia (ALL), we performed a genome-wide analysis (GWAS) of 500,568 germline single-nucleotide polymorphisms (SNPs) in 434 children with ALL who received 3,014 courses of methotrexate at 2 to 5 g/m2. SNPs were validated in an independent cohort of 206 patients. Adjusting for age, race, sex, and methotrexate regimen, the most significant associations were with SNPs in the organic anion transporter polypeptide, SLCO1B1 (rs11045879 (P = 1.7 x 10-10) and rs4149081 (P = 1.7 x 10-9) (Trevino et al, PMID: 19901119). To test whether rare variants in SLCO1B1 could alter its function, we genotyped SLCO1B1 exons in a slightly larger group of 699 children with ALL who received methotrexate and identified 93 single nucleotide polymorphisms (SNPs). We found several common and rare non-synonymous (NS) SNPs associated with methotrexate clearance. NS SNPs predicted to be functionally damaging (common or rare) were more likely to be found among patients with the lowest adjusted methotrexate clearance (lowest 10%) than patients with high clearance (highest 10%). Four SLCO1B1 haplotypes were associated with reduced methotrexate clearance and we verified that these haplotypes have lower function with in vitro transport assays. We were able to quantitatively account for a third of the population variability in clearance of methotrexate with clinical and genetic covariates. This data set includes the dependent variable of methotrexate clearance and all of the SNP data available from arrays, sequencing, and genotyping.
Age-related Macular Degeneration (AMD) is a leading cause of incurable blindness in people over the age of 65. AMD is a late-onset multi-factorial neurodegenerative disease and its pathogenesis involves interaction of genetic and environmental factors. Several chromosomal regions have been associated with AMD susceptibility through linkage analysis (Swaroop et al., 2009). More recent studies provide strong evidence that variants within the CFH gene cluster on chromosome 1 and at/near LOC387715/ARMS2 on chromosome 10 are strongly associated with disease. Variants at other genes including C2/BF, C3, CFI and APOE4, also contribute to AMD susceptibility. Our primary goals are to identify genetic variants and haplotypes that are associated with AMD. The underlying hypothesis is that DNA variation(s) in multiple genetic susceptibility loci will predispose individuals to AMD pathogenesis, and comparison of DNA of cases and controls should identify these susceptibility variants. Our studies are focused on the genetic analysis of advanced AMD and should provide novel insights into disease diagnosis, progression and pathology. We have assembled a collaborative group of researchers from the University of Michigan, Mayo Clinic, University of Pennsylvania, and the AREDS group including National Eye Institute intramural investigators, who collected clinical data and DNA from a large number of patients affected with AMD and from unaffected controls. The primary source of funding was National Eye Institute. Through this collaborative effort, we submitted and obtained usable genotyping data on 2184 patients and 1155 controls from the Center for Inherited Disease Research (CIDR).
The heart, which is the first organ to develop, is highly dependent on its form to function. However, how diverse cardiac cell types spatially coordinate to create complex morphological structures critical for heart function remains to be elucidated. Here, we show that integration of single cell RNA-sequencing with high-resolution multiplexed error-robust fluorescent in situ hybridization (MERFISH) not only resolves the identity of cardiac cell types developing the human heart but also provides a spatial mapping of individual cells that enables illumination of their organization into cellular communities forming distinct cardiac structures. We discovered that many of these cardiac cell types further specified into subpopulations exclusive to specific communities, supporting their specialization according to cellular ecosystem and anatomic region. In particular, ventricular cardiomyocyte subpopulations displayed an unexpected complex laminar organization across the ventricular wall and formed, with other cell subpopulations, several cellular communities. Interrogating cell-cell interactions within these communities revealed signaling pathways orchestrating the spatial organization of cardiac cell subpopulations during ventricular wall morphogenesis. In vivo conditional genetic mouse models and in vitro human pluripotent stem cell studies confirmed an intricate multicellular PLXN-SEMA crosstalk among specific ventricular cardiomyocyte, fibroblast and endothelial cell subpopulations that directs the compaction of the ventricular wall layers. Thus, these detailed findings into the cellular social interactions and specialization of cardiac cell types constructing and remodeling the human heart offer new insights into structural heart diseases as well as engineering complex multi-cellular tissues for human heart repair. Reprinted from Farah et al. (In press), with permission from Nature.
Previous work has shown that melanoma brain metastases (MBM) have a unique molecular profile compared to extracranial metastases (ECM). Description of the biology of MBM and associations between biological features and clinical outcomes will facilitate the design of rational therapies for patients with MBM. To better characterize the molecular profile of MBM, we conducted whole exome sequencing (WES) and total RNA sequencing on MBM tissues obtained from an independent cohort of 14 patients from Duke University Medical Center. This cohort adds valuable patient samples to the field and enables strengthened investigation of somatic mutations and gene expression profiles within and between cohorts. The mutational landscape and gene expression of MBM from the Duke cohort resembled those previously reported in a previously published dataset of WES and RNA sequencing in MBM and matched ECM tissues from UT MD Anderson Cancer Center (Fischer et al., 2019 30787016). Duke MBM demonstrated similar rates of mutations in genes associated with melanoma, including BRAF, NRAS, and PTEN, as the MD Anderson cohort. Immune populations in MBM were similar between Duke and MD Anderson MBM, though Duke samples demonstrated more resting CD4+ memory T cells. The ratio of M2 to M1 macrophages in the Duke cohort was similar to that of the MD Anderson cohort. In MBM, a higher M2:M1 ratio may contribute to an immune-suppressive tumor microenvironment. Sequencing data and sample attributes from MBM tissues obtained from 14 patients from Duke University Medical Center will be available through dbGaP.
The purpose of this dataset is to facilitate development of technical implementations for rare disease data integration, analysis, discovery, and federated access. This synthetic dataset includes clinical and genomic data from 6 rare disease cases. It consists of 18 whole genomes (6 index cases with their parents) which have genetic background based on public human data sequenced in the context of the Illumina Platinum initiative (Eberle, MA et al. (2017)) and made available by the HapMap project (https://www.genome.gov/10001688/international-hapmap-project). In each of the cases, real causative variants correlating with the phenotypic data provided were spiked-in. The cases included in this synthetic dataset correspond to the following type of disorders: CASE 1- Congenital myasthenic syndrome (Autosomal Dominant -de novo variant) CASE 2- Macular dystrophy (Autosomal Dominant) CASE 3- Muscular dystrophy (Autosomal Recessive-compound heterozygous variants) CASE 4- Mitochondrial disorder (Autosomal Recessive-consanguineous case - homozygous variant) CASE 5- Breast cancer (Autosomal Dominant) CASE 6- Similar as case 1 for patient matchmaking tests: Congenital myasthenic syndrome (Autosomal Dominant-de novo variant) For each case you will be able to download the following data: clinical information (phenopackets per individual and pedigree per family), raw genomic data (FASTQ and BAMs) and processed genomic data (vcfs). When using the data, the following should be acknowledged: the RD-Connect GPAP (https://platform.rd-connect.eu/), EC H2020 project EJP-RD (grant # 825575), EC H2020 project B1MG (grant # 951724) and Generalitat de Catalunya VEIS project (grant # 001-P-001647).
Single cell RNA sequencing of immune and non-immune cells from healthy ageing thymus Thymus is a primary lymphoid organ that creates an environment for the T lymphocyte precursors differentiation into the naive T cells, which involves TCR gene rearrangement, negative and positive selection. Despite thymus critical function for the recognition of the pathogens it starts to atrophy very early in life. Thymic Epithelial Space (TES), where thymocyte education occurs, starts to decline rapidly just after birth, while with the onset of puberty thymus is also progressively replaced with adipose tissue. This leads to a reduced naive T cell output,which decreases an organism's ability to recognize pathogens. In our lab, we have already collected and characterized cell type composition and gene regulatory networks of the fetal tand adult human thymus (Park et al, Science 2020). In the current project we plan to expand the number of paediatric and adult samples in the study with the aim to understand how the process of thymic involution happens. We will use single-cell transcriptomics to zoom on changes that occur in immune and non-immune cell types in different phases of thymic involution with the hope to understand which gene expression changes are causal for the thymic involution process and how naïve T cell production can be increased in elderly. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Original description of the study: From ELLIPSE (linked to the PRACTICAL consortium), we contributed ~78,000 SNPs to the OncoArray. A large fraction of the content was derived from the GWAS meta-analyses in European ancestry populations (overall and aggressive disease; ~27K SNPs). We also selected just over 10,000 SNPs from the meta-analyses in the non-European populations, with a majority of these SNPs coming from the analysis of overall prostate cancer in African ancestry populations as well as from the multiethnic meta-analysis. A substantial fraction of SNPs (~28,000) were also selected for fine-mapping of 53 loci not included in the common fine-mapping regions (tagging at r2>0.9 across ±500kb regions). We also selected a few thousand SNPs related with PSA levels and/or disease survival as well as SNPs from candidate lists provided by study collaborators, as well as from meta-analyses of exome SNP chip data from the Multiethnic Cohort and UK studies. The Contributing Studies: Aarhus: Hospital-based, Retrospective, Observational. Source of cases: Patients treated for prostate adenocarcinoma at Department of Urology, Aarhus University Hospital, Skejby (Aarhus, Denmark). Source of controls: Age-matched males treated for myocardial infarction or undergoing coronary angioplasty, but with no prostate cancer diagnosis based on information retrieved from the Danish Cancer Register and the Danish Cause of Death Register. AHS: Nested case-control study within prospective cohort. Source of cases: linkage to cancer registries in study states. Source of controls: matched controls from cohort ATBC: Prospective, nested case-control. Source of cases: Finnish male smokers aged 50-69 years at baseline. Source of controls: Finnish male smokers aged 50-69 years at baseline BioVu: Cases identified in a biobank linked to electronic health records. Source of cases: A total of 214 cases were identified in the VUMC de-identified electronic health records database (the Synthetic Derivative) and shipped to USC for genotyping in April 2014. The following criteria were used to identify cases: Age 18 or greater; male; African Americans (Black) only. Note that African ancestry is not self-identified, it is administratively or third-party assigned (which has been shown to be highly correlated with genetic ancestry for African Americans in BioVU; see references). Source of controls: Controls were identified in the de-identified electronic health record. Unfortunately, they were not age matched to the cases, and therefore cannot be used for this study. Canary PASS: Prospective, Multi-site, Observational Active Surveillance Study. Source of cases: clinic based from Beth Israel Deaconness Medical Center, Eastern Virginia Medical School, University of California at San Francisco, University of Texas Health Sciences Center San Antonio, University of Washington, VA Puget Sound. Source of controls: N/A CCI: Case series, Hospital-based. Source of cases: Cases identified through clinics at the Cross Cancer Institute. Source of controls: N/A CerePP French Prostate Cancer Case-Control Study (ProGene): Case-Control, Prospective, Observational, Hospital-based. Source of cases: Patients, treated in French departments of Urology, who had histologically confirmed prostate cancer. Source of controls: Controls were recruited as participating in a systematic health screening program and found unaffected (normal digital rectal examination and total PSA < 4 ng/ml, or negative biopsy if PSA > 4 ng/ml). COH: hospital-based cases and controls from outside. Source of cases: Consented prostate cancer cases at City of Hope. Source of controls: Consented unaffected males that were part of other studies where they consented to have their DNA used for other research studies. COSM: Population-based cohort. Source of cases: General population. Source of controls: General population CPCS1: Case-control - Denmark. Source of cases: Hospital referrals. Source of controls: Copenhagen General Population Study CPCS2: Source of cases: Hospital referrals. Source of controls: Copenhagen General Population Study CPDR: Retrospective cohort. Source of cases: Walter Reed National Military Medical Center. Source of controls: Walter Reed National Military Medical Center ACS_CPS-II: Nested case-control derived from a prospective cohort study. Source of cases: Identified through self-report on follow-up questionnaires and verified through medical records or cancer registries, identified through cancer registries or the National Death Index (with prostate cancer as the primary cause of death). Source of controls: Cohort participants who were cancer-free at the time of diagnosis of the matched case, also matched on age (±6 mo) and date of biospecimen donation (±6 mo). EPIC: Case-control - Germany, Greece, Italy, Netherlands, Spain, Sweden, UK. Source of cases: Identified through record linkage with population-based cancer registries in Italy, the Netherlands, Spain, Sweden and UK. In Germany and Greece, follow-up is active and achieved through checks of insurance records and cancer and pathology registries as well as via self-reported questionnaires; self-reported incident cancers are verified through medical records. Source of controls: Cohort participants without a diagnosis of cancer EPICAP: Case-control, Population-based, ages less than 75 years at diagnosis, Hérault, France. Source of cases: Prostate cancer cases in all public hospitals and private urology clinics of département of Hérault in France. Cases validation by the Hérault Cancer Registry. Source of controls: Population-based controls, frequency age matched (5-year groups). Quotas by socio-economic status (SES) in order to obtain a distribution by SES among controls identical to the SES distribution among general population men, conditionally to age. ERSPC: Population-based randomized trial. Source of cases: Men with PrCa from screening arm ERSPC Rotterdam. Source of controls: Men without PrCa from screening arm ERSPC Rotterdam ESTHER: Case-control, Prospective, Observational, Population-based. Source of cases: Prostate cancer cases in all hospitals in the state of Saarland, from 2001-2003. Source of controls: Random sample of participants from routine health check-up in Saarland, in 2000-2002 FHCRC: Population-based, case-control, ages 35-74 years at diagnosis, King County, WA, USA. Source of cases: Identified through the Seattle-Puget Sound SEER cancer registry. Source of controls: Randomly selected, age-frequency matched residents from the same county as cases Gene-PARE: Hospital-based. Source of cases: Patients that received radiotherapy for treatment of prostate cancer. Source of controls: n/a Hamburg-Zagreb: Hospital-based, Prospective. Source of cases: Prostate cancer cases seen at the Department of Oncology, University Hospital Center Zagreb, Croatia. Source of controls: Population-based (Croatia), healthy men, older than 50, with no medical record of cancer, and no family history of cancer (1st & 2nd degree relatives) HPFS: Nested case-control. Source of cases: Participants of the HPFS cohort. Source of controls: Participants of the HPFS cohort IMPACT: Observational. Source of cases: Carriers and non-carriers (with a known mutation in the family) of the BRCA1 and BRCA2 genes, aged between 40 and 69, who are undergoing prostate screening with annual PSA testing. This cohort has been diagnosed with prostate cancer during the study. Source of controls: Carriers and non-carriers (with a known mutation in the family) of the BRCA1 and BRCA2 genes, aged between 40 and 69, who are undergoing prostate screening with annual PSA testing. This cohort has not been diagnosed with prostate cancer during the study. IPO-Porto: Hospital-based. Source of cases: Early onset and/or familial prostate cancer. Source of controls: Blood donors Karuprostate: Case-control, Retrospective, Population-based. Source of cases: From FWI (Guadeloupe): 237 consecutive incident patients with histologically confirmed prostate cancer attending public and private urology clinics; From Democratic Republic of Congo: 148 consecutive incident patients with histologically confirmed prostate cancer attending the University Clinic of Kinshasa. Source of controls: From FWI (Guadeloupe): 277 controls recruited from men participating in a free systematic health screening program open to the general population; From Democratic Republic of Congo: 134 controls recruited from subjects attending the University Clinic of Kinshasa KULEUVEN: Hospital-based, Prospective, Observational. Source of cases: Prostate cancer cases recruited at the University Hospital Leuven. Source of controls: Healthy males with no history of prostate cancer recruited at the University Hospitals, Leuven. LAAPC: Subjects were participants in a population-based case-control study of aggressive prostate cancer conducted in Los Angeles County. Cases were identified through the Los Angeles County Cancer Surveillance Program rapid case ascertainment system. Eligible cases included African American, Hispanic, and non-Hispanic White men diagnosed with a first primary prostate cancer between January 1, 1999 and December 31, 2003. Eligible cases also had (a) prostatectomy with documented tumor extension outside the prostate, (b) metastatic prostate cancer in sites other than prostate, (c) needle biopsy of the prostate with Gleason grade ≥8, or (d) needle biopsy with Gleason grade 7 and tumor in more than two thirds of the biopsy cores. Eligible controls were men never diagnosed with prostate cancer, living in the same neighborhood as a case, and were frequency matched to cases on age (± 5 y) and race/ethnicity. Controls were identified by a neighborhood walk algorithm, which proceeds through an obligatory sequence of adjacent houses or residential units beginning at a specific residence that has a specific geographic relationship to the residence where the case lived at diagnosis. Malaysia: Case-control. Source of cases: Patients attended the outpatient urology or uro-onco clinic at University Malaya Medical Center. Source of controls: Population-based, age matched (5-year groups), ascertained through electoral register, Subang Jaya, Selangor, Malaysia MCC-Spain: Case-control. Source of cases: Identified through the urology departments of the participating hospitals. Source of controls: Population-based, frequency age and region matched, ascertained through the rosters of the primary health care centers MCCS: Nested case-control, Melbourne, Victoria. Source of cases: Identified by linkage to the Victorian Cancer Registry. Source of controls: Cohort participants without a diagnosis of cancer MD Anderson: Participants in this study were identified from epidemiological prostate cancer studies conducted at the University of Texas MD Anderson Cancer Center in the Houston Metropolitan area. Cases were accrued in the Houston Medical Center and were not restricted with respect to Gleason score, stage or PSA. Controls were identified via random-digit-dialing or among hospital visitors and they were frequency matched to cases on age and race. Lifestyle, demographic, and family history data were collected using a standardized questionnaire. MDACC_AS: A prospective cohort study. Source of cases: Men with clinically organ-confined prostate cancer meeting eligibility criteria for a prospective cohort study of active surveillance at MD Anderson Cancer Center. Source of controls: N/A MEC: The Multiethnic Cohort (MEC) is comprised of over 215,000 men and women recruited from Hawaii and the Los Angeles area between 1993 and 1996. Between 1995 and 2006, over 65,000 blood samples were collected from participants for genetic analyses. To identify incident cancer cases, the MEC was cross-linked with the population-based Surveillance, Epidemiology and End Results (SEER) registries in California and Hawaii, and unaffected cohort participants with blood samples were selected as controls MIAMI (WFPCS): Prostate cancer cases and controls were recruited from the Departments of Urology and Internal Medicine of the Wake Forest University School of Medicine using sequential patient populations as described previously (PMID:15342424). All study subjects received a detailed description of the study protocol and signed their informed consent, as approved by the medical center's Institutional Review Board. The general eligibility criteria were (i) able to comprehend informed consent and (ii) without previously diagnosed cancer. The exclusion criteria were (i) clinical diagnosis of autoimmune diseases; (ii) chronic inflammatory conditions; and (iii) infections within the past 6 weeks. Blood samples were collected from all subjects. MOFFITT: Hospital-based. Source of cases: clinic based from Moffitt Cancer Center. Source of controls: Moffitt Cancer Center affiliated Lifetime cancer screening center NMHS: Case-control, clinic based, Nashville TN. Source of cases: All urology clinics in Nashville, TN. Source of controls: Men without prostate cancer at prostate biopsy. PCaP: The North Carolina-Louisiana Prostate Cancer Project (PCaP) is a multidisciplinary population-based case-only study designed to address racial differences in prostate cancer through a comprehensive evaluation of social, individual and tumor level influences on prostate cancer aggressiveness. PCaP enrolled approximately equal numbers of African Americans and Caucasian Americans with newly-diagnosed prostate cancer from North Carolina (42 counties) and Louisiana (30 parishes) identified through state tumor registries. African American PCaP subjects with DNA, who agreed to future use of specimens for research, participated in OncoArray analysis. PCMUS: Case-control - Sofia, Bulgaria. Source of cases: Patients of Clinic of Urology, Alexandrovska University Hospital, Sofia, Bulgaria, PrCa histopathologically confirmed. Source of controls: 72 patients with verified BPH and PSA<3,5; 78 healthy controls from the MMC Biobank, no history of PrCa PHS: Nested case-control. Source of cases: Participants of the PHS1 trial/cohort. Source of controls: Participants of the PHS1 trial/cohort PLCO: Nested case-control. Source of cases: Men with a confirmed diagnosis of prostate cancer from the PLCO Cancer Screening Trial. Source of controls: Controls were men enrolled in the PLCO Cancer Screening Trial without a diagnosis of cancer at the time of case ascertainment. Poland: Case-control. Source of cases: men with unselected prostate cancer, diagnosed in north-western Poland at the University Hospital in Szczecin. Source of controls: cancer-free men from the same population, taken from the healthy adult patients of family doctors in the Szczecin region PROCAP: Population-based, Retrospective, Observational. Source of cases: Cases were ascertained from the National Prostate Cancer Register of Sweden Follow-Up Study, a retrospective nationwide cohort study of patients with localized prostate cancer. Source of controls: Controls were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. PROGReSS: Hospital-based, Prospective, Observational. Source of cases: Prostate cancer cases from the Hospital Clínico Universitario de Santiago de Compostela, Galicia, Spain. Source of controls: Cancer-free men from the same population ProMPT: A study to collect samples and data from subjects with and without prostate cancer. Retrospective, Experimental. Source of cases: Subjects attending outpatient clinics in hospitals. Source of controls: Subjects attending outpatient clinics in hospitals ProtecT: Trial of treatment. Samples taken from subjects invited for PSA testing from the community at nine centers across United Kingdom. Source of cases: Subjects who have a proven diagnosis of prostate cancer following testing. Source of controls: Identified through invitation of subjects in the community. PROtEuS: Case-control, population-based. Source of cases: All new histologically-confirmed cases, aged less or equal to 75 years, diagnosed between 2005 and 2009, actively ascertained across Montreal French hospitals. Source of controls: Randomly selected from the Provincial electoral list of French-speaking men between 2005 and 2009, from the same area of residence as cases and frequency-matched on age. QLD: Case-control. Source of cases: A longitudinal cohort study (Prostate Cancer Supportive Care and Patient Outcomes Project: ProsCan) conducted in Queensland, through which men newly diagnosed with prostate cancer from 26 private practices and 10 public hospitals were directly referred to ProsCan at the time of diagnosis by their treating clinician (age range 43-88 years). All cases had histopathologically confirmed prostate cancer, following presentation with an abnormal serum PSA and/or lower urinary tract symptoms. Source of controls: Controls comprised healthy male blood donors with no personal history of prostate cancer, recruited through (i) the Australian Red Cross Blood Services in Brisbane (age range 19-76 years) and (ii) the Australian Electoral Commission (AEC) (age and post-code/ area matched to ProsCan, age range 54-90 years). RAPPER: Multi-centre, hospital based blood sample collection study in patients enrolled in clinical trials with prospective collection of radiotherapy toxicity data. Source of cases: Prostate cancer patients enrolled in radiotherapy trials: CHHiP, RT01, Dose Escalation, RADICALS, Pelvic IMRT, PIVOTAL. Source of controls: N/A SABOR: Prostate Cancer Screening Cohort. Source of cases: Men >45 yrs of age participating in annual PSA screening. Source of controls: Males participating in annual PSA prostate cancer risk evaluations (funded by NCI biomarkers discovery and validation grant), recruited through University of Texas Health Science Center at San Antonio and affiliated sites or through study advertisements, enrolment open to the community SCCS: Case-control in cohort, Southeastern USA. Prospective, Observational, Population-based. Source of cases: SCCS entry population. Source of controls: SCCS entry population SCPCS: Population-based, Retrospective, Observational. Source of cases: South Carolina Central Cancer Registry. Source of controls: Health Care Financing Administration beneficiary file SEARCH: Case-control - East Anglia, UK. Source of cases: Men < 70 years of age registered with prostate cancer at the population-based cancer registry, Eastern Cancer Registration and Information Centre, East Anglia, UK. Source of controls: Men attending general practice in East Anglia with no known prostate cancer diagnosis, frequency matched to cases by age and geographic region SNP_Prostate_Ghent: Hospital-based, Retrospective, Observational. Source of cases: Men treated with IMRT as primary or postoperative treatment for prostate cancer at the Ghent University Hospital between 2000 and 2010. Source of controls: Employees of the University hospital and members of social activity clubs, without a history of any cancer. SPAG: Hospital-based, Retrospective, Observational. Source of cases: Guernsey. Source of controls: Guernsey STHM2: Population-based, Retrospective, Observational. Source of cases: Cases were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. Source of controls: Controls were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. PCPT: Case-control from a randomized clinical trial. Source of cases: Randomized clinical trial. Source of controls: Randomized clinical trial SELECT: Case-cohort from a randomized clinical trial. Source of cases: Randomized clinical trial. Source of controls: Randomized clinical trial TAMPERE: Case-control - Finland, Retrospective, Observational, Population-based. Source of cases: Identified through linkage to the Finnish Cancer Registry and patient records; and the Finnish arm of the ERSPC study. Source of controls: Cohort participants without a diagnosis of cancer UGANDA: Uganda Prostate Cancer Study: Uganda is a case-control study of prostate cancer in Kampala Uganda that was initiated in 2011. Men with prostate cancer were enrolled from the Urology unit at Mulago Hospital and men without prostate cancer (i.e. controls) were enrolled from other clinics (i.e. surgery) at the hospital. UKGPCS: ICR, UK. Source of cases: Cases identified through clinics at the Royal Marsden hospital and nationwide NCRN hospitals. Source of controls: Ken Muir's control- 2000 ULM: Case-control - Germany. Source of cases: familial cases (n=162): identified through questionnaires for family history by collaborating urologists all over Germany; sporadic cases (n=308): prostatectomy series performed in the Clinic of Urology Ulm between 2012 and 2014. Source of controls: age-matched controls (n=188): age-matched men without prostate cancer and negative family history collected in hospitals of Ulm WUGS/WUPCS: Cases Series, USA. Source of cases: Identified through clinics at Washington University in St. Louis. Source of controls: Men diagnosed and managed with prostate cancer in University based clinic. Acknowledgement Statements: Aarhus: This study was supported by the Danish Strategic Research Council (now Innovation Fund Denmark) and the Danish Cancer Society. The Danish Cancer Biobank (DCB) is acknowledged for biological material. AHS: This work was supported by the Intramural Research Program of the NIH, National Cancer Institute, Division of Cancer Epidemiology and Genetics (Z01CP010119). ATBC: This research was supported in part by the Intramural Research Program of the NIH and the National Cancer Institute. Additionally, this research was supported by U.S. Public Health Service contracts N01-CN-45165, N01-RC-45035, N01-RC-37004, HHSN261201000006C, and HHSN261201500005C from the National Cancer Institute, Department of Health and Human Services. BioVu: The dataset(s) used for the analyses described were obtained from Vanderbilt University Medical Center's BioVU which is supported by institutional funding and by the National Center for Research Resources, Grant UL1 RR024975-01 (which is now at the National Center for Advancing Translational Sciences, Grant 2 UL1 TR000445-06). Canary PASS: PASS was supported by Canary Foundation and the National Cancer Institute's Early Detection Research Network (U01 CA086402) CCI: This work was awarded by Prostate Cancer Canada and is proudly funded by the Movember Foundation - Grant # D2013-36.The CCI group would like to thank David Murray, Razmik Mirzayans, and April Scott for their contribution to this work. CerePP French Prostate Cancer Case-Control Study (ProGene): None reported COH: SLN is partially supported by the Morris and Horowitz Families Endowed Professorship COSM: The Swedish Research Council, the Swedish Cancer Foundation CPCS1 & CPCS2: Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev Ringvej 75, DK-2730 Herlev, DenmarkCPCS1 would like to thank the participants and staff of the Copenhagen General Population Study for their important contributions. CPDR: Uniformed Services University for the Health Sciences HU0001-10-2-0002 (PI: David G. McLeod, MD) CPS-II: The American Cancer Society funds the creation, maintenance, and updating of the Cancer Prevention Study II cohort. CPS-II thanks the participants and Study Management Group for their invaluable contributions to this research. We would also like to acknowledge the contribution to this study from central cancer registries supported through the Centers for Disease Control and Prevention National Program of Cancer Registries, and cancer registries supported by the National Cancer Institute Surveillance Epidemiology and End Results program. EPIC: The coordination of EPIC is financially supported by the European Commission (DG-SANCO) and the International Agency for Research on Cancer. The national cohorts are supported by the Danish Cancer Society (Denmark); the Deutsche Krebshilfe, Deutsches Krebsforschungszentrum and Federal Ministry of Education and Research (Germany); the Hellenic Health Foundation, Greek Ministry of Health; Greek Ministry of Education (Greece); the Italian Association for Research on Cancer (AIRC) and National Research Council (Italy); the Dutch Ministry of Public Health, Welfare and Sports (VWS), Netherlands Cancer Registry (NKR), LK Research Funds, Dutch Prevention Funds, Dutch ZON (Zorg Onderzoek Nederland), World Cancer Research Fund (WCRF); the Statistics Netherlands (The Netherlands); the Health Research Fund (FIS), Regional Governments of Andalucía, Asturias, Basque Country, Murcia and Navarra, Spanish Ministry of Health ISCIII RETIC (RD06/0020), Red de Centros RCESP, C03/09 (Spain); the Swedish Cancer Society, Swedish Scientific Council and Regional Government of Skåne and Västerbotten, Fundacion Federico SA (Sweden); the Cancer Research UK, Medical Research Council (United Kingdom). EPICAP: The EPICAP study was supported by grants from Ligue Nationale Contre le Cancer, Ligue départementale du Val de Marne; Fondation de France; Agence Nationale de sécurité sanitaire de l'alimentation, de l'environnement et du travail (ANSES). The EPICAP study group would like to thank all urologists, Antoinette Anger and Hasina Randrianasolo (study monitors), Anne-Laure Astolfi, Coline Bernard, Oriane Noyer, Marie-Hélène De Campo, Sandrine Margaroline, Louise N'Diaye, and Sabine Perrier-Bonnet (Clinical Research nurses). ERSPC: This study was supported by the DutchCancerSociety (KWF94-869,98-1657,2002-277,2006-3518, 2010-4800), The Netherlands Organisation for Health Research and Development (ZonMW-002822820, 22000106, 50-50110-98-311, 62300035), The Dutch Cancer Research Foundation (SWOP), and an unconditional grant from Beckman-Coulter-HybritechInc. ESTHER: The ESTHER study was supported by a grant from the Baden Württemberg Ministry of Science, Research and Arts. The ESTHER group would like to thank Hartwig Ziegler, Sonja Wolf, Volker Hermann, Heiko Müller, Karina Dieffenbach, Katja Butterbach for valuable contributions to the study. FHCRC: The FHCRC studies were supported by grants R01-CA056678, R01-CA082664, and R01-CA092579 from the US National Cancer Institute, National Institutes of Health, with additional support from the Fred Hutchinson Cancer Research Center. FHCRC would like to thank all the men who participated in these studies. Gene-PARE: The Gene-PARE study was supported by grants 1R01CA134444 from the U.S. National Institutes of Health, PC074201 and W81XWH-15-1-0680 from the Prostate Cancer Research Program of the Department of Defense and RSGT-05-200-01-CCE from the American Cancer Society. Hamburg-Zagreb: None reported HPFS: The Health Professionals Follow-up Study was supported by grants UM1CA167552, CA133891, CA141298, and P01CA055075. HPFS are grateful to the participants and staff of the Physicians' Health Study and Health Professionals Follow-Up Study for their valuable contributions, as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, and WY. IMPACT: The IMPACT study was funded by The Ronald and Rita McAulay Foundation, CR-UK Project grant (C5047/A1232), Cancer Australia, AICR Netherlands A10-0227, Cancer Australia and Cancer Council Tasmania, NIHR, EU Framework 6, Cancer Councils of Victoria and South Australia, and Philanthropic donation to Northshore University Health System. We acknowledge support from the National Institute for Health Research (NIHR) to the Biomedical Research Centre at The Institute of Cancer Research and Royal Marsden Foundation NHS Trust. IMPACT acknowledges the IMPACT study steering committee, collaborating centres, and participants. IPO-Porto: The IPO-Porto study was funded by Fundaçäo para a Ciência e a Tecnologia (FCT; UID/DTP/00776/2013 and PTDC/DTP-PIC/1308/2014) and by IPO-Porto Research Center (CI-IPOP-16-2012 and CI-IPOP-24-2015). MC and MPS are research fellows from Liga Portuguesa Contra o Cancro, Núcleo Regional do Norte. SM is a research fellow from FCT (SFRH/BD/71397/2010). IPO-Porto would like to express our gratitude to all patients and families who have participated in this study. Karuprostate: The Karuprostate study was supported by the the Frech National Health Directorate and by the Association pour la Recherche sur les Tumeurs de la ProstateKarusprostate thanks Séverine Ferdinand. KULEUVEN: F.C. and S.J. are holders of grants from FWO Vlaanderen (G.0684.12N and G.0830.13N), the Belgian federal government (National Cancer Plan KPC_29_023), and a Concerted Research Action of the KU Leuven (GOA/15/017). TVDB is holder of a doctoral fellowship of the FWO. LAAPC: This study was funded by grant R01CA84979 (to S.A. Ingles) from the National Cancer Institute, National Institutes of Health. Malaysia: The study was funded by the University Malaya High Impact Research Grant (HIR/MOHE/MED/35). Malaysia thanks all associates in the Urology Unit, University of Malaya, Cancer Research Initiatives Foundation (CARIF) and the Malaysian Men's Health Initiative (MMHI). MCCS: MCCS cohort recruitment was funded by VicHealth and Cancer Council Victoria. The MCCS was further supported by Australian NHMRC grants 209057, 251553, and 504711, and by infrastructure provided by Cancer Council Victoria. Cases and their vital status were ascertained through the Victorian Cancer Registry (VCR) and the Australian Institute of Health and Welfare (AIHW), including the National Death Index and the Australian Cancer Database. MCC-Spain: The study was partially funded by the Accion Transversal del Cancer, approved on the Spanish Ministry Council on the 11th October 2007, by the Instituto de Salud Carlos III-FEDER (PI08/1770, PI09/00773-Cantabria, PI11/01889-FEDER, PI12/00265, PI12/01270, and PI12/00715), by the Fundación Marqués de Valdecilla (API 10/09), by the Spanish Association Against Cancer (AECC) Scientific Foundation and by the Catalan Government DURSI grant 2009SGR1489. Samples: Biological samples were stored at the Parc de Salut MAR Biobank (MARBiobanc; Barcelona) which is supported by Instituto de Salud Carlos III FEDER (RD09/0076/00036). Also sample collection was supported by the Xarxa de Bancs de Tumors de Catalunya sponsored by Pla Director d'Oncologia de Catalunya (XBTC). MCC-Spain acknowledges the contribution from Esther Gracia-Lavedan in preparing the data. We thank all the subjects who participated in the study and all MCC-Spain collaborators. MD Anderson: Prostate Cancer Case-Control Studies at MD Anderson (MDA) supported by grants CA68578, ES007784, DAMD W81XWH-07-1-0645, and CA140388. MDACC_AS: None reported MEC: Funding provided by NIH grant U19CA148537 and grant U01CA164973. MIAMI (WFPCS): ACS MOFFITT: The Moffitt group was supported by the US National Cancer Institute (R01CA128813, PI: J.Y. Park). NMHS: Funding for the Nashville Men's Health Study (NMHS) was provided by the National Institutes of Health Grant numbers: RO1CA121060. PCaP only data: The North Carolina - Louisiana Prostate Cancer Project (PCaP) is carried out as a collaborative study supported by the Department of Defense contract DAMD 17-03-2-0052. For HCaP-NC follow-up data: The Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study is carried out as a collaborative study supported by the American Cancer Society award RSGT-08-008-01-CPHPS. For studies using both PCaP and HCaP-NC follow-up data please use: The North Carolina - Louisiana Prostate Cancer Project (PCaP) and the Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study are carried out as collaborative studies supported by the Department of Defense contract DAMD 17-03-2-0052 and the American Cancer Society award RSGT-08-008-01-CPHPS, respectively. For any PCaP data, please include: The authors thank the staff, advisory committees and research subjects participating in the PCaP study for their important contributions. For studies using PCaP DNA/genotyping data, please include: We would like to acknowledge the UNC BioSpecimen Facility and LSUHSC Pathology Lab for our DNA extractions, blood processing, storage and sample disbursement (https://genome.unc.edu/bsp). For studies using PCaP tissue, please include: We would like to acknowledge the RPCI Department of Urology Tissue Microarray and Immunoanalysis Core for our tissue processing, storage and sample disbursement. For studies using HCaP-NC follow-up data, please use: The Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study is carried out as a collaborative study supported by the American Cancer Society award RSGT-08-008-01-CPHPS. The authors thank the staff, advisory committees and research subjects participating in the HCaP-NC study for their important contributions. For studies that use both PCaP and HCaP-NC, please use: The authors thank the staff, advisory committees and research subjects participating in the PCaP and HCaP-NC studies for their important contributions. PCMUS: The PCMUS study was supported by the Bulgarian National Science Fund, Ministry of Education and Science (contract DOO-119/2009; DUNK01/2-2009; DFNI-B01/28/2012) with additional support from the Science Fund of Medical University - Sofia (contract 51/2009; 8I/2009; 28/2010). PHS: The Physicians' Health Study was supported by grants CA34944, CA40360, CA097193, HL26490, and HL34595. PHS members are grateful to the participants and staff of the Physicians' Health Study and Health Professionals Follow-Up Study for their valuable contributions, as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, and WY. PLCO: This PLCO study was supported by the Intramural Research Program of the Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIHPLCO thanks Drs. Christine Berg and Philip Prorok, Division of Cancer Prevention at the National Cancer Institute, the screening center investigators and staff of the PLCO Cancer Screening Trial for their contributions to the PLCO Cancer Screening Trial. We thank Mr. Thomas Riley, Mr. Craig Williams, Mr. Matthew Moore, and Ms. Shannon Merkle at Information Management Services, Inc., for their management of the data and Ms. Barbara O'Brien and staff at Westat, Inc. for their contributions to the PLCO Cancer Screening Trial. We also thank the PLCO study participants for their contributions to making this study possible. Poland: None reported PROCAP: PROCAP was supported by the Swedish Cancer Foundation (08-708, 09-0677). PROCAP thanks and acknowledges all of the participants in the PROCAP study. We thank Carin Cavalli-Björkman and Ami Rönnberg Karlsson for their dedicated work in the collection of data. Michael Broms is acknowledged for his skilful work with the databases. KI Biobank is acknowledged for handling the samples and for DNA extraction. We acknowledge The NPCR steering group: Pär Stattin (chair), Anders Widmark, Stefan Karlsson, Magnus Törnblom, Jan Adolfsson, Anna Bill-Axelson, Ove Andrén, David Robinson, Bill Pettersson, Jonas Hugosson, Jan-Erik Damber, Ola Bratt, Göran Ahlgren, Lars Egevad, and Roy Ehrnström. PROGReSS: The PROGReSS study is founded by grants from the Spanish Ministry of Health (INT15/00070; INT16/00154; FIS PI10/00164, FIS PI13/02030; FIS PI16/00046); the Spanish Ministry of Economy and Competitiveness (PTA2014-10228-I), and Fondo Europeo de Desarrollo Regional (FEDER 2007-2013). ProMPT: Founded by CRUK, NIHR, MRC, Cambride Biomedical Research Centre ProtecT: Founded by NIHR. ProtecT and ProMPT would like to acknowledge the support of The University of Cambridge, Cancer Research UK. Cancer Research UK grants (C8197/A10123) and (C8197/A10865) supported the genotyping team. We would also like to acknowledge the support of the National Institute for Health Research which funds the Cambridge Bio-medical Research Centre, Cambridge, UK. We would also like to acknowledge the support of the National Cancer Research Prostate Cancer: Mechanisms of Progression and Treatment (PROMPT) collaborative (grant code G0500966/75466) which has funded tissue and urine collections in Cambridge. We are grateful to staff at the Welcome Trust Clinical Research Facility, Addenbrooke's Clinical Research Centre, Cambridge, UK for their help in conducting the ProtecT study. We also acknowledge the support of the NIHR Cambridge Biomedical Research Centre, the DOH HTA (ProtecT grant), and the NCRI/MRC (ProMPT grant) for help with the bio-repository. The UK Department of Health funded the ProtecT study through the NIHR Health Technology Assessment Programme (projects 96/20/06, 96/20/99). The ProtecT trial and its linked ProMPT and CAP (Comparison Arm for ProtecT) studies are supported by Department of Health, England; Cancer Research UK grant number C522/A8649, Medical Research Council of England grant number G0500966, ID 75466, and The NCRI, UK. The epidemiological data for ProtecT were generated though funding from the Southwest National Health Service Research and Development. DNA extraction in ProtecT was supported by USA Dept of Defense award W81XWH-04-1-0280, Yorkshire Cancer Research and Cancer Research UK. The authors would like to acknowledge the contribution of all members of the ProtecT study research group. The views and opinions expressed therein are those of the authors and do not necessarily reflect those of the Department of Health of England. The bio-repository from ProtecT is supported by the NCRI (ProMPT) Prostate Cancer Collaborative and the Cambridge BMRC grant from NIHR. We thank the National Institute for Health Research, Hutchison Whampoa Limited, the Human Research Tissue Bank (Addenbrooke's Hospital), and Cancer Research UK. PROtEuS: PROtEuS was supported financially through grants from the Canadian Cancer Society (13149, 19500, 19864, 19865) and the Cancer Research Society, in partnership with the Ministère de l'enseignement supérieur, de la recherche, de la science et de la technologie du Québec, and the Fonds de la recherche du Québec - Santé.PROtEuS would like to thank its collaborators and research personnel, and the urologists involved in subjects recruitment. We also wish to acknowledge the special contribution made by Ann Hsing and Anand Chokkalingam to the conception of the genetic component of PROtEuS. QLD: The QLD research is supported by The National Health and Medical Research Council (NHMRC) Australia Project Grants (390130, 1009458) and NHMRC Career Development Fellowship and Cancer Australia PdCCRS funding to J Batra. The QLD team would like to acknowledge and sincerely thank the urologists, pathologists, data managers and patient participants who have generously and altruistically supported the QLD cohort. RAPPER: RAPPER is funded by Cancer Research UK (C1094/A11728; C1094/A18504) and Experimental Cancer Medicine Centre funding (C1467/A7286). The RAPPER group thank Rebecca Elliott for project management. SABOR: The SABOR research is supported by NIH/NCI Early Detection Research Network, grant U01 CA0866402-12. Also supported by the Cancer Center Support Grant to the Cancer Therapy and Research Center from the National Cancer Institute (US) P30 CA054174. SCCS: SCCS is funded by NIH grant R01 CA092447, and SCCS sample preparation was conducted at the Epidemiology Biospecimen Core Lab that is supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA68485). Data on SCCS cancer cases used in this publication were provided by the Alabama Statewide Cancer Registry; Kentucky Cancer Registry, Lexington, KY; Tennessee Department of Health, Office of Cancer Surveillance; Florida Cancer Data System; North Carolina Central Cancer Registry, North Carolina Division of Public Health; Georgia Comprehensive Cancer Registry; Louisiana Tumor Registry; Mississippi Cancer Registry; South Carolina Central Cancer Registry; Virginia Department of Health, Virginia Cancer Registry; Arkansas Department of Health, Cancer Registry, 4815 W. Markham, Little Rock, AR 72205. The Arkansas Central Cancer Registry is fully funded by a grant from National Program of Cancer Registries, Centers for Disease Control and Prevention (CDC). Data on SCCS cancer cases from Mississippi were collected by the Mississippi Cancer Registry which participates in the National Program of Cancer Registries (NPCR) of the Centers for Disease Control and Prevention (CDC). The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the CDC or the Mississippi Cancer Registry. SCPCS: SCPCS is funded by CDC grant S1135-19/19, and SCPCS sample preparation was conducted at the Epidemiology Biospecimen Core Lab that is supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA68485). SEARCH: SEARCH is funded by a program grant from Cancer Research UK (C490/A10124) and supported by the UK National Institute for Health Research Biomedical Research Centre at the University of Cambridge. SNP_Prostate_Ghent: The study was supported by the National Cancer Plan, financed by the Federal Office of Health and Social Affairs, Belgium. SPAG: Wessex Medical ResearchHope for Guernsey, MUG, HSSD, MSG, Roger Allsopp STHM2: STHM2 was supported by grants from The Strategic Research Programme on Cancer (StratCan), Karolinska Institutet; the Linné Centre for Breast and Prostate Cancer (CRISP, number 70867901), Karolinska Institutet; The Swedish Research Council (number K2010-70X-20430-04-3) and The Swedish Cancer Society (numbers 11-0287 and 11-0624); Stiftelsen Johanna Hagstrand och Sigfrid Linnérs minne; Swedish Council for Working Life and Social Research (FAS), number 2012-0073STHM2 acknowledges the Karolinska University Laboratory, Aleris Medilab, Unilabs and the Regional Prostate Cancer Registry for performing analyses and help to retrieve data. Carin Cavalli-Björkman and Britt-Marie Hune for their enthusiastic work as research nurses. Astrid Björklund for skilful data management. We wish to thank the BBMRI.se biobank facility at Karolinska Institutet for biobank services. PCPT & SELECT are funded by Public Health Service grants U10CA37429 and 5UM1CA182883 from the National Cancer Institute. SWOG and SELECT thank the site investigators and staff and, most importantly, the participants who donated their time to this trial. TAMPERE: The Tampere (Finland) study was supported by the Academy of Finland (251074), The Finnish Cancer Organisations, Sigrid Juselius Foundation, and the Competitive Research Funding of the Tampere University Hospital (X51003). The PSA screening samples were collected by the Finnish part of ERSPC (European Study of Screening for Prostate Cancer). TAMPERE would like to thank Riina Liikanen, Liisa Maeaettaenen and Kirsi Talala for their work on samples and databases. UGANDA: None reported UKGPCS: UKGPCS would also like to thank the following for funding support: The Institute of Cancer Research and The Everyman Campaign, The Prostate Cancer Research Foundation, Prostate Research Campaign UK (now Prostate Action), The Orchid Cancer Appeal, The National Cancer Research Network UK, The National Cancer Research Institute (NCRI) UK. We are grateful for support of NIHR funding to the NIHR Biomedical Research Centre at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust. UKGPCS should also like to acknowledge the NCRN nurses, data managers, and consultants for their work in the UKGPCS study. UKGPCS would like to thank all urologists and other persons involved in the planning, coordination, and data collection of the study. ULM: The Ulm group received funds from the German Cancer Aid (Deutsche Krebshilfe). WUGS/WUPCS: WUGS would like to thank the following for funding support: The Anthony DeNovi Fund, the Donald C. McGraw Foundation, and the St. Louis Men's Group Against Cancer.
Each tumor sample was cut into three pieces, yielding two end-pieces for cryovials and a middle portion placed in 10% buffered formalin. End pieces were homogenized manually and with a paddle blender (Stomacher). All paraffin-embedded blocks, including formalin-fixed tumor samples and molecular-fixed fallopian tubes, were sectioned and stained with hematoxylin and eosin prior to expert histopathological review to confirm the presence of high grade serous carcinoma. Homogenized end pieces were then flash frozen, and RNA was extracted using the miRNeasy Mini kit. Nanodrop was used to assess quality (260/280) and quantity. Total RNA samples were also QC checked using the Caliper HT RNA HiSens assay. Samples ranging from 60-255ng RNA were re-arrayed into a 96-well plate. 5'-RACE PCR was carried out as described in "The interface of malignant and immunologic clonal dynamics in high-grade serous ovarian cancer" (Zhang et al.). Briefly, this involved first round and nested PCR with TRB (TCR beta chain) and IGH (immunoglobulin heavy chain) gene-specific primers. The indexed libraries were sequenced on the Illumina HiSeq platform with paired-end 250bp reads using v2 chemistry reagents.
This administrative supplement to the project, "The Genetic Epidemiology of Asthma in Costa Rica" (R37 HL066289) is in response to NOT-HL-14-029 to perform whole genome sequencing (WGS) on existing NHLBI populations. We focus on asthma because of its public health significance. Asthma affects 26 million U.S. children and adults, remains a major cause of morbidity (one-half million hospitalizations a year) and is the most common cause of school and work days lost. Asthma-related costs are estimated to be over $12.7 billion annually. The Asthma Probands for both the extended pedigrees and the trios utilized in this study were selected on the basis of a physician diagnosis of asthma; a history of recurrent asthma attacks or at least 2 respiratory symptoms; and either airway hyperresponsiveness to methacholine or significant response to bronchodilator (Albuterol) administration. These criteria are identical to the criteria used in the Childhood Asthma Management Program (CAMP). The three primary goals of this project are to: (1) identify common and rare genetic variants that determine asthma and its associated phenotypes (height, weight, IgE level, lung function, bronchodilator response, steroid treatment response) through whole genome sequencing (WGS); (2) perform novel family based association analysis of our WGS data to identify novel genes for asthma; and (3) integrate epigenomic and transcriptomic data with our WGS data and determine the epistatic interactions present using systems genomics approaches. Identification of the molecular determinants of asthma remains an important priority in translational science. Genome-wide association studies (GWAS) have been successful in this regard, identifying at least 10 novel susceptibility genes for asthma. However, as with most complex traits, the variants identified by GWAS explain only a fraction of the estimated heritability of this disorder. Herein, we propose a novel family-based study design and state-of-the-art genome sequencing techniques to map a set of sequence variants for asthma and its associated phenotypes and assess the interrelationships of the identified genes and variants using systems genomics methods. We have assembled a team of investigators highly-skilled and expert in whole genome sequencing (Drs. Michael Cho and Benjamin Raby), genetic association analysis (Drs. Scott T. Weiss, Jessica Lasky-Su and Christoph Lange), integrative genomics (Drs. Raby, Kelan Tantisira, Augusto Litonjua and Dawn DeMeo), and systems genomics (Drs. Weiss, Amitabh Sharma, Lange and Raby) to address this important problem with both a novel study design and data set.