De novo loss-of-function (LoF) variants in the KMT2A gene are associated with Wiedemann-Steiner Syndrome (WSS). Recently, de novo KMT2A variants have been identified in sequencing studies of cohorts of individuals with neurodevelopmental disorders (NDDs). However, most of these studies lack the detailed clinical information required to determine whether those individuals have isolated NDDs or WSS (i.e. syndromic NDDs). We performed thorough clinical and neurodevelopmental phenotyping on six individuals with de novo KMT2A variants. From these data, we found that all six patients met clinical criteria for WSS and we further define the neurodevelopmental phenotypes associated with KMT2A variants and WSS. In particular, we identified a subtype of Autism Spectrum Disorder (ASD) in five individuals, characterized by marked rigid, repetitive and inflexible behaviours, emotional dysregulation, externalizing behaviours, but relative social motivation. To further explore the clinical spectrum associated with KMT2A variants, we also conducted a meta-analysis of individuals with KMT2A variants reported in the published literature. We found that de novo LoF or missense variants in KMT2A were significantly more prevalent than predicted by a previously established statistical model of de novo mutation rate for KMT2A. Our genotype-phenotype findings better define the clinical spectrum associated with KMT2A variants and suggest that individuals with de novo LoF and missense variants likely have a clinically unrecognized diagnosis of WSS, rather than isolated NDD or ASD alone. This highlights the importance of a clinical genetic and neurodevelopmental assessment for individuals with such variants in KMT2A.
Genetic risk factors for ulcerative colitis are not well understood. We performed a family-based whole exome sequencing analysis to identify potential causal mutations. In the proband, we found a heterozygous de novo mutation in the gene HSPA1L. Our results indicate that the de novo mutation in HSPA1L is associated with ulcerative colitis.
This project is to explore the contribution of de novo mutations to severe structural malformations diagnosed prenatally using ultrasound. These malformations include heart, CNS, renal and GI abnormalities. In this pilot project we aim to exome sequence 30 parent-foetus trios to ~50X mean coverage and identify de novo functional variants using an algorithm developed in the Hurles group
Well-differentiated (WD) and de-differentiated (DD) liposarcoma, subtypes of adipocytic sarcomas, are pathologically and clinically dissimilar, but are poorly distinguishable at the molecular level. These tumors harbor neochromosomes formed from amplifications and rearrangements of chr12q. Nineteen selected patients with matched WD and DD tumors underwent extensive exomic and transcriptomic profiling to distinguish genomic features between the two subtypes. Shared point mutations suggest a common tumor origin and de-differentiated tumors have higher burdens of deletions.
CD4+ regulatory T cells (Tregs) are key mediators of immunological tolerance and promising effector cells for immuno-suppressive adoptive cellular therapy to fight autoimmunity and chronic inflammation. Their functional stability is critical for their clinical utility and has been correlated to the demethylated state of the TSDR/CNS2 enhancer element in the Treg lineage transcription factor FOXP3. However, proof for a causal contribution of the TSDR de-methylation to FOXP3 stability and Treg induction is so far lacking. We here established a powerful transient-transfection CRISPR-Cas9-based epigenetic-editing method for the selective de-methylation of the TSDR within the endogenous chromatin environment of a living cell. The induced de-methylated state was stable over weeks in clonal T cell proliferation cultures even after expression of the editing complex had ceased. Epigenetic editing of the TSDR resulted in FOXP3 expression, even in its physiological isoform distribution, proving a causal role for the de-methylated TSDR in FOXP3 regulation. However, successful FOXP3 induction was not associated with a switch towards a functional Treg phenotype, in contrast to what has been reported from FOXP3 overexpression approaches. Thus, TSDR de-methylation is required, but not sufficient for a stable Treg phenotype induction. Therefore, targeted demethylation of the TSDR may be a critical addition to published in vitro Treg induction protocols which so far lack FOXP3 stability.
We performed whole-genome sequencing of 8 trios with intellectual disability using Pacbio HiFi long-reads. We then called variants with PBSV and Deepvariant. We filtered for high-quality de novo variants and validated them.
This is the PacBio long read data used for performing de novo assembly of the EGYPT individual (mapped against GRCh38).
Mixed exocrine and purified ductal and de-differentiated acinar human cells obtained from normal pancreases. Ductal and de-differentiatedacinar cells where isolated by FACS after 4 days culture of the exocrine mixed population
Fastq files from RNA sequencing of normal and tumor samples from 16 patients from the de novo metastatic prostate cancer cohort.
10x Genomics linked read data used in variant phasing and de novo assembly scaffolding for the EGYPT individual (mapped against GRCh38).
In this study, we obtained DNA from the Coriell Repository from individuals with complex chromosomal changes. The database ("Chromosome Abnormalities") of these de-identified samples contains 1,366 individuals as per https://www.coriell.org/1/NIGMS. We started with 16 individuals in this pilot study that we will hopefully expand out in a potential large NIH grant using information derived in this pilot study. De-identified samples were sequenced using internal funding from Washington University in St. Louis. A description of the sequencing we did on these individuals is provided in our medRxiv preprint at https://www.medrxiv.org/content/10.1101/2025.03.28.25324850v1.
Small cell prostatic carcinoma (SCPC) is an aggressive pathology that is managed similar to small cell lung cancer. SCPC can evolve from prostatic adenocarcinoma in response to androgen deprivation therapy but in rare cases is present at initial cancer diagnosis. The molecular etiology of de novo SCPC is incompletely understood due to the scarcity of tumor tissue and the brief life expectancy of patients. Through a retrospective search of our regional oncology pharmacy database, we identified 18 patients diagnosed with de novo SCPC between 2004 and 2017. Ten patients had pure SCPC pathology and the remainder had some admixed adenocarcinoma foci, but all were treated with first line platinum-based chemotherapy. Median overall survival was 28 months. We performed targeted DNA sequencing, whole exome sequencing and mRNA profiling on formalin-fixed paraffin embedded archival tumor tissue. We observed frequent biallelic deletion and/or mutation of tumor suppressors TP53, RB1 and PTEN, similar to treatment related SCPC. Indeed, at the RNA level pure de novo SCPC closely resembled treatment-related SCPC. However, five patients had biallelic loss of DNA repair genes including BRCA1, BRCA2, ATM and MSH2/6: potentially underlying the high genomic instability of this rare disease variant. Two patients with pure de novo SCPC harbored ETS gene rearrangements to androgen-driven promoters, consistent with evolution of de novo SCPC from an androgen-driven ancestor. Overall, our results reveal a highly aggressive molecular landscape underlies this unusual pathologic variant, and propose opportunities for targeted therapy strategies in a disease with few treatment options.
Effects of accidental and prolonged radiation exposure on the human germline remain a topic of medical interest. Transgenerational signatures of ionizing radiation exposure are primarily implicated to be clustered de-novo mutations (cDNMs, multiple de-novo mutations within 20bp). We sequenced the whole genome of 110 children from 70 families (Radar cohort). The father in this cohort were exposed to 0-353mSv of ionizing radiation and serve as exposed samples.
Data Access Committee means the following members of the Institute of Experimental Immunology, UZH, and investigators of the publication, to which the Data Access refers and who are responsible for making the data access decisions for the genotype and gene expression generated by UZH or respectively the epigenetic data, which had been generated at Karolinska Institute. Christian Münz, PhD; Co-Director of Institute of Experimental Immunology, University of Zurich Roland Martin, MD; Affiliated Group Leader (to Institute Münz) of Institute of Experimental Immunology Institute of Experimental Immunology Ivan Jelcic, PhD, Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland Maja Jagodic, PhD, Department of Clinical Neuroscience, Karolinska Institutet, Center for Molecular Medicine, Karolinska University Hospital, Stockholm, Sweden
Sacituzumab Govitecan (SG), a novel antibody drug conjugate (ADC) incorporating the anti-TROP2 antibody hRS7 conjugated to a topoisomerase-1 inhibitor (SN-38) payload, is the first ADC to be approved for advanced triple negative breast cancer (TNBC). However, mechanisms governing therapeutic resistance to SG are not known. We sought to identify mechanisms of de novo and acquired resistance to SG through unbiased whole-exome sequencing (WES) and RNA sequencing analysis of pre-treatment and multi-site post-progression (autopsy) tumor specimens. We examined three metastatic TNBC cases exhibiting (1) de novo progression, (2) stable disease, and (3) a deep response followed by progression, then mapped the temporal and spatial genomic evolution of acquired resistance in the responding patient. We then conducted additional pre-clinical experiments to validate the observed resistance mechanisms. TROP2 RNA and gene copy number were associated with de novo resistance, as case (1) was found to have absent TROP2 expression in all specimens, case (2) expressed TROP2, while case (3) exhibited both expression and focal genomic amplification of the TACSTD2/TROP2 locus. The genomic phylogeny tree inferred from case (3) post-progression specimens revealed one branch harboring an acquired canonical E418K resistance mutation in TOP1 (encoding topoisomerase 1) and a subsequent sub-clonal TOP1 inactivating frameshift mutation, while a distinct branch exhibited acquisition of a novel T256R missense mutation in TACSTD2 (encoding TROP2). Both the TOP1- and TACSTD2-mutant clones seeded multiple distinct metastatic sites. Through reconstitution experiments in TROP2 negative cells we found that TROP2 T256R is a stable protein with defective cell membrane localization and reduced cell surface binding by RS7 compared to wild-type TROP2. Collectively, these findings underscore TROP2 as a determinant of initial response to SG, and they reveal parallel and mutually exclusive polyclonal molecular mechanisms of acquired resistance involving the direct antibody target and drug payload target in distinct metastatic subclones of a single patient. While further research is needed to extend these novel findings, this study highlights the specificity of SG and illustrates how identifying such mechanisms will inform rational therapeutic strategies to overcome ADC resistance.
De- and trans-differentiation is a rare and only poorly understood phenomenon in cutaneous melanoma. To study this disease more comprehensively we have retrieved 11 primary cutaneous melanomas from our pathology archives showing biphasic features characterized by a conventional melanoma and additional areas of de-/trans-differentiation as defined by a lack of immunohistochemical expression of all conventional melanocytic markers (S-100 protein, SOX10, Melan-A and HMB-45). The clinical, histologic and immunohistochemical findings were recorded and follow-up was obtained. The patients were mostly elderly (median: 81 years; range: 42-86 years) without significant gender predilection, and the sun-exposed skin of the head and neck area was most commonly affected. The tumors were deeply invasive with a mean tumor thickness of 7 mm (range: 4-80 mm). The dedifferentiated component showed atypical fibroxanthoma-like features in the majority (7), while additional rhabdomyosarcomatous and epithelial transdifferentiation was noted histologically and/or immunohistochemically in two tumors each. The background conventional melanoma component was of desmoplastic (4), superficial spreading (3), nodular (2), lentigo maligna (1) or spindle cell (1) types. For the 7 patients with available follow-up data (median follow-up period of 25 months; range: 8-36 months), 2 died from their disease and 3 developed metastases. Next-generation sequencing of the cohort revealed somatic mutation of established melanoma drivers including mainly NF1 mutations in the conventional component (5 cases), which were also detected in the corresponding de-/trans-differentiated components. In summary, the diagnosis of de-/trans-differentiated melanoma is challenging and depends on the morphologic identification of the conventional melanoma component. Molecular analysis is diagnostically helpful as the mutated gene profile is shared between the conventional and de-/trans-differentiated components. Importantly, de-/trans-differentiation does not appear to confer a more aggressive behavior.
Original description of the study: From ELLIPSE (linked to the PRACTICAL consortium), we contributed ~78,000 SNPs to the OncoArray. A large fraction of the content was derived from the GWAS meta-analyses in European ancestry populations (overall and aggressive disease; ~27K SNPs). We also selected just over 10,000 SNPs from the meta-analyses in the non-European populations, with a majority of these SNPs coming from the analysis of overall prostate cancer in African ancestry populations as well as from the multiethnic meta-analysis. A substantial fraction of SNPs (~28,000) were also selected for fine-mapping of 53 loci not included in the common fine-mapping regions (tagging at r2>0.9 across ±500kb regions). We also selected a few thousand SNPs related with PSA levels and/or disease survival as well as SNPs from candidate lists provided by study collaborators, as well as from meta-analyses of exome SNP chip data from the Multiethnic Cohort and UK studies. The Contributing Studies: Aarhus: Hospital-based, Retrospective, Observational. Source of cases: Patients treated for prostate adenocarcinoma at Department of Urology, Aarhus University Hospital, Skejby (Aarhus, Denmark). Source of controls: Age-matched males treated for myocardial infarction or undergoing coronary angioplasty, but with no prostate cancer diagnosis based on information retrieved from the Danish Cancer Register and the Danish Cause of Death Register. AHS: Nested case-control study within prospective cohort. Source of cases: linkage to cancer registries in study states. Source of controls: matched controls from cohort ATBC: Prospective, nested case-control. Source of cases: Finnish male smokers aged 50-69 years at baseline. Source of controls: Finnish male smokers aged 50-69 years at baseline BioVu: Cases identified in a biobank linked to electronic health records. Source of cases: A total of 214 cases were identified in the VUMC de-identified electronic health records database (the Synthetic Derivative) and shipped to USC for genotyping in April 2014. The following criteria were used to identify cases: Age 18 or greater; male; African Americans (Black) only. Note that African ancestry is not self-identified, it is administratively or third-party assigned (which has been shown to be highly correlated with genetic ancestry for African Americans in BioVU; see references). Source of controls: Controls were identified in the de-identified electronic health record. Unfortunately, they were not age matched to the cases, and therefore cannot be used for this study. Canary PASS: Prospective, Multi-site, Observational Active Surveillance Study. Source of cases: clinic based from Beth Israel Deaconness Medical Center, Eastern Virginia Medical School, University of California at San Francisco, University of Texas Health Sciences Center San Antonio, University of Washington, VA Puget Sound. Source of controls: N/A CCI: Case series, Hospital-based. Source of cases: Cases identified through clinics at the Cross Cancer Institute. Source of controls: N/A CerePP French Prostate Cancer Case-Control Study (ProGene): Case-Control, Prospective, Observational, Hospital-based. Source of cases: Patients, treated in French departments of Urology, who had histologically confirmed prostate cancer. Source of controls: Controls were recruited as participating in a systematic health screening program and found unaffected (normal digital rectal examination and total PSA < 4 ng/ml, or negative biopsy if PSA > 4 ng/ml). COH: hospital-based cases and controls from outside. Source of cases: Consented prostate cancer cases at City of Hope. Source of controls: Consented unaffected males that were part of other studies where they consented to have their DNA used for other research studies. COSM: Population-based cohort. Source of cases: General population. Source of controls: General population CPCS1: Case-control - Denmark. Source of cases: Hospital referrals. Source of controls: Copenhagen General Population Study CPCS2: Source of cases: Hospital referrals. Source of controls: Copenhagen General Population Study CPDR: Retrospective cohort. Source of cases: Walter Reed National Military Medical Center. Source of controls: Walter Reed National Military Medical Center ACS_CPS-II: Nested case-control derived from a prospective cohort study. Source of cases: Identified through self-report on follow-up questionnaires and verified through medical records or cancer registries, identified through cancer registries or the National Death Index (with prostate cancer as the primary cause of death). Source of controls: Cohort participants who were cancer-free at the time of diagnosis of the matched case, also matched on age (±6 mo) and date of biospecimen donation (±6 mo). EPIC: Case-control - Germany, Greece, Italy, Netherlands, Spain, Sweden, UK. Source of cases: Identified through record linkage with population-based cancer registries in Italy, the Netherlands, Spain, Sweden and UK. In Germany and Greece, follow-up is active and achieved through checks of insurance records and cancer and pathology registries as well as via self-reported questionnaires; self-reported incident cancers are verified through medical records. Source of controls: Cohort participants without a diagnosis of cancer EPICAP: Case-control, Population-based, ages less than 75 years at diagnosis, Hérault, France. Source of cases: Prostate cancer cases in all public hospitals and private urology clinics of département of Hérault in France. Cases validation by the Hérault Cancer Registry. Source of controls: Population-based controls, frequency age matched (5-year groups). Quotas by socio-economic status (SES) in order to obtain a distribution by SES among controls identical to the SES distribution among general population men, conditionally to age. ERSPC: Population-based randomized trial. Source of cases: Men with PrCa from screening arm ERSPC Rotterdam. Source of controls: Men without PrCa from screening arm ERSPC Rotterdam ESTHER: Case-control, Prospective, Observational, Population-based. Source of cases: Prostate cancer cases in all hospitals in the state of Saarland, from 2001-2003. Source of controls: Random sample of participants from routine health check-up in Saarland, in 2000-2002 FHCRC: Population-based, case-control, ages 35-74 years at diagnosis, King County, WA, USA. Source of cases: Identified through the Seattle-Puget Sound SEER cancer registry. Source of controls: Randomly selected, age-frequency matched residents from the same county as cases Gene-PARE: Hospital-based. Source of cases: Patients that received radiotherapy for treatment of prostate cancer. Source of controls: n/a Hamburg-Zagreb: Hospital-based, Prospective. Source of cases: Prostate cancer cases seen at the Department of Oncology, University Hospital Center Zagreb, Croatia. Source of controls: Population-based (Croatia), healthy men, older than 50, with no medical record of cancer, and no family history of cancer (1st & 2nd degree relatives) HPFS: Nested case-control. Source of cases: Participants of the HPFS cohort. Source of controls: Participants of the HPFS cohort IMPACT: Observational. Source of cases: Carriers and non-carriers (with a known mutation in the family) of the BRCA1 and BRCA2 genes, aged between 40 and 69, who are undergoing prostate screening with annual PSA testing. This cohort has been diagnosed with prostate cancer during the study. Source of controls: Carriers and non-carriers (with a known mutation in the family) of the BRCA1 and BRCA2 genes, aged between 40 and 69, who are undergoing prostate screening with annual PSA testing. This cohort has not been diagnosed with prostate cancer during the study. IPO-Porto: Hospital-based. Source of cases: Early onset and/or familial prostate cancer. Source of controls: Blood donors Karuprostate: Case-control, Retrospective, Population-based. Source of cases: From FWI (Guadeloupe): 237 consecutive incident patients with histologically confirmed prostate cancer attending public and private urology clinics; From Democratic Republic of Congo: 148 consecutive incident patients with histologically confirmed prostate cancer attending the University Clinic of Kinshasa. Source of controls: From FWI (Guadeloupe): 277 controls recruited from men participating in a free systematic health screening program open to the general population; From Democratic Republic of Congo: 134 controls recruited from subjects attending the University Clinic of Kinshasa KULEUVEN: Hospital-based, Prospective, Observational. Source of cases: Prostate cancer cases recruited at the University Hospital Leuven. Source of controls: Healthy males with no history of prostate cancer recruited at the University Hospitals, Leuven. LAAPC: Subjects were participants in a population-based case-control study of aggressive prostate cancer conducted in Los Angeles County. Cases were identified through the Los Angeles County Cancer Surveillance Program rapid case ascertainment system. Eligible cases included African American, Hispanic, and non-Hispanic White men diagnosed with a first primary prostate cancer between January 1, 1999 and December 31, 2003. Eligible cases also had (a) prostatectomy with documented tumor extension outside the prostate, (b) metastatic prostate cancer in sites other than prostate, (c) needle biopsy of the prostate with Gleason grade ≥8, or (d) needle biopsy with Gleason grade 7 and tumor in more than two thirds of the biopsy cores. Eligible controls were men never diagnosed with prostate cancer, living in the same neighborhood as a case, and were frequency matched to cases on age (± 5 y) and race/ethnicity. Controls were identified by a neighborhood walk algorithm, which proceeds through an obligatory sequence of adjacent houses or residential units beginning at a specific residence that has a specific geographic relationship to the residence where the case lived at diagnosis. Malaysia: Case-control. Source of cases: Patients attended the outpatient urology or uro-onco clinic at University Malaya Medical Center. Source of controls: Population-based, age matched (5-year groups), ascertained through electoral register, Subang Jaya, Selangor, Malaysia MCC-Spain: Case-control. Source of cases: Identified through the urology departments of the participating hospitals. Source of controls: Population-based, frequency age and region matched, ascertained through the rosters of the primary health care centers MCCS: Nested case-control, Melbourne, Victoria. Source of cases: Identified by linkage to the Victorian Cancer Registry. Source of controls: Cohort participants without a diagnosis of cancer MD Anderson: Participants in this study were identified from epidemiological prostate cancer studies conducted at the University of Texas MD Anderson Cancer Center in the Houston Metropolitan area. Cases were accrued in the Houston Medical Center and were not restricted with respect to Gleason score, stage or PSA. Controls were identified via random-digit-dialing or among hospital visitors and they were frequency matched to cases on age and race. Lifestyle, demographic, and family history data were collected using a standardized questionnaire. MDACC_AS: A prospective cohort study. Source of cases: Men with clinically organ-confined prostate cancer meeting eligibility criteria for a prospective cohort study of active surveillance at MD Anderson Cancer Center. Source of controls: N/A MEC: The Multiethnic Cohort (MEC) is comprised of over 215,000 men and women recruited from Hawaii and the Los Angeles area between 1993 and 1996. Between 1995 and 2006, over 65,000 blood samples were collected from participants for genetic analyses. To identify incident cancer cases, the MEC was cross-linked with the population-based Surveillance, Epidemiology and End Results (SEER) registries in California and Hawaii, and unaffected cohort participants with blood samples were selected as controls MIAMI (WFPCS): Prostate cancer cases and controls were recruited from the Departments of Urology and Internal Medicine of the Wake Forest University School of Medicine using sequential patient populations as described previously (PMID:15342424). All study subjects received a detailed description of the study protocol and signed their informed consent, as approved by the medical center's Institutional Review Board. The general eligibility criteria were (i) able to comprehend informed consent and (ii) without previously diagnosed cancer. The exclusion criteria were (i) clinical diagnosis of autoimmune diseases; (ii) chronic inflammatory conditions; and (iii) infections within the past 6 weeks. Blood samples were collected from all subjects. MOFFITT: Hospital-based. Source of cases: clinic based from Moffitt Cancer Center. Source of controls: Moffitt Cancer Center affiliated Lifetime cancer screening center NMHS: Case-control, clinic based, Nashville TN. Source of cases: All urology clinics in Nashville, TN. Source of controls: Men without prostate cancer at prostate biopsy. PCaP: The North Carolina-Louisiana Prostate Cancer Project (PCaP) is a multidisciplinary population-based case-only study designed to address racial differences in prostate cancer through a comprehensive evaluation of social, individual and tumor level influences on prostate cancer aggressiveness. PCaP enrolled approximately equal numbers of African Americans and Caucasian Americans with newly-diagnosed prostate cancer from North Carolina (42 counties) and Louisiana (30 parishes) identified through state tumor registries. African American PCaP subjects with DNA, who agreed to future use of specimens for research, participated in OncoArray analysis. PCMUS: Case-control - Sofia, Bulgaria. Source of cases: Patients of Clinic of Urology, Alexandrovska University Hospital, Sofia, Bulgaria, PrCa histopathologically confirmed. Source of controls: 72 patients with verified BPH and PSA<3,5; 78 healthy controls from the MMC Biobank, no history of PrCa PHS: Nested case-control. Source of cases: Participants of the PHS1 trial/cohort. Source of controls: Participants of the PHS1 trial/cohort PLCO: Nested case-control. Source of cases: Men with a confirmed diagnosis of prostate cancer from the PLCO Cancer Screening Trial. Source of controls: Controls were men enrolled in the PLCO Cancer Screening Trial without a diagnosis of cancer at the time of case ascertainment. Poland: Case-control. Source of cases: men with unselected prostate cancer, diagnosed in north-western Poland at the University Hospital in Szczecin. Source of controls: cancer-free men from the same population, taken from the healthy adult patients of family doctors in the Szczecin region PROCAP: Population-based, Retrospective, Observational. Source of cases: Cases were ascertained from the National Prostate Cancer Register of Sweden Follow-Up Study, a retrospective nationwide cohort study of patients with localized prostate cancer. Source of controls: Controls were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. PROGReSS: Hospital-based, Prospective, Observational. Source of cases: Prostate cancer cases from the Hospital Clínico Universitario de Santiago de Compostela, Galicia, Spain. Source of controls: Cancer-free men from the same population ProMPT: A study to collect samples and data from subjects with and without prostate cancer. Retrospective, Experimental. Source of cases: Subjects attending outpatient clinics in hospitals. Source of controls: Subjects attending outpatient clinics in hospitals ProtecT: Trial of treatment. Samples taken from subjects invited for PSA testing from the community at nine centers across United Kingdom. Source of cases: Subjects who have a proven diagnosis of prostate cancer following testing. Source of controls: Identified through invitation of subjects in the community. PROtEuS: Case-control, population-based. Source of cases: All new histologically-confirmed cases, aged less or equal to 75 years, diagnosed between 2005 and 2009, actively ascertained across Montreal French hospitals. Source of controls: Randomly selected from the Provincial electoral list of French-speaking men between 2005 and 2009, from the same area of residence as cases and frequency-matched on age. QLD: Case-control. Source of cases: A longitudinal cohort study (Prostate Cancer Supportive Care and Patient Outcomes Project: ProsCan) conducted in Queensland, through which men newly diagnosed with prostate cancer from 26 private practices and 10 public hospitals were directly referred to ProsCan at the time of diagnosis by their treating clinician (age range 43-88 years). All cases had histopathologically confirmed prostate cancer, following presentation with an abnormal serum PSA and/or lower urinary tract symptoms. Source of controls: Controls comprised healthy male blood donors with no personal history of prostate cancer, recruited through (i) the Australian Red Cross Blood Services in Brisbane (age range 19-76 years) and (ii) the Australian Electoral Commission (AEC) (age and post-code/ area matched to ProsCan, age range 54-90 years). RAPPER: Multi-centre, hospital based blood sample collection study in patients enrolled in clinical trials with prospective collection of radiotherapy toxicity data. Source of cases: Prostate cancer patients enrolled in radiotherapy trials: CHHiP, RT01, Dose Escalation, RADICALS, Pelvic IMRT, PIVOTAL. Source of controls: N/A SABOR: Prostate Cancer Screening Cohort. Source of cases: Men >45 yrs of age participating in annual PSA screening. Source of controls: Males participating in annual PSA prostate cancer risk evaluations (funded by NCI biomarkers discovery and validation grant), recruited through University of Texas Health Science Center at San Antonio and affiliated sites or through study advertisements, enrolment open to the community SCCS: Case-control in cohort, Southeastern USA. Prospective, Observational, Population-based. Source of cases: SCCS entry population. Source of controls: SCCS entry population SCPCS: Population-based, Retrospective, Observational. Source of cases: South Carolina Central Cancer Registry. Source of controls: Health Care Financing Administration beneficiary file SEARCH: Case-control - East Anglia, UK. Source of cases: Men < 70 years of age registered with prostate cancer at the population-based cancer registry, Eastern Cancer Registration and Information Centre, East Anglia, UK. Source of controls: Men attending general practice in East Anglia with no known prostate cancer diagnosis, frequency matched to cases by age and geographic region SNP_Prostate_Ghent: Hospital-based, Retrospective, Observational. Source of cases: Men treated with IMRT as primary or postoperative treatment for prostate cancer at the Ghent University Hospital between 2000 and 2010. Source of controls: Employees of the University hospital and members of social activity clubs, without a history of any cancer. SPAG: Hospital-based, Retrospective, Observational. Source of cases: Guernsey. Source of controls: Guernsey STHM2: Population-based, Retrospective, Observational. Source of cases: Cases were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. Source of controls: Controls were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. PCPT: Case-control from a randomized clinical trial. Source of cases: Randomized clinical trial. Source of controls: Randomized clinical trial SELECT: Case-cohort from a randomized clinical trial. Source of cases: Randomized clinical trial. Source of controls: Randomized clinical trial TAMPERE: Case-control - Finland, Retrospective, Observational, Population-based. Source of cases: Identified through linkage to the Finnish Cancer Registry and patient records; and the Finnish arm of the ERSPC study. Source of controls: Cohort participants without a diagnosis of cancer UGANDA: Uganda Prostate Cancer Study: Uganda is a case-control study of prostate cancer in Kampala Uganda that was initiated in 2011. Men with prostate cancer were enrolled from the Urology unit at Mulago Hospital and men without prostate cancer (i.e. controls) were enrolled from other clinics (i.e. surgery) at the hospital. UKGPCS: ICR, UK. Source of cases: Cases identified through clinics at the Royal Marsden hospital and nationwide NCRN hospitals. Source of controls: Ken Muir's control- 2000 ULM: Case-control - Germany. Source of cases: familial cases (n=162): identified through questionnaires for family history by collaborating urologists all over Germany; sporadic cases (n=308): prostatectomy series performed in the Clinic of Urology Ulm between 2012 and 2014. Source of controls: age-matched controls (n=188): age-matched men without prostate cancer and negative family history collected in hospitals of Ulm WUGS/WUPCS: Cases Series, USA. Source of cases: Identified through clinics at Washington University in St. Louis. Source of controls: Men diagnosed and managed with prostate cancer in University based clinic. Acknowledgement Statements: Aarhus: This study was supported by the Danish Strategic Research Council (now Innovation Fund Denmark) and the Danish Cancer Society. The Danish Cancer Biobank (DCB) is acknowledged for biological material. AHS: This work was supported by the Intramural Research Program of the NIH, National Cancer Institute, Division of Cancer Epidemiology and Genetics (Z01CP010119). ATBC: This research was supported in part by the Intramural Research Program of the NIH and the National Cancer Institute. Additionally, this research was supported by U.S. Public Health Service contracts N01-CN-45165, N01-RC-45035, N01-RC-37004, HHSN261201000006C, and HHSN261201500005C from the National Cancer Institute, Department of Health and Human Services. BioVu: The dataset(s) used for the analyses described were obtained from Vanderbilt University Medical Center's BioVU which is supported by institutional funding and by the National Center for Research Resources, Grant UL1 RR024975-01 (which is now at the National Center for Advancing Translational Sciences, Grant 2 UL1 TR000445-06). Canary PASS: PASS was supported by Canary Foundation and the National Cancer Institute's Early Detection Research Network (U01 CA086402) CCI: This work was awarded by Prostate Cancer Canada and is proudly funded by the Movember Foundation - Grant # D2013-36.The CCI group would like to thank David Murray, Razmik Mirzayans, and April Scott for their contribution to this work. CerePP French Prostate Cancer Case-Control Study (ProGene): None reported COH: SLN is partially supported by the Morris and Horowitz Families Endowed Professorship COSM: The Swedish Research Council, the Swedish Cancer Foundation CPCS1 & CPCS2: Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev Ringvej 75, DK-2730 Herlev, DenmarkCPCS1 would like to thank the participants and staff of the Copenhagen General Population Study for their important contributions. CPDR: Uniformed Services University for the Health Sciences HU0001-10-2-0002 (PI: David G. McLeod, MD) CPS-II: The American Cancer Society funds the creation, maintenance, and updating of the Cancer Prevention Study II cohort. CPS-II thanks the participants and Study Management Group for their invaluable contributions to this research. We would also like to acknowledge the contribution to this study from central cancer registries supported through the Centers for Disease Control and Prevention National Program of Cancer Registries, and cancer registries supported by the National Cancer Institute Surveillance Epidemiology and End Results program. EPIC: The coordination of EPIC is financially supported by the European Commission (DG-SANCO) and the International Agency for Research on Cancer. The national cohorts are supported by the Danish Cancer Society (Denmark); the Deutsche Krebshilfe, Deutsches Krebsforschungszentrum and Federal Ministry of Education and Research (Germany); the Hellenic Health Foundation, Greek Ministry of Health; Greek Ministry of Education (Greece); the Italian Association for Research on Cancer (AIRC) and National Research Council (Italy); the Dutch Ministry of Public Health, Welfare and Sports (VWS), Netherlands Cancer Registry (NKR), LK Research Funds, Dutch Prevention Funds, Dutch ZON (Zorg Onderzoek Nederland), World Cancer Research Fund (WCRF); the Statistics Netherlands (The Netherlands); the Health Research Fund (FIS), Regional Governments of Andalucía, Asturias, Basque Country, Murcia and Navarra, Spanish Ministry of Health ISCIII RETIC (RD06/0020), Red de Centros RCESP, C03/09 (Spain); the Swedish Cancer Society, Swedish Scientific Council and Regional Government of Skåne and Västerbotten, Fundacion Federico SA (Sweden); the Cancer Research UK, Medical Research Council (United Kingdom). EPICAP: The EPICAP study was supported by grants from Ligue Nationale Contre le Cancer, Ligue départementale du Val de Marne; Fondation de France; Agence Nationale de sécurité sanitaire de l'alimentation, de l'environnement et du travail (ANSES). The EPICAP study group would like to thank all urologists, Antoinette Anger and Hasina Randrianasolo (study monitors), Anne-Laure Astolfi, Coline Bernard, Oriane Noyer, Marie-Hélène De Campo, Sandrine Margaroline, Louise N'Diaye, and Sabine Perrier-Bonnet (Clinical Research nurses). ERSPC: This study was supported by the DutchCancerSociety (KWF94-869,98-1657,2002-277,2006-3518, 2010-4800), The Netherlands Organisation for Health Research and Development (ZonMW-002822820, 22000106, 50-50110-98-311, 62300035), The Dutch Cancer Research Foundation (SWOP), and an unconditional grant from Beckman-Coulter-HybritechInc. ESTHER: The ESTHER study was supported by a grant from the Baden Württemberg Ministry of Science, Research and Arts. The ESTHER group would like to thank Hartwig Ziegler, Sonja Wolf, Volker Hermann, Heiko Müller, Karina Dieffenbach, Katja Butterbach for valuable contributions to the study. FHCRC: The FHCRC studies were supported by grants R01-CA056678, R01-CA082664, and R01-CA092579 from the US National Cancer Institute, National Institutes of Health, with additional support from the Fred Hutchinson Cancer Research Center. FHCRC would like to thank all the men who participated in these studies. Gene-PARE: The Gene-PARE study was supported by grants 1R01CA134444 from the U.S. National Institutes of Health, PC074201 and W81XWH-15-1-0680 from the Prostate Cancer Research Program of the Department of Defense and RSGT-05-200-01-CCE from the American Cancer Society. Hamburg-Zagreb: None reported HPFS: The Health Professionals Follow-up Study was supported by grants UM1CA167552, CA133891, CA141298, and P01CA055075. HPFS are grateful to the participants and staff of the Physicians' Health Study and Health Professionals Follow-Up Study for their valuable contributions, as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, and WY. IMPACT: The IMPACT study was funded by The Ronald and Rita McAulay Foundation, CR-UK Project grant (C5047/A1232), Cancer Australia, AICR Netherlands A10-0227, Cancer Australia and Cancer Council Tasmania, NIHR, EU Framework 6, Cancer Councils of Victoria and South Australia, and Philanthropic donation to Northshore University Health System. We acknowledge support from the National Institute for Health Research (NIHR) to the Biomedical Research Centre at The Institute of Cancer Research and Royal Marsden Foundation NHS Trust. IMPACT acknowledges the IMPACT study steering committee, collaborating centres, and participants. IPO-Porto: The IPO-Porto study was funded by Fundaçäo para a Ciência e a Tecnologia (FCT; UID/DTP/00776/2013 and PTDC/DTP-PIC/1308/2014) and by IPO-Porto Research Center (CI-IPOP-16-2012 and CI-IPOP-24-2015). MC and MPS are research fellows from Liga Portuguesa Contra o Cancro, Núcleo Regional do Norte. SM is a research fellow from FCT (SFRH/BD/71397/2010). IPO-Porto would like to express our gratitude to all patients and families who have participated in this study. Karuprostate: The Karuprostate study was supported by the the Frech National Health Directorate and by the Association pour la Recherche sur les Tumeurs de la ProstateKarusprostate thanks Séverine Ferdinand. KULEUVEN: F.C. and S.J. are holders of grants from FWO Vlaanderen (G.0684.12N and G.0830.13N), the Belgian federal government (National Cancer Plan KPC_29_023), and a Concerted Research Action of the KU Leuven (GOA/15/017). TVDB is holder of a doctoral fellowship of the FWO. LAAPC: This study was funded by grant R01CA84979 (to S.A. Ingles) from the National Cancer Institute, National Institutes of Health. Malaysia: The study was funded by the University Malaya High Impact Research Grant (HIR/MOHE/MED/35). Malaysia thanks all associates in the Urology Unit, University of Malaya, Cancer Research Initiatives Foundation (CARIF) and the Malaysian Men's Health Initiative (MMHI). MCCS: MCCS cohort recruitment was funded by VicHealth and Cancer Council Victoria. The MCCS was further supported by Australian NHMRC grants 209057, 251553, and 504711, and by infrastructure provided by Cancer Council Victoria. Cases and their vital status were ascertained through the Victorian Cancer Registry (VCR) and the Australian Institute of Health and Welfare (AIHW), including the National Death Index and the Australian Cancer Database. MCC-Spain: The study was partially funded by the Accion Transversal del Cancer, approved on the Spanish Ministry Council on the 11th October 2007, by the Instituto de Salud Carlos III-FEDER (PI08/1770, PI09/00773-Cantabria, PI11/01889-FEDER, PI12/00265, PI12/01270, and PI12/00715), by the Fundación Marqués de Valdecilla (API 10/09), by the Spanish Association Against Cancer (AECC) Scientific Foundation and by the Catalan Government DURSI grant 2009SGR1489. Samples: Biological samples were stored at the Parc de Salut MAR Biobank (MARBiobanc; Barcelona) which is supported by Instituto de Salud Carlos III FEDER (RD09/0076/00036). Also sample collection was supported by the Xarxa de Bancs de Tumors de Catalunya sponsored by Pla Director d'Oncologia de Catalunya (XBTC). MCC-Spain acknowledges the contribution from Esther Gracia-Lavedan in preparing the data. We thank all the subjects who participated in the study and all MCC-Spain collaborators. MD Anderson: Prostate Cancer Case-Control Studies at MD Anderson (MDA) supported by grants CA68578, ES007784, DAMD W81XWH-07-1-0645, and CA140388. MDACC_AS: None reported MEC: Funding provided by NIH grant U19CA148537 and grant U01CA164973. MIAMI (WFPCS): ACS MOFFITT: The Moffitt group was supported by the US National Cancer Institute (R01CA128813, PI: J.Y. Park). NMHS: Funding for the Nashville Men's Health Study (NMHS) was provided by the National Institutes of Health Grant numbers: RO1CA121060. PCaP only data: The North Carolina - Louisiana Prostate Cancer Project (PCaP) is carried out as a collaborative study supported by the Department of Defense contract DAMD 17-03-2-0052. For HCaP-NC follow-up data: The Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study is carried out as a collaborative study supported by the American Cancer Society award RSGT-08-008-01-CPHPS. For studies using both PCaP and HCaP-NC follow-up data please use: The North Carolina - Louisiana Prostate Cancer Project (PCaP) and the Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study are carried out as collaborative studies supported by the Department of Defense contract DAMD 17-03-2-0052 and the American Cancer Society award RSGT-08-008-01-CPHPS, respectively. For any PCaP data, please include: The authors thank the staff, advisory committees and research subjects participating in the PCaP study for their important contributions. For studies using PCaP DNA/genotyping data, please include: We would like to acknowledge the UNC BioSpecimen Facility and LSUHSC Pathology Lab for our DNA extractions, blood processing, storage and sample disbursement (https://genome.unc.edu/bsp). For studies using PCaP tissue, please include: We would like to acknowledge the RPCI Department of Urology Tissue Microarray and Immunoanalysis Core for our tissue processing, storage and sample disbursement. For studies using HCaP-NC follow-up data, please use: The Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study is carried out as a collaborative study supported by the American Cancer Society award RSGT-08-008-01-CPHPS. The authors thank the staff, advisory committees and research subjects participating in the HCaP-NC study for their important contributions. For studies that use both PCaP and HCaP-NC, please use: The authors thank the staff, advisory committees and research subjects participating in the PCaP and HCaP-NC studies for their important contributions. PCMUS: The PCMUS study was supported by the Bulgarian National Science Fund, Ministry of Education and Science (contract DOO-119/2009; DUNK01/2-2009; DFNI-B01/28/2012) with additional support from the Science Fund of Medical University - Sofia (contract 51/2009; 8I/2009; 28/2010). PHS: The Physicians' Health Study was supported by grants CA34944, CA40360, CA097193, HL26490, and HL34595. PHS members are grateful to the participants and staff of the Physicians' Health Study and Health Professionals Follow-Up Study for their valuable contributions, as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, and WY. PLCO: This PLCO study was supported by the Intramural Research Program of the Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIHPLCO thanks Drs. Christine Berg and Philip Prorok, Division of Cancer Prevention at the National Cancer Institute, the screening center investigators and staff of the PLCO Cancer Screening Trial for their contributions to the PLCO Cancer Screening Trial. We thank Mr. Thomas Riley, Mr. Craig Williams, Mr. Matthew Moore, and Ms. Shannon Merkle at Information Management Services, Inc., for their management of the data and Ms. Barbara O'Brien and staff at Westat, Inc. for their contributions to the PLCO Cancer Screening Trial. We also thank the PLCO study participants for their contributions to making this study possible. Poland: None reported PROCAP: PROCAP was supported by the Swedish Cancer Foundation (08-708, 09-0677). PROCAP thanks and acknowledges all of the participants in the PROCAP study. We thank Carin Cavalli-Björkman and Ami Rönnberg Karlsson for their dedicated work in the collection of data. Michael Broms is acknowledged for his skilful work with the databases. KI Biobank is acknowledged for handling the samples and for DNA extraction. We acknowledge The NPCR steering group: Pär Stattin (chair), Anders Widmark, Stefan Karlsson, Magnus Törnblom, Jan Adolfsson, Anna Bill-Axelson, Ove Andrén, David Robinson, Bill Pettersson, Jonas Hugosson, Jan-Erik Damber, Ola Bratt, Göran Ahlgren, Lars Egevad, and Roy Ehrnström. PROGReSS: The PROGReSS study is founded by grants from the Spanish Ministry of Health (INT15/00070; INT16/00154; FIS PI10/00164, FIS PI13/02030; FIS PI16/00046); the Spanish Ministry of Economy and Competitiveness (PTA2014-10228-I), and Fondo Europeo de Desarrollo Regional (FEDER 2007-2013). ProMPT: Founded by CRUK, NIHR, MRC, Cambride Biomedical Research Centre ProtecT: Founded by NIHR. ProtecT and ProMPT would like to acknowledge the support of The University of Cambridge, Cancer Research UK. Cancer Research UK grants (C8197/A10123) and (C8197/A10865) supported the genotyping team. We would also like to acknowledge the support of the National Institute for Health Research which funds the Cambridge Bio-medical Research Centre, Cambridge, UK. We would also like to acknowledge the support of the National Cancer Research Prostate Cancer: Mechanisms of Progression and Treatment (PROMPT) collaborative (grant code G0500966/75466) which has funded tissue and urine collections in Cambridge. We are grateful to staff at the Welcome Trust Clinical Research Facility, Addenbrooke's Clinical Research Centre, Cambridge, UK for their help in conducting the ProtecT study. We also acknowledge the support of the NIHR Cambridge Biomedical Research Centre, the DOH HTA (ProtecT grant), and the NCRI/MRC (ProMPT grant) for help with the bio-repository. The UK Department of Health funded the ProtecT study through the NIHR Health Technology Assessment Programme (projects 96/20/06, 96/20/99). The ProtecT trial and its linked ProMPT and CAP (Comparison Arm for ProtecT) studies are supported by Department of Health, England; Cancer Research UK grant number C522/A8649, Medical Research Council of England grant number G0500966, ID 75466, and The NCRI, UK. The epidemiological data for ProtecT were generated though funding from the Southwest National Health Service Research and Development. DNA extraction in ProtecT was supported by USA Dept of Defense award W81XWH-04-1-0280, Yorkshire Cancer Research and Cancer Research UK. The authors would like to acknowledge the contribution of all members of the ProtecT study research group. The views and opinions expressed therein are those of the authors and do not necessarily reflect those of the Department of Health of England. The bio-repository from ProtecT is supported by the NCRI (ProMPT) Prostate Cancer Collaborative and the Cambridge BMRC grant from NIHR. We thank the National Institute for Health Research, Hutchison Whampoa Limited, the Human Research Tissue Bank (Addenbrooke's Hospital), and Cancer Research UK. PROtEuS: PROtEuS was supported financially through grants from the Canadian Cancer Society (13149, 19500, 19864, 19865) and the Cancer Research Society, in partnership with the Ministère de l'enseignement supérieur, de la recherche, de la science et de la technologie du Québec, and the Fonds de la recherche du Québec - Santé.PROtEuS would like to thank its collaborators and research personnel, and the urologists involved in subjects recruitment. We also wish to acknowledge the special contribution made by Ann Hsing and Anand Chokkalingam to the conception of the genetic component of PROtEuS. QLD: The QLD research is supported by The National Health and Medical Research Council (NHMRC) Australia Project Grants (390130, 1009458) and NHMRC Career Development Fellowship and Cancer Australia PdCCRS funding to J Batra. The QLD team would like to acknowledge and sincerely thank the urologists, pathologists, data managers and patient participants who have generously and altruistically supported the QLD cohort. RAPPER: RAPPER is funded by Cancer Research UK (C1094/A11728; C1094/A18504) and Experimental Cancer Medicine Centre funding (C1467/A7286). The RAPPER group thank Rebecca Elliott for project management. SABOR: The SABOR research is supported by NIH/NCI Early Detection Research Network, grant U01 CA0866402-12. Also supported by the Cancer Center Support Grant to the Cancer Therapy and Research Center from the National Cancer Institute (US) P30 CA054174. SCCS: SCCS is funded by NIH grant R01 CA092447, and SCCS sample preparation was conducted at the Epidemiology Biospecimen Core Lab that is supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA68485). Data on SCCS cancer cases used in this publication were provided by the Alabama Statewide Cancer Registry; Kentucky Cancer Registry, Lexington, KY; Tennessee Department of Health, Office of Cancer Surveillance; Florida Cancer Data System; North Carolina Central Cancer Registry, North Carolina Division of Public Health; Georgia Comprehensive Cancer Registry; Louisiana Tumor Registry; Mississippi Cancer Registry; South Carolina Central Cancer Registry; Virginia Department of Health, Virginia Cancer Registry; Arkansas Department of Health, Cancer Registry, 4815 W. Markham, Little Rock, AR 72205. The Arkansas Central Cancer Registry is fully funded by a grant from National Program of Cancer Registries, Centers for Disease Control and Prevention (CDC). Data on SCCS cancer cases from Mississippi were collected by the Mississippi Cancer Registry which participates in the National Program of Cancer Registries (NPCR) of the Centers for Disease Control and Prevention (CDC). The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the CDC or the Mississippi Cancer Registry. SCPCS: SCPCS is funded by CDC grant S1135-19/19, and SCPCS sample preparation was conducted at the Epidemiology Biospecimen Core Lab that is supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA68485). SEARCH: SEARCH is funded by a program grant from Cancer Research UK (C490/A10124) and supported by the UK National Institute for Health Research Biomedical Research Centre at the University of Cambridge. SNP_Prostate_Ghent: The study was supported by the National Cancer Plan, financed by the Federal Office of Health and Social Affairs, Belgium. SPAG: Wessex Medical ResearchHope for Guernsey, MUG, HSSD, MSG, Roger Allsopp STHM2: STHM2 was supported by grants from The Strategic Research Programme on Cancer (StratCan), Karolinska Institutet; the Linné Centre for Breast and Prostate Cancer (CRISP, number 70867901), Karolinska Institutet; The Swedish Research Council (number K2010-70X-20430-04-3) and The Swedish Cancer Society (numbers 11-0287 and 11-0624); Stiftelsen Johanna Hagstrand och Sigfrid Linnérs minne; Swedish Council for Working Life and Social Research (FAS), number 2012-0073STHM2 acknowledges the Karolinska University Laboratory, Aleris Medilab, Unilabs and the Regional Prostate Cancer Registry for performing analyses and help to retrieve data. Carin Cavalli-Björkman and Britt-Marie Hune for their enthusiastic work as research nurses. Astrid Björklund for skilful data management. We wish to thank the BBMRI.se biobank facility at Karolinska Institutet for biobank services. PCPT & SELECT are funded by Public Health Service grants U10CA37429 and 5UM1CA182883 from the National Cancer Institute. SWOG and SELECT thank the site investigators and staff and, most importantly, the participants who donated their time to this trial. TAMPERE: The Tampere (Finland) study was supported by the Academy of Finland (251074), The Finnish Cancer Organisations, Sigrid Juselius Foundation, and the Competitive Research Funding of the Tampere University Hospital (X51003). The PSA screening samples were collected by the Finnish part of ERSPC (European Study of Screening for Prostate Cancer). TAMPERE would like to thank Riina Liikanen, Liisa Maeaettaenen and Kirsi Talala for their work on samples and databases. UGANDA: None reported UKGPCS: UKGPCS would also like to thank the following for funding support: The Institute of Cancer Research and The Everyman Campaign, The Prostate Cancer Research Foundation, Prostate Research Campaign UK (now Prostate Action), The Orchid Cancer Appeal, The National Cancer Research Network UK, The National Cancer Research Institute (NCRI) UK. We are grateful for support of NIHR funding to the NIHR Biomedical Research Centre at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust. UKGPCS should also like to acknowledge the NCRN nurses, data managers, and consultants for their work in the UKGPCS study. UKGPCS would like to thank all urologists and other persons involved in the planning, coordination, and data collection of the study. ULM: The Ulm group received funds from the German Cancer Aid (Deutsche Krebshilfe). WUGS/WUPCS: WUGS would like to thank the following for funding support: The Anthony DeNovi Fund, the Donald C. McGraw Foundation, and the St. Louis Men's Group Against Cancer.
Previously we performed deep WGS on 6 parents and 13 children from 3 large families from the Scottish Family Health Study to identify de novo mutations. This prelim is cover the additional sequencing of one grandchild from one of these three families. The inclusion of a third generation individual will provide additional experimental validation for the de novo mutations found in the initial trio. As in the previous study, the DNA will be WGS to a depth of approximately 25X to achieve this purpose.
Diffuse large B-cell lymphoma is a heterogenous malignancy that can arise de novo or can result from histologic transformation from indolent lymphomas. Considerable sequencing effort has been placed on determining the mutations that are common at the time of diagnosis in DLBCL but little is known about the nature of relapsed disease. We have sequenced tumour and matched normal exomes of 38 rrDLBCL patients including 25 de novo DLBCLs and 13 transformed lymphomas to identify somatic mutations and copy number alterations associated with relapse.
We performed whole exome sequencing to trios with bipolar disorder to elucidate genetic architecture of bipolar disorder including de novo mutations and transmitted variants.
De novo assembly of eight immune system regions for individual HV31, generated using a multi-platform pipeline. A full description of the generation of these assemblies can be found at https://doi.org/10.1101/2021.02.03.429586.
This study investigated the impact of insulin resistance on chromatin acetylation and inflammation. Blood samples were obtained from women consented to City of Hope Institutional Review Board (IRB)-approved COH Protocol#/Ref#:18306/158149. Women provided demographic information and completed a survey assessing diet, housing and medical history. BMI was obtained from medical records. HgbA1c was measured and cytokine arrays were used to measure serum TNFalpha, IL6, IL15, IL16, IL23, Leptin, and CXCL1.
De novo metastatic prostate cancer is highly aggressive, but the paucity of routinely-collected tissue in this setting has hindered genomic stratification and jeopardizes precision oncology efforts. Here, we leveraged a rare study of surgical intervention in 43 de novo metastatic prostate cancers to assess somatic genotype across 607 synchronous primary and metastatic tissue foci plus circulating tumor DNA. Intra-prostate heterogeneity was pervasive and impacted clinically-relevant genes, resulting in frequent discordance between select primary-restricted foci and synchronous metastases. Additional complexity was driven in several instances by polyclonal metastatic seeding from the reservoir of phylogenetically-related primary populations. When simulating standard clinical practice relying on a single tissue focus, genomic heterogeneity plus highly variable per-sample tumor fraction caused false genotyping of dominant disease. However, in silico modeling demonstrated that analysis of multiple biopsy cores can rescue misassigned somatic genotypes. Our results define the relationship between synchronous treatment-naive primary and metastatic lesions in de novo metastatic prostate cancer and provide a framework for implementing genomics-guided patient management.
Intellectual Disability (ID) is one of the most common global disorders. However for ~30% of patients with ID no cause is identified, despite a clinical diagnostic odyssey that includes genome wide clinical microarray (CMA). We carefully selected 8 trios, each composed of a child with ID and brain structural defect, and both normal parents for whole genome sequencing (WGS). We conducted WGS on the trio and filtered out de novo single nucleotide variants (SNVs) and then used a pathway-based refinement to select candidates. We confirmed a de novo pathogenic SNV in ARID1B, and likely pathogenic SNVs in CACNB3, SPRY4, PHF6, SQSTM1 and UPF1 in 5 of the 8 children. All genes except ARID1B and PHF6 are previously unreported for ID. We analysed our WGS data using 4 independent algorthims for copy number and structural variation including using de novo whole genome assembly. We confirmed one likely contributory 165kb de novo CNV (missed by CMA). We conducted a validation study of over 2000 exomes for our novel candidate genes and also present a strategy to analyze the non-coding sequence space. This study provides an extensive analysis of WGS in the context of ID and yielded causative variants in >60% of cases.
RNAseq data set (Mollaoglu et al., MYC drives progression of small cell lung cancer to a variant neuroendocrine subtype with vulnerability to Aurora kinase inhibition) RNA isolation from primary tumors and healthy lungs was performed using RNeasy Mini Kit (Qiagen) with the standard protocol. RNA was subjected to library construction with the Illumina TruSeq Stranded mRNA Sample Preparation Kit (cat# RS-122-2101, RS-122-2102) according to manufacturer’s protocol. Chemically denatured sequencing libraries (25 pM) are applied to an Illumina HiSeq v4 single read flow cell using an Illumina cBot. Hybridized molecules were clonally amplified and annealed to sequencing primers with reagents from an Illumina HiSeq SR Cluster Kit v4-cBot (GD-401-4001). Following transfer of the flowcell to an Illumina HiSeq 2500 instrument (HCSv2.2.38 and RTA v1.18.61), a 50 cycle single-read sequence run was performed using HiSeq SBS Kit v4 sequencing reagents (FC-401-4002).
This study is a pilot study designed to describe the clinical and genetic landscape of Parkinson's disease in Costa Rica. While most genetic studies have been conducted on Europeans, few have examined Parkinson's disease in Costa Ricans. 118 patients and 97 controls were recruited at the Movement Disorders Unit of the Department of Neurology, Hospital San Juan de Dios, Caja Costarricense de Seguro Social, Costa Rica. Familial Parkinson's disease genes were sequenced using Next-Generation Sequencing. We also measured and obtained different sociodemographic and clinical information.
We whole genome sequence 2 patients with congenital abnormalities and known complex de novo chromosomal rearrangements using the Nanopore MinION. We show how we can detect most previoulsy detected de novo rearrangements as well as uncover a few new ones. Furthermore, we perform genome-wide analyses showing the accuracy of SV detection and of phasing, using long reads. The files uploaded here are the whole genome Nanopore data for each of the two patients used, as well as the corresponding whole genome illumina data, used in our comparisons.
In this, whole exome sequencing was performed on 185 patients suffering from Azoospermia or severe oligozoospermia and on their respective fertile parents in order to investigate de novo and inherited genetic variants likely to be the cause of infertility in these men. After bioinformatic analysis and comprehensive variant interpretation several de novo point mutations and copy number variations were identified as well as multiple homozygous and compound heterozygous mutations in recessive genes that are likely influencing the infertility phenotypes observed in the patients.
DNA methylation, together with chromatin modifications, constitute the epigenome that functions to regulate gene expression and genome integrity. DNA methylation alterations are ubiquitous in human cancers, as many genes acquire DNA methylation in a cancer-specific manner. DNA methylation at these sites in the genome of cancer cells not only serves as a marker for tumor identification, but together with gene mutation and gene expression data, can also be used to describe subsets of tumors of the same organ source. We have previously shown distinct human colorectal cancer subtypes based on DNA methylation differences. Correlating these DNA methylation differences with clinical co-variates will serve to further understand how these distinct subtypes are generated. We have collected 100 colorectal tumor tissues (for which clinical information is known) and have obtained (unprotected) genome-wide DNA methylation and chromatin modification information for each sample for the purposes of identifying and classifying unique tumor subtypes of colorectal cancers. In addition, we have determined mutations of key genes relevant to colorectal cancer as well as gene expression profiles. We will use the clinical data for each de-identified sample to correlate with the DNA methylation, mutation and gene expression information so as to understand the driving forces behind these distinct colorectal subtypes. We have selected the most promising tumors for whole-genome bisulfite sequencing using next-generation sequencing technology to obtain complete maps of colon cancer methylomes. Researchers will be unable to identify the subjects because the samples and associated information have been de-identified and anonymized by the tissue source site. In addition, upon receipt by the USC Epigenome Center, we have assigned new random identifiers for each sample. The data generated using these new codes are not traceable to the patient identity.
Sant Joan de Déu Research Institute (IRSJD) institutional Data Access Committee. The objective of the DAC is to manage, review and assess datasets access requests. The DAC is managed by the Knowledge Management Unit, and an adhoc committee per dataset will be formed to take into consideration each access request. Contact the Knowledge Management Unit at gestio.coneixement@sjd.es
Autism spectrum disorder (ASD) is highly heritable with recent sibling recurrence risk estimates of 19% overall and 26% in males. The heritable phenotype of hyperserotonemia, or elevated levels of platelet serotonin (5HT), in ~35% of people with ASD is a well-established biomarker. The efficacy of drugs like risperidone and potent serotonin transporter inhibitors at treating behaviors related to Insistence on Sameness, along with evidence of the contribution of hyperserotonemia to autism susceptibility collectively support the hypothesis that dysfunction in the 5HT system is a significant etiological target for investigation of the genetic component to autism. ASD genetic architecture is complex; common variants of large effect do not contribute substantially to overall ASD risk, but there are clearly common variants with small effects and rare genetic/genomic variation of larger effect among ASD genetic risk factors. The NIH ARRA Autism Sequencing Consortium, including Dr. Sutcliffe and Dr. Cook among others, has initiated exome sequencing studies to identify more of the rare variants contributing to ASD. Data from our group and others reveal numerous rare de novo and inherited sequence mutations, and the number of de novo and other functional mutations that are found to affect molecules encoding 5HT signaling and its regulation further reinforces our hypothesis that regulation of serotonin levels is important in autism genetic susceptibility. Integrin receptor signaling pathways were prominently featured among identified de novo mutations, thus defining a set of interrelated gene networks that control 5HT signaling. We propose to extend our findings to date by conducting exome sequencing on ACE subjects, for whom we also have 5HT biochemical data, to identify rare variants (including CNVs) in 5HT-related gene networks. In total we propose to have exome sequencing completed on 523 samples. The purpose of the proposed research is two-fold: 1) we will conduct more extensive investigations to determine the support for the hypothesis that genetic variation at genes regulating platelet serotonin levels affect susceptibility to ASD and 2) we will combine the exome sequence data from these studies with sequence data for autism being accumulated in a variety of other research projects to further the goal of identifying and characterizing the genetic component to ASD.
Single-cell RNA-seq profiling of immune cells sorted from human Melanoma tumors (and several matching PBMC samples). Contains de-multiplexed FASTQ files per plate (MARS-seq amplification batch, total 204 samples) and also de-multiplexed FASTQ files of single-cell TCRb-seq.
It is well established that autism spectrum disorders (ASD) have a strong genetic component; however, for at least 70% of cases, the underlying genetic cause is unknown. Under the hypothesis that de novo mutations underlie a substantial fraction of the risk for developing ASD in families with no previous history of ASD or related phenotypes-so-called sporadic or simplex families-we sequenced all coding regions of the genome (the exome) for parent-child trios exhibiting sporadic ASD, including 189 new trios and 20 that were previously reported. Additionally, we also sequenced the exomes of 50 unaffected siblings corresponding to these new (n = 31) and previously reported trios (n = 19), for a total of 617 individual exomes from 209 families deposited in dbGaP. Here we show that de novo point mutations are overwhelmingly paternal in origin (4:1 bias) and positively correlated with paternal age, consistent with the modest increased risk for children of older fathers to develop ASD. Moreover, 39% (49 of 126) of the most severe or disruptive de novo mutations map to a highly interconnected beta-catenin/chromatin remodelling protein network ranked significantly for autism candidate genes. In proband exomes, recurrent protein-altering mutations were observed in two genes: CHD8 and NTNG1. Mutation screening of six candidate genes in 1,703 ASD probands identified additional de novo, protein-altering mutations in GRIN2B, LAMC3 and SCN1A. Combined with copy number variant (CNV) data, these results indicate extreme locus heterogeneity but also provide a target for future discovery, diagnostics and therapeutics.
A pulldown experiment with Agilent SureSelect probes designed on regions that were more likely to contain de novo mutations. 266 candidate sites were selected based on whole genome sequencing data. The probes also included the exons of genes that have been identified as neurodevelopmental disorder genes in DDD (the DDG2P genes) 1,336 targets. In addition, the design included the standard iPLEX sites.
We set out to determine ctDNA abundance at de novo mCSPC diagnosis and whether ctDNA provides complementary clinically relevant information to a prostate biopsy. We collected and sequenced 77 plasma cell-free DNA samples from 53 newly diagnosed patients with mCSPC. Targeted sequencing was also performed on DNA from 48 diagnostic prostate tissue samples.
High-precision human leukocyte antigen (HLA) genotyping is crucial for anti-cancer immunotherapy, but existing tools predicting HLA genotypes using next-generation sequencing (NGS) data are insufficiently accurate. We compared the availability and accuracy of eight HLA genotyping tools (OptiType, HLA-HD, PHLAT, seq2HLA, arcasHLA, HLAscan, HLA*LA, and Kourami) using 1,275 cases from the 1000 Genomes Project data and created a new HLA-genotyping algorithm combining tools. Then, we assessed the new algorithm’s performance in 39 in-house samples with normal whole-exome sequencing (WES) data and polymerase chain reaction–sequencing-based typing (PCR-SBT) results.
Current methods available for pre-implantation genetic diagnosis (PGD) of in vitro fertilized (IVF) embryos do not detect de novo. Detection of these types of mutations requires whole genome sequencing (WGS). In this study advanced massively parallel WGS was performed on three 5-10 cell biopsies from two blastocyst-stage embryos. Overall, greater than 95% of each genome was called and experimentally derived haplotypes and barcoded read data were used to detect and phase up to 82% of de novo single base mutations with a false positive rate of ~1 error per Gb. These results suggest that phased WGS using barcoded DNA could be used in the future as part of the PGD process to maximize comprehensiveness in detecting disease causing mutations and reduce the incidence of genetic diseases.
Samples from Magdalena de Cao, Peru
This study is part of the 'First 1,000 Days of Life and Beyond' study at the Inova Translational Medicine Institute. Whole-genome sequencing data from 1,291 parent-offspring trios was used to study the properties of clustered de novo mutations. The maternal clusters were found to be enriched in regions with accelerated maternal mutation rate and show distinct mutational signatures. For additional details, please refer to: "Germline de novo mutation clusters arise during oocyte aging in genomic regions with increased double-strand break incidence". Jakob M. Goldmann, Vladimir B. Seplyarskiy, Wendy S.W. Wong, Thierry Vilboux, Pieter B. Neerincx, Dale L. Bodian, Benjamin D. Solomon, Joris A. Veltman, John F. Deeken, Christian Gilissen, John E. Niederhuber. Nature Genetics.