Hirschsprung's disease (HD) is an abdomen disorder which is caused by absence of enteric neurons. HD is a complex genetic disorder and several causing gene mutations have been identified including the receptor tyrosine kinase gene. However, the whole mechanism of HD remains unknown and additional genetic causalities are implicated. To further investigate the genetic causalities of HD, we conducted whole-exome sequencing of patients with HD and unaffected patient's families.
Pediatric T-cell acute lymphoblastic leukemia (T-ALL) is rare and its pathogenesis is poorly understood. To investigate molecular basis of pediatric T-ALL, we performed whole transcriptome sequencing in 123 cases of pediatric T-ALL.
A complete hydatidiform mole (CHM) is an abnormal pregnancy caused by genetic fertilization disorders. The aim of this study is to establish human trophoblast stem (TS) cell lines from CHMs and elucidate their pathogenesis.
Illumina Immunochip genotype data for coeliac disease and control samples of North Indian samples origin. Data is in PLINK binary format. Calling algorithm for genotypes is based on GenomeStudio (GenTrain), with manual clustering of selected variants.
The Cancer Model Development Center (CMDC) project is part of the Human Cancer Model Initiative (HCMI) and is managed by the Office of Cancer Genomics within the Center for Cancer Genomics at the National Cancer Institute (NCI). The goal of the HCMI is to generate approximately 1000 novel human "next generation" tumor-derived models as a community resource. The models aim to faithfully recapitulate the cellular complexity and heterogeneity of human tumors. The CMDCs are tasked by the NCI with producing a proportion of the cancer models. Wellcome Sanger Institute (WSI) is a contributing member of the HCMI. Models developed by HCMI are being derived from a number of tumor types and subtypes (see below), including rare adult, pediatric and understudied cancers. The number of models and the clinical sites from which patient samples are acquired is increasing. Normal tissue, originating tumor, and tumor-derived models are sequenced. The next generation cancer models are associated with rich clinical and genomic data through the NCI Genomic Data Commons for all researchers to advance cancer research. The CMDC and HCMI models will serve as excellent tools in research to provide insight into the pathways that influence tumor growth and progression, how tumors respond to therapy, etc. Ultimately, the data generated using these models will support the development of personalized oncology. There is a delay between registered IDs and model availability. The current model count is found in The HCMI Searchable Catalog which is updated when new models are available for distribution to the research community.
Count Me In - The Metastatic Breast Cancer Project: A Patient-Driven Research Initiative to Accelerate Metastatic Breast Cancer Research The Metastatic Breast Cancer Project is a research study that directly engages patients with metastatic breast cancer via social media and advocacy groups and empowers them to accelerate cancer research by sharing their samples and clinical information. Our goal is to create a publicly available dataset of genomic, molecular, clinical, and patient-reported data to enable research. Patients in the US or Canada may register online. Registered patients are sent an online consent form that asks for permission to obtain and analyze their medical records, tumor tissue, saliva, and blood samples. Once enrolled, patients are sent a saliva kit and asked to mail back a saliva sample, which is used to extract germline DNA. Study staff contact participants' medical providers and obtain medical records and a portion of their stored tumor biopsies. Patients may be asked to mail in a blood sample, which is used to extract cell free DNA (cfDNA). Whole exome sequencing (WES) is performed on tumor DNA, germline DNA, and cfDNA; transcriptome sequencing is performed on tumor RNA. Clinically annotated genomic data are used to study specific patient cohorts (including outliers) and to identify mechanisms of response and resistance to therapies. All de-identified data, including genomic, clinical, and patient-reported data, are shared via public databases on a pre-publication and recurring basis as it is generated. The latest data release in cBioPortal is available here. Study updates are shared with participants regularly.
Background: The disease course of patients with diffuse low-grade glioma is notoriously unpredictable. Temporal and spatially distinct samples may provide insight into the evolution of clinically relevant copy number aberrations (CNAs). The purpose of this study is to identify CNAs that are indicative of aggressive tumor behaviour and can thereby complement the prognostically favorable 1p/19q co-deletion. Results: Genome-wide, 50 base pair single-end, sequencing was performed to detect CNAs in a clinically well-characterized cohort of 98 formalin-fixed paraffin-embedded low-grade gliomas. CNAs are correlated with overall survival as an endpoint. Seventy-five additional samples from spatially distinct regions and paired recurrent tumors of the discovery cohort were analysed to interrogate the intratumoral heterogeneity and spatial evolution. Loss of 10q25.2-qter is a frequent subclonal event and significantly correlates with an unfavorable prognosis. A significant correlation is furthermore observed in a validation set of 126 and confirmation set of 184 patients. Loss of 10q25.2-qter arises in a longitudinal manner in paired recurrent tumor specimens, whereas the prognostically favorable 1p/ 19q co-deletion is the only CNA that is stable across spatial regions and recurrent tumors. Conclusions: CNAs in low-grade gliomas display extensive intratumoral heterogeneity. Distal loss of 10q is a late onset event and a marker for reduced overall survival in low-grade glioma patients. Intratumoral heterogeneity and higher frequencies of distal 10q loss in recurrences suggest this event is involved in outgrowth to the recurrent tumor.
Turner syndrome is a disorder affecting only girls and women, which is caused by a complete or partial loss of the second sex chromosome. Girls and women with Turner syndrome have a variety of phenotypes. Of interest to this study is aortopathy, including bicuspid aortic valve (BAV) and thoracic aortic dilation (TAD) in which we know that 25 - 50% of girls with Turner syndrome are born with. The aorta is the main artery that conducts the blood out of the heart. The aortic valve guards the entrance to the aorta. When the aortic valve has two leaflets instead of the usual three it is called a BAV. BAV is more common than all other congenital heart defects combined. BAV commonly causes obstructions to blood flow out of the heart or become leaky over time. BAV often occurs in combination with TAD. In Turner syndrome, TADs are increased 50-100 fold compared to the general population. Currently, there are no effective approaches or pre-surgical treatments for aortic disease. Therefore, progression of aortic disease in individuals with BAV/TAD can result in catastrophic aortic dissection, rupture, and death. Our goal is to leverage DNA sequencing and carefully curated patient samples to gain a better understanding of the molecular mechanisms that cause these inter-related aortic diseases.
This dataset contains a gene-cell matrix derived from single-cell RNA sequencing (scRNA-seq) data of ileal tissue from Crohn's disease (CD) patients and colorectal cancer (CRC) patients. It includes: Crohn's Disease Patients: A trio of transmural lesions (stenotic, inflamed, and non-inflamed) from each patient. Colorectal Cancer Patients: Unaffected ileal tissue used as external non-inflamed control. Cell Level Metadata: The dataset includes relevant cell-level metadata such as cell type annotations used in the study. Experimental Details: Platform: 10x Genomics Chromium Single Cell 3' GEX Sequencing: Illumina NovaSeq Processing: Data processed with Cell Ranger software. Resulting count matrices were merged for downstream analysis, including integration and dimensionality reduction. Dataset Composition: Crohn's Disease Patients: 10 patients with 3 samples each (non-inflamed, inflamed, stenotic), totaling 30 samples. Colorectal Cancer Patients: 5 patients with 1 sample each of unaffected tissue, totaling 5 samples. Data Provided: Merged Raw Count Matrix: The final merged raw count matrix used for downstream analysis. Cell Metadata File: Contains details of sample, tissue, and patient for each cell in the count matrix. Barcodes File: Indicate each cell barcode which also encodes the sample, tissue, and patient details for each cell. CD.S_Inf: Stenotic Corhn's disease inflamed samples CD.S_Sten: Stenotic CD patient stenosis sample CD.S_Prox: Stenotic CD Patient - proximal non-inflamed sample CC.C_Prox: CRC Patient proximal unaffected sample eg: A barcode 'CC.C_1_Prox_AAGTCGTAGACCCTTA' indicates CRC Patient unaffected proximal sampe from CRC Patient no.1 and the nucleic acid sequence indicate a unique cell from this sample. Total Samples: Crohn's Disease (CD) Patients: 30 samples Colorectal Cancer (CRC) Patients: 5 samples Patient_no Sample Sample_type 1 CC.C_1 CC.C_1_Prox CC.C_Prox 2 CD.S_1 CD.S_1_Prox CD.S_Prox 3 CD.S_1 CD.S_1_Infl CD.S_Infl 4 CD.S_1 CD.S_1_Sten CD.S_Sten 5 CC.C_2 CC.C_2_Prox CC.C_Prox 6 CD.S_2 CD.S_2_Prox CD.S_Prox 7 CD.S_2 CD.S_2_Infl CD.S_Infl 8 CD.S_2 CD.S_2_Sten CD.S_Sten 9 CC.C_3 CC.C_3_Prox CC.C_Prox 10 CC.C_4 CC.C_4_Prox CC.C_Prox 11 CD.S_3 CD.S_3_Prox CD.S_Prox 12 CD.S_3 CD.S_3_Infl CD.S_Infl 13 CD.S_3 CD.S_3_Sten CD.S_Sten 14 CD.S_4 CD.S_4_Prox CD.S_Prox 15 CD.S_4 CD.S_4_Infl CD.S_Infl 16 CD.S_4 CD.S_4_Sten CD.S_Sten 17 CC.C_5 CC.C_5_Prox CC.C_Prox 18 CD.S_5 CD.S_5_Prox CD.S_Prox 19 CD.S_5 CD.S_5_Infl CD.S_Infl 20 CD.S_5 CD.S_5_Sten CD.S_Sten 21 CD.S_6 CD.S_6_Prox CD.S_Prox 22 CD.S_6 CD.S_6_Infl CD.S_Infl 23 CD.S_6 CD.S_6_Sten CD.S_Sten 24 CD.S_7 CD.S_7_Prox CD.S_Prox 25 CD.S_7 CD.S_7_Infl CD.S_Infl 26 CD.S_7 CD.S_7_Sten CD.S_Sten 27 CD.S_8 CD.S_8_Prox CD.S_Prox 28 CD.S_8 CD.S_8_Infl CD.S_Infl 29 CD.S_8 CD.S_8_Sten CD.S_Sten 30 CD.S_9 CD.S_9_Prox CD.S_Prox 31 CD.S_9 CD.S_9_Infl CD.S_Infl 32 CD.S_9 CD.S_9_Sten CD.S_Sten 33 CD.S_10 CD.S_10_Prox CD.S_Prox 34 CD.S_10 CD.S_10_Infl CD.S_Infl 35 CD.S_10 CD.S_10_Sten CD.S_Sten
Data Use Ontology Data Use Ontology at EGA The EGA is committed to its involvement in the work of GA4GH. In an effort to enhance data discoverability and streamline data access, EGA have implemented the use of the Data Use Ontology (DUO), based on consent codes as described in Dyke et al. 2017. The Data Use Ontology codes will be displayed on the live dataset page of your submission to advise any would be requestor on how the data can be used and also to enhance data discoverability as users will be able to search on these codes to find applicable datasets. DUO can be browsed online via the Ontology Lookup Service Learn more reading the Data Use Ontology publication and the GA4GH Machine-Readable Consent Guidance! Check our DAC Portal Take The Tour and learn how to add DUO codes to your policy! Term Shorthand Label Description DUO:0000004 NRES no restriction This data use permission indicates there is no restriction on use. DUO:0000042 GRU general research use This data use permission indicates that use is allowed for general research use for any research purpose. DUO:0000006 HMB health or medical or biomedical research This data use permission indicates that use is allowed for health/medical/biomedical purposes; does not include the study of population origins or ancestry. DUO:0000007 DS disease specific research This data use permission indicates that use is allowed provided it is related to the specified disease. DUO:0000011 POA population origins or ancestry research only This data use permission indicates that use of the data is limited to the study of population origins or ancestry. DUO:0000012 RS research specific restrictions This data use modifier indicates that use is limited to studies of a certain research type. DUO:0000015 NMDS no general methods research This data use modifier indicates that use does not allow methods development research (e.g., development of software or algorithms). DUO:0000016 GSO genetic studies only This data use modifier indicates that use is limited to genetic studies only (i.e., studies that include genotype research alone or both genotype and phenotype research, but not phenotype research exclusively) DUO:0000018 NPUNCU not for profit, non commercial use only This data use modifier indicates that use of the data is limited to not-for-profit organizations and not-for-profit use, non-commercial use. DUO:0000019 PUB publication required This data use modifier indicates that requestor agrees to make results of studies using the data available to the larger scientific community. DUO:0000020 COL collaboration required This data use modifier indicates that the requestor must agree to collaboration with the primary study investigator(s). DUO:0000021 IRB ethics approval required This data use modifier indicates that the requestor must provide documentation of local IRB/ERB approval. DUO:0000022 GS geographical restriction This data use modifier indicates that use is limited to within a specific geographic region. DUO:0000024 MOR publication moratorium This data use modifier indicates that requestor agrees not to publish results of studies until a specific date. DUO:0000025 TS time limit on use This data use modifier indicates that use is approved for a specific number of months. DUO:0000026 US user specific restriction This data use modifier indicates that use is limited to use by approved users. DUO:0000027 PS project specific restriction This data use modifier indicates that use is limited to use within an approved project. DUO:0000028 IS institution specific restriction This data use modifier indicates that use is limited to use within an approved institution. DUO:0000029 RTN return to database or resource This data use modifier indicates that the requestor must return derived/enriched data to the database/resource. DUO:0000043 CC clinical care use This data use modifier indicates that use is allowed for clinical use and care. DUO:0000044 NPOA population origins or ancestry research prohibited This data use modifier indicates use for purposes of population, origin, or ancestry research is prohibited. DUO:0000045 NPU not for profit organisation use only This data use modifier indicates that use of the data is limited to not-for-profit organisations. DUO:0000046 NCU non-commercial use only This data use modifier indicates that use of the data is limited to not-for-profit use. Point to Notice: For the consent code DUO:0000007 where data is restricted to use on a specific disease, please accompany it with an appropriate ontology from MONDO e.g., If the data is restricted to the use of research into juvenile idiopathic arthritis the code should be displayed as DUO:0000007; MONDO:0011429.
This study is to understand (1) the clonal evolution of metastatic colorectal cancer using DNA sequencing and (2) long non-coding RNAs associated with metastatic colon cancer using RNA sequencing. Our goal is to better understand the progression of aggressive colorectal cancer.
Pediatric T-cell acute lymphoblastic leukemia (T-ALL) is rare and its pathogenesis is poorly understood. To investigate epigenetic profiles of pediatric T-ALL, we performed DNA methylation array analysis in 79 cases of pediatric T-ALL.
This study contains whole genome sequencing data for 3 matched tumour / normal pairs from Mesothelioma patients. The sequencing is 50bp paired-end generated using the BGISEQ-500 and is provided as 6 aligned and duplicate-marked BAM files.
This study is part of EuCanImage project. It contains the standardized clinical information following FHIR model. The data was collected together with MRI of patients diagnosed with liver cancer (HCC) and controls (non-HCC). All data is synthetic.
The aim of this study is to compare DCIS and IBC in a subtype stratified manner in several genomic levels (gene expression, copy number, methylation).The data in this archive is supplementary to previously published data.
The primary aim of the study is to conduct a genome-wide association study to identify genetic variants associated with the incidence of myocardial infarction (MI), stroke, and heart failure (HF) among participants enrolled in the Cardiovascular Health Study (CHS) who were free of clinical cardiovascular disease at baseline. The secondary aim is to conduct genome-wide association study of other phenotypes in CHS. The study is an ancillary study to CHS. CHS is a population-based cohort study of risk factors for heart disease and stroke among older adults recruited at 4 US sites in 1989-1990. Subjects underwent an extensive baseline examination, and annual follow-up examinations through 1988-1999.
Bipolar affective disorder is a severe, heritable condition affecting about one percent of the population. The mode of inheritance is poorly understood and probably involves multiple loci of small to moderate effect. In this project, we use genetic mapping and sequencing methods to identify genetic markers and variations that contribute to the risk of bipolar disorder. Individuals diagnosed with bipolar disorder are studied, along with their relatives. Phenotypic information obtained from clinical interviews and family history is correlated with genotypic information obtained from genetic marker and sequencing methods. The goal is to identify genes involved in bipolar disorder and related conditions so that better methods of diagnosis, treatment, and prevention can be developed.
Thrombocytopenia with Absent Radii is an inherited disorder that manifests itself with major limb skeletal abnormalities and low platelet count (and therefore a bleeding diathesis). The syndrome is well-characterised and defined phenotypically and there is a well-established database of UK-based families affected with ths disorder. The causative mutation for the disorder is yet to be identified. If known, it would allow for pre-natal screening and counselling avoiding life-long care for patients who are affected and are therefore severely disabled. We postulate that exon sequencing of 4 unrelated affected individuals would give enough power to narrow down potential candidate mutations which would thereafter be confirmed using DNA from other affected families.
Please note: This study contains synthetic data (with cohort “participants” / ”subjects” marked with FAKE) has no identifiable data and cannot be used to make any inference about cohort data or results. The purpose of this dataset is to aid development of technical implementations for cohort data discovery, harmonization, access, and federated analysis. In support of FAIRness in data sharing, this dataset is made freely available under the Creative Commons Licence (CC-BY). Please ensure this preamble is included with this dataset and that the CINECA project (funding: EC H2020 grant 825775) is acknowledged. For any questions please contact isuru@ebi.ac.uk or cthomas@ebi.ac.uk
Prostate cancer is a leading cause of cancer-related death and morbidity worldwide. Androgen deprivation therapy (ADT) is the cornerstone of management for advanced disease. The use of androgen deprivation therapies is associated with multiple side effects, including metabolic syndrome and truncal obesity. At the same time, obesity has been associated with both prostate cancer development and disease progression, linked to its effects on chronic inflammation at a tissue level. The connection between androgen deprivation therapy, obesity, inflammation, and prostate cancer progression is well-established in clinical settings; however, an understanding of the changes in adipose tissue at the molecular level induced by castrating therapies is missing. Here we investigated the transcriptional changes in periprostatic fat tissue induced by profound androgen deprivation therapy in a group of patients with high-risk tumours compared to a matching untreated cohort. We find that androgen deprivation therapy is associated with a pro-inflammatory and obesity-like adipose tissue microenvironment. This study suggests that the beneficial effect of androgen deprivation therapy may be partially counteracted by metabolic and inflammatory side effects in the adipose tissue surrounding the prostate.
Original description of the study: From ELLIPSE (linked to the PRACTICAL consortium), we contributed ~78,000 SNPs to the OncoArray. A large fraction of the content was derived from the GWAS meta-analyses in European ancestry populations (overall and aggressive disease; ~27K SNPs). We also selected just over 10,000 SNPs from the meta-analyses in the non-European populations, with a majority of these SNPs coming from the analysis of overall prostate cancer in African ancestry populations as well as from the multiethnic meta-analysis. A substantial fraction of SNPs (~28,000) were also selected for fine-mapping of 53 loci not included in the common fine-mapping regions (tagging at r2>0.9 across ±500kb regions). We also selected a few thousand SNPs related with PSA levels and/or disease survival as well as SNPs from candidate lists provided by study collaborators, as well as from meta-analyses of exome SNP chip data from the Multiethnic Cohort and UK studies. The Contributing Studies: Aarhus: Hospital-based, Retrospective, Observational. Source of cases: Patients treated for prostate adenocarcinoma at Department of Urology, Aarhus University Hospital, Skejby (Aarhus, Denmark). Source of controls: Age-matched males treated for myocardial infarction or undergoing coronary angioplasty, but with no prostate cancer diagnosis based on information retrieved from the Danish Cancer Register and the Danish Cause of Death Register. AHS: Nested case-control study within prospective cohort. Source of cases: linkage to cancer registries in study states. Source of controls: matched controls from cohort ATBC: Prospective, nested case-control. Source of cases: Finnish male smokers aged 50-69 years at baseline. Source of controls: Finnish male smokers aged 50-69 years at baseline BioVu: Cases identified in a biobank linked to electronic health records. Source of cases: A total of 214 cases were identified in the VUMC de-identified electronic health records database (the Synthetic Derivative) and shipped to USC for genotyping in April 2014. The following criteria were used to identify cases: Age 18 or greater; male; African Americans (Black) only. Note that African ancestry is not self-identified, it is administratively or third-party assigned (which has been shown to be highly correlated with genetic ancestry for African Americans in BioVU; see references). Source of controls: Controls were identified in the de-identified electronic health record. Unfortunately, they were not age matched to the cases, and therefore cannot be used for this study. Canary PASS: Prospective, Multi-site, Observational Active Surveillance Study. Source of cases: clinic based from Beth Israel Deaconness Medical Center, Eastern Virginia Medical School, University of California at San Francisco, University of Texas Health Sciences Center San Antonio, University of Washington, VA Puget Sound. Source of controls: N/A CCI: Case series, Hospital-based. Source of cases: Cases identified through clinics at the Cross Cancer Institute. Source of controls: N/A CerePP French Prostate Cancer Case-Control Study (ProGene): Case-Control, Prospective, Observational, Hospital-based. Source of cases: Patients, treated in French departments of Urology, who had histologically confirmed prostate cancer. Source of controls: Controls were recruited as participating in a systematic health screening program and found unaffected (normal digital rectal examination and total PSA < 4 ng/ml, or negative biopsy if PSA > 4 ng/ml). COH: hospital-based cases and controls from outside. Source of cases: Consented prostate cancer cases at City of Hope. Source of controls: Consented unaffected males that were part of other studies where they consented to have their DNA used for other research studies. COSM: Population-based cohort. Source of cases: General population. Source of controls: General population CPCS1: Case-control - Denmark. Source of cases: Hospital referrals. Source of controls: Copenhagen General Population Study CPCS2: Source of cases: Hospital referrals. Source of controls: Copenhagen General Population Study CPDR: Retrospective cohort. Source of cases: Walter Reed National Military Medical Center. Source of controls: Walter Reed National Military Medical Center ACS_CPS-II: Nested case-control derived from a prospective cohort study. Source of cases: Identified through self-report on follow-up questionnaires and verified through medical records or cancer registries, identified through cancer registries or the National Death Index (with prostate cancer as the primary cause of death). Source of controls: Cohort participants who were cancer-free at the time of diagnosis of the matched case, also matched on age (±6 mo) and date of biospecimen donation (±6 mo). EPIC: Case-control - Germany, Greece, Italy, Netherlands, Spain, Sweden, UK. Source of cases: Identified through record linkage with population-based cancer registries in Italy, the Netherlands, Spain, Sweden and UK. In Germany and Greece, follow-up is active and achieved through checks of insurance records and cancer and pathology registries as well as via self-reported questionnaires; self-reported incident cancers are verified through medical records. Source of controls: Cohort participants without a diagnosis of cancer EPICAP: Case-control, Population-based, ages less than 75 years at diagnosis, Hérault, France. Source of cases: Prostate cancer cases in all public hospitals and private urology clinics of département of Hérault in France. Cases validation by the Hérault Cancer Registry. Source of controls: Population-based controls, frequency age matched (5-year groups). Quotas by socio-economic status (SES) in order to obtain a distribution by SES among controls identical to the SES distribution among general population men, conditionally to age. ERSPC: Population-based randomized trial. Source of cases: Men with PrCa from screening arm ERSPC Rotterdam. Source of controls: Men without PrCa from screening arm ERSPC Rotterdam ESTHER: Case-control, Prospective, Observational, Population-based. Source of cases: Prostate cancer cases in all hospitals in the state of Saarland, from 2001-2003. Source of controls: Random sample of participants from routine health check-up in Saarland, in 2000-2002 FHCRC: Population-based, case-control, ages 35-74 years at diagnosis, King County, WA, USA. Source of cases: Identified through the Seattle-Puget Sound SEER cancer registry. Source of controls: Randomly selected, age-frequency matched residents from the same county as cases Gene-PARE: Hospital-based. Source of cases: Patients that received radiotherapy for treatment of prostate cancer. Source of controls: n/a Hamburg-Zagreb: Hospital-based, Prospective. Source of cases: Prostate cancer cases seen at the Department of Oncology, University Hospital Center Zagreb, Croatia. Source of controls: Population-based (Croatia), healthy men, older than 50, with no medical record of cancer, and no family history of cancer (1st & 2nd degree relatives) HPFS: Nested case-control. Source of cases: Participants of the HPFS cohort. Source of controls: Participants of the HPFS cohort IMPACT: Observational. Source of cases: Carriers and non-carriers (with a known mutation in the family) of the BRCA1 and BRCA2 genes, aged between 40 and 69, who are undergoing prostate screening with annual PSA testing. This cohort has been diagnosed with prostate cancer during the study. Source of controls: Carriers and non-carriers (with a known mutation in the family) of the BRCA1 and BRCA2 genes, aged between 40 and 69, who are undergoing prostate screening with annual PSA testing. This cohort has not been diagnosed with prostate cancer during the study. IPO-Porto: Hospital-based. Source of cases: Early onset and/or familial prostate cancer. Source of controls: Blood donors Karuprostate: Case-control, Retrospective, Population-based. Source of cases: From FWI (Guadeloupe): 237 consecutive incident patients with histologically confirmed prostate cancer attending public and private urology clinics; From Democratic Republic of Congo: 148 consecutive incident patients with histologically confirmed prostate cancer attending the University Clinic of Kinshasa. Source of controls: From FWI (Guadeloupe): 277 controls recruited from men participating in a free systematic health screening program open to the general population; From Democratic Republic of Congo: 134 controls recruited from subjects attending the University Clinic of Kinshasa KULEUVEN: Hospital-based, Prospective, Observational. Source of cases: Prostate cancer cases recruited at the University Hospital Leuven. Source of controls: Healthy males with no history of prostate cancer recruited at the University Hospitals, Leuven. LAAPC: Subjects were participants in a population-based case-control study of aggressive prostate cancer conducted in Los Angeles County. Cases were identified through the Los Angeles County Cancer Surveillance Program rapid case ascertainment system. Eligible cases included African American, Hispanic, and non-Hispanic White men diagnosed with a first primary prostate cancer between January 1, 1999 and December 31, 2003. Eligible cases also had (a) prostatectomy with documented tumor extension outside the prostate, (b) metastatic prostate cancer in sites other than prostate, (c) needle biopsy of the prostate with Gleason grade ≥8, or (d) needle biopsy with Gleason grade 7 and tumor in more than two thirds of the biopsy cores. Eligible controls were men never diagnosed with prostate cancer, living in the same neighborhood as a case, and were frequency matched to cases on age (± 5 y) and race/ethnicity. Controls were identified by a neighborhood walk algorithm, which proceeds through an obligatory sequence of adjacent houses or residential units beginning at a specific residence that has a specific geographic relationship to the residence where the case lived at diagnosis. Malaysia: Case-control. Source of cases: Patients attended the outpatient urology or uro-onco clinic at University Malaya Medical Center. Source of controls: Population-based, age matched (5-year groups), ascertained through electoral register, Subang Jaya, Selangor, Malaysia MCC-Spain: Case-control. Source of cases: Identified through the urology departments of the participating hospitals. Source of controls: Population-based, frequency age and region matched, ascertained through the rosters of the primary health care centers MCCS: Nested case-control, Melbourne, Victoria. Source of cases: Identified by linkage to the Victorian Cancer Registry. Source of controls: Cohort participants without a diagnosis of cancer MD Anderson: Participants in this study were identified from epidemiological prostate cancer studies conducted at the University of Texas MD Anderson Cancer Center in the Houston Metropolitan area. Cases were accrued in the Houston Medical Center and were not restricted with respect to Gleason score, stage or PSA. Controls were identified via random-digit-dialing or among hospital visitors and they were frequency matched to cases on age and race. Lifestyle, demographic, and family history data were collected using a standardized questionnaire. MDACC_AS: A prospective cohort study. Source of cases: Men with clinically organ-confined prostate cancer meeting eligibility criteria for a prospective cohort study of active surveillance at MD Anderson Cancer Center. Source of controls: N/A MEC: The Multiethnic Cohort (MEC) is comprised of over 215,000 men and women recruited from Hawaii and the Los Angeles area between 1993 and 1996. Between 1995 and 2006, over 65,000 blood samples were collected from participants for genetic analyses. To identify incident cancer cases, the MEC was cross-linked with the population-based Surveillance, Epidemiology and End Results (SEER) registries in California and Hawaii, and unaffected cohort participants with blood samples were selected as controls MIAMI (WFPCS): Prostate cancer cases and controls were recruited from the Departments of Urology and Internal Medicine of the Wake Forest University School of Medicine using sequential patient populations as described previously (PMID:15342424). All study subjects received a detailed description of the study protocol and signed their informed consent, as approved by the medical center's Institutional Review Board. The general eligibility criteria were (i) able to comprehend informed consent and (ii) without previously diagnosed cancer. The exclusion criteria were (i) clinical diagnosis of autoimmune diseases; (ii) chronic inflammatory conditions; and (iii) infections within the past 6 weeks. Blood samples were collected from all subjects. MOFFITT: Hospital-based. Source of cases: clinic based from Moffitt Cancer Center. Source of controls: Moffitt Cancer Center affiliated Lifetime cancer screening center NMHS: Case-control, clinic based, Nashville TN. Source of cases: All urology clinics in Nashville, TN. Source of controls: Men without prostate cancer at prostate biopsy. PCaP: The North Carolina-Louisiana Prostate Cancer Project (PCaP) is a multidisciplinary population-based case-only study designed to address racial differences in prostate cancer through a comprehensive evaluation of social, individual and tumor level influences on prostate cancer aggressiveness. PCaP enrolled approximately equal numbers of African Americans and Caucasian Americans with newly-diagnosed prostate cancer from North Carolina (42 counties) and Louisiana (30 parishes) identified through state tumor registries. African American PCaP subjects with DNA, who agreed to future use of specimens for research, participated in OncoArray analysis. PCMUS: Case-control - Sofia, Bulgaria. Source of cases: Patients of Clinic of Urology, Alexandrovska University Hospital, Sofia, Bulgaria, PrCa histopathologically confirmed. Source of controls: 72 patients with verified BPH and PSA<3,5; 78 healthy controls from the MMC Biobank, no history of PrCa PHS: Nested case-control. Source of cases: Participants of the PHS1 trial/cohort. Source of controls: Participants of the PHS1 trial/cohort PLCO: Nested case-control. Source of cases: Men with a confirmed diagnosis of prostate cancer from the PLCO Cancer Screening Trial. Source of controls: Controls were men enrolled in the PLCO Cancer Screening Trial without a diagnosis of cancer at the time of case ascertainment. Poland: Case-control. Source of cases: men with unselected prostate cancer, diagnosed in north-western Poland at the University Hospital in Szczecin. Source of controls: cancer-free men from the same population, taken from the healthy adult patients of family doctors in the Szczecin region PROCAP: Population-based, Retrospective, Observational. Source of cases: Cases were ascertained from the National Prostate Cancer Register of Sweden Follow-Up Study, a retrospective nationwide cohort study of patients with localized prostate cancer. Source of controls: Controls were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. PROGReSS: Hospital-based, Prospective, Observational. Source of cases: Prostate cancer cases from the Hospital Clínico Universitario de Santiago de Compostela, Galicia, Spain. Source of controls: Cancer-free men from the same population ProMPT: A study to collect samples and data from subjects with and without prostate cancer. Retrospective, Experimental. Source of cases: Subjects attending outpatient clinics in hospitals. Source of controls: Subjects attending outpatient clinics in hospitals ProtecT: Trial of treatment. Samples taken from subjects invited for PSA testing from the community at nine centers across United Kingdom. Source of cases: Subjects who have a proven diagnosis of prostate cancer following testing. Source of controls: Identified through invitation of subjects in the community. PROtEuS: Case-control, population-based. Source of cases: All new histologically-confirmed cases, aged less or equal to 75 years, diagnosed between 2005 and 2009, actively ascertained across Montreal French hospitals. Source of controls: Randomly selected from the Provincial electoral list of French-speaking men between 2005 and 2009, from the same area of residence as cases and frequency-matched on age. QLD: Case-control. Source of cases: A longitudinal cohort study (Prostate Cancer Supportive Care and Patient Outcomes Project: ProsCan) conducted in Queensland, through which men newly diagnosed with prostate cancer from 26 private practices and 10 public hospitals were directly referred to ProsCan at the time of diagnosis by their treating clinician (age range 43-88 years). All cases had histopathologically confirmed prostate cancer, following presentation with an abnormal serum PSA and/or lower urinary tract symptoms. Source of controls: Controls comprised healthy male blood donors with no personal history of prostate cancer, recruited through (i) the Australian Red Cross Blood Services in Brisbane (age range 19-76 years) and (ii) the Australian Electoral Commission (AEC) (age and post-code/ area matched to ProsCan, age range 54-90 years). RAPPER: Multi-centre, hospital based blood sample collection study in patients enrolled in clinical trials with prospective collection of radiotherapy toxicity data. Source of cases: Prostate cancer patients enrolled in radiotherapy trials: CHHiP, RT01, Dose Escalation, RADICALS, Pelvic IMRT, PIVOTAL. Source of controls: N/A SABOR: Prostate Cancer Screening Cohort. Source of cases: Men >45 yrs of age participating in annual PSA screening. Source of controls: Males participating in annual PSA prostate cancer risk evaluations (funded by NCI biomarkers discovery and validation grant), recruited through University of Texas Health Science Center at San Antonio and affiliated sites or through study advertisements, enrolment open to the community SCCS: Case-control in cohort, Southeastern USA. Prospective, Observational, Population-based. Source of cases: SCCS entry population. Source of controls: SCCS entry population SCPCS: Population-based, Retrospective, Observational. Source of cases: South Carolina Central Cancer Registry. Source of controls: Health Care Financing Administration beneficiary file SEARCH: Case-control - East Anglia, UK. Source of cases: Men < 70 years of age registered with prostate cancer at the population-based cancer registry, Eastern Cancer Registration and Information Centre, East Anglia, UK. Source of controls: Men attending general practice in East Anglia with no known prostate cancer diagnosis, frequency matched to cases by age and geographic region SNP_Prostate_Ghent: Hospital-based, Retrospective, Observational. Source of cases: Men treated with IMRT as primary or postoperative treatment for prostate cancer at the Ghent University Hospital between 2000 and 2010. Source of controls: Employees of the University hospital and members of social activity clubs, without a history of any cancer. SPAG: Hospital-based, Retrospective, Observational. Source of cases: Guernsey. Source of controls: Guernsey STHM2: Population-based, Retrospective, Observational. Source of cases: Cases were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. Source of controls: Controls were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. PCPT: Case-control from a randomized clinical trial. Source of cases: Randomized clinical trial. Source of controls: Randomized clinical trial SELECT: Case-cohort from a randomized clinical trial. Source of cases: Randomized clinical trial. Source of controls: Randomized clinical trial TAMPERE: Case-control - Finland, Retrospective, Observational, Population-based. Source of cases: Identified through linkage to the Finnish Cancer Registry and patient records; and the Finnish arm of the ERSPC study. Source of controls: Cohort participants without a diagnosis of cancer UGANDA: Uganda Prostate Cancer Study: Uganda is a case-control study of prostate cancer in Kampala Uganda that was initiated in 2011. Men with prostate cancer were enrolled from the Urology unit at Mulago Hospital and men without prostate cancer (i.e. controls) were enrolled from other clinics (i.e. surgery) at the hospital. UKGPCS: ICR, UK. Source of cases: Cases identified through clinics at the Royal Marsden hospital and nationwide NCRN hospitals. Source of controls: Ken Muir's control- 2000 ULM: Case-control - Germany. Source of cases: familial cases (n=162): identified through questionnaires for family history by collaborating urologists all over Germany; sporadic cases (n=308): prostatectomy series performed in the Clinic of Urology Ulm between 2012 and 2014. Source of controls: age-matched controls (n=188): age-matched men without prostate cancer and negative family history collected in hospitals of Ulm WUGS/WUPCS: Cases Series, USA. Source of cases: Identified through clinics at Washington University in St. Louis. Source of controls: Men diagnosed and managed with prostate cancer in University based clinic. Acknowledgement Statements: Aarhus: This study was supported by the Danish Strategic Research Council (now Innovation Fund Denmark) and the Danish Cancer Society. The Danish Cancer Biobank (DCB) is acknowledged for biological material. AHS: This work was supported by the Intramural Research Program of the NIH, National Cancer Institute, Division of Cancer Epidemiology and Genetics (Z01CP010119). ATBC: This research was supported in part by the Intramural Research Program of the NIH and the National Cancer Institute. Additionally, this research was supported by U.S. Public Health Service contracts N01-CN-45165, N01-RC-45035, N01-RC-37004, HHSN261201000006C, and HHSN261201500005C from the National Cancer Institute, Department of Health and Human Services. BioVu: The dataset(s) used for the analyses described were obtained from Vanderbilt University Medical Center's BioVU which is supported by institutional funding and by the National Center for Research Resources, Grant UL1 RR024975-01 (which is now at the National Center for Advancing Translational Sciences, Grant 2 UL1 TR000445-06). Canary PASS: PASS was supported by Canary Foundation and the National Cancer Institute's Early Detection Research Network (U01 CA086402) CCI: This work was awarded by Prostate Cancer Canada and is proudly funded by the Movember Foundation - Grant # D2013-36.The CCI group would like to thank David Murray, Razmik Mirzayans, and April Scott for their contribution to this work. CerePP French Prostate Cancer Case-Control Study (ProGene): None reported COH: SLN is partially supported by the Morris and Horowitz Families Endowed Professorship COSM: The Swedish Research Council, the Swedish Cancer Foundation CPCS1 & CPCS2: Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev Ringvej 75, DK-2730 Herlev, DenmarkCPCS1 would like to thank the participants and staff of the Copenhagen General Population Study for their important contributions. CPDR: Uniformed Services University for the Health Sciences HU0001-10-2-0002 (PI: David G. McLeod, MD) CPS-II: The American Cancer Society funds the creation, maintenance, and updating of the Cancer Prevention Study II cohort. CPS-II thanks the participants and Study Management Group for their invaluable contributions to this research. We would also like to acknowledge the contribution to this study from central cancer registries supported through the Centers for Disease Control and Prevention National Program of Cancer Registries, and cancer registries supported by the National Cancer Institute Surveillance Epidemiology and End Results program. EPIC: The coordination of EPIC is financially supported by the European Commission (DG-SANCO) and the International Agency for Research on Cancer. The national cohorts are supported by the Danish Cancer Society (Denmark); the Deutsche Krebshilfe, Deutsches Krebsforschungszentrum and Federal Ministry of Education and Research (Germany); the Hellenic Health Foundation, Greek Ministry of Health; Greek Ministry of Education (Greece); the Italian Association for Research on Cancer (AIRC) and National Research Council (Italy); the Dutch Ministry of Public Health, Welfare and Sports (VWS), Netherlands Cancer Registry (NKR), LK Research Funds, Dutch Prevention Funds, Dutch ZON (Zorg Onderzoek Nederland), World Cancer Research Fund (WCRF); the Statistics Netherlands (The Netherlands); the Health Research Fund (FIS), Regional Governments of Andalucía, Asturias, Basque Country, Murcia and Navarra, Spanish Ministry of Health ISCIII RETIC (RD06/0020), Red de Centros RCESP, C03/09 (Spain); the Swedish Cancer Society, Swedish Scientific Council and Regional Government of Skåne and Västerbotten, Fundacion Federico SA (Sweden); the Cancer Research UK, Medical Research Council (United Kingdom). EPICAP: The EPICAP study was supported by grants from Ligue Nationale Contre le Cancer, Ligue départementale du Val de Marne; Fondation de France; Agence Nationale de sécurité sanitaire de l'alimentation, de l'environnement et du travail (ANSES). The EPICAP study group would like to thank all urologists, Antoinette Anger and Hasina Randrianasolo (study monitors), Anne-Laure Astolfi, Coline Bernard, Oriane Noyer, Marie-Hélène De Campo, Sandrine Margaroline, Louise N'Diaye, and Sabine Perrier-Bonnet (Clinical Research nurses). ERSPC: This study was supported by the DutchCancerSociety (KWF94-869,98-1657,2002-277,2006-3518, 2010-4800), The Netherlands Organisation for Health Research and Development (ZonMW-002822820, 22000106, 50-50110-98-311, 62300035), The Dutch Cancer Research Foundation (SWOP), and an unconditional grant from Beckman-Coulter-HybritechInc. ESTHER: The ESTHER study was supported by a grant from the Baden Württemberg Ministry of Science, Research and Arts. The ESTHER group would like to thank Hartwig Ziegler, Sonja Wolf, Volker Hermann, Heiko Müller, Karina Dieffenbach, Katja Butterbach for valuable contributions to the study. FHCRC: The FHCRC studies were supported by grants R01-CA056678, R01-CA082664, and R01-CA092579 from the US National Cancer Institute, National Institutes of Health, with additional support from the Fred Hutchinson Cancer Research Center. FHCRC would like to thank all the men who participated in these studies. Gene-PARE: The Gene-PARE study was supported by grants 1R01CA134444 from the U.S. National Institutes of Health, PC074201 and W81XWH-15-1-0680 from the Prostate Cancer Research Program of the Department of Defense and RSGT-05-200-01-CCE from the American Cancer Society. Hamburg-Zagreb: None reported HPFS: The Health Professionals Follow-up Study was supported by grants UM1CA167552, CA133891, CA141298, and P01CA055075. HPFS are grateful to the participants and staff of the Physicians' Health Study and Health Professionals Follow-Up Study for their valuable contributions, as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, and WY. IMPACT: The IMPACT study was funded by The Ronald and Rita McAulay Foundation, CR-UK Project grant (C5047/A1232), Cancer Australia, AICR Netherlands A10-0227, Cancer Australia and Cancer Council Tasmania, NIHR, EU Framework 6, Cancer Councils of Victoria and South Australia, and Philanthropic donation to Northshore University Health System. We acknowledge support from the National Institute for Health Research (NIHR) to the Biomedical Research Centre at The Institute of Cancer Research and Royal Marsden Foundation NHS Trust. IMPACT acknowledges the IMPACT study steering committee, collaborating centres, and participants. IPO-Porto: The IPO-Porto study was funded by Fundaçäo para a Ciência e a Tecnologia (FCT; UID/DTP/00776/2013 and PTDC/DTP-PIC/1308/2014) and by IPO-Porto Research Center (CI-IPOP-16-2012 and CI-IPOP-24-2015). MC and MPS are research fellows from Liga Portuguesa Contra o Cancro, Núcleo Regional do Norte. SM is a research fellow from FCT (SFRH/BD/71397/2010). IPO-Porto would like to express our gratitude to all patients and families who have participated in this study. Karuprostate: The Karuprostate study was supported by the the Frech National Health Directorate and by the Association pour la Recherche sur les Tumeurs de la ProstateKarusprostate thanks Séverine Ferdinand. KULEUVEN: F.C. and S.J. are holders of grants from FWO Vlaanderen (G.0684.12N and G.0830.13N), the Belgian federal government (National Cancer Plan KPC_29_023), and a Concerted Research Action of the KU Leuven (GOA/15/017). TVDB is holder of a doctoral fellowship of the FWO. LAAPC: This study was funded by grant R01CA84979 (to S.A. Ingles) from the National Cancer Institute, National Institutes of Health. Malaysia: The study was funded by the University Malaya High Impact Research Grant (HIR/MOHE/MED/35). Malaysia thanks all associates in the Urology Unit, University of Malaya, Cancer Research Initiatives Foundation (CARIF) and the Malaysian Men's Health Initiative (MMHI). MCCS: MCCS cohort recruitment was funded by VicHealth and Cancer Council Victoria. The MCCS was further supported by Australian NHMRC grants 209057, 251553, and 504711, and by infrastructure provided by Cancer Council Victoria. Cases and their vital status were ascertained through the Victorian Cancer Registry (VCR) and the Australian Institute of Health and Welfare (AIHW), including the National Death Index and the Australian Cancer Database. MCC-Spain: The study was partially funded by the Accion Transversal del Cancer, approved on the Spanish Ministry Council on the 11th October 2007, by the Instituto de Salud Carlos III-FEDER (PI08/1770, PI09/00773-Cantabria, PI11/01889-FEDER, PI12/00265, PI12/01270, and PI12/00715), by the Fundación Marqués de Valdecilla (API 10/09), by the Spanish Association Against Cancer (AECC) Scientific Foundation and by the Catalan Government DURSI grant 2009SGR1489. Samples: Biological samples were stored at the Parc de Salut MAR Biobank (MARBiobanc; Barcelona) which is supported by Instituto de Salud Carlos III FEDER (RD09/0076/00036). Also sample collection was supported by the Xarxa de Bancs de Tumors de Catalunya sponsored by Pla Director d'Oncologia de Catalunya (XBTC). MCC-Spain acknowledges the contribution from Esther Gracia-Lavedan in preparing the data. We thank all the subjects who participated in the study and all MCC-Spain collaborators. MD Anderson: Prostate Cancer Case-Control Studies at MD Anderson (MDA) supported by grants CA68578, ES007784, DAMD W81XWH-07-1-0645, and CA140388. MDACC_AS: None reported MEC: Funding provided by NIH grant U19CA148537 and grant U01CA164973. MIAMI (WFPCS): ACS MOFFITT: The Moffitt group was supported by the US National Cancer Institute (R01CA128813, PI: J.Y. Park). NMHS: Funding for the Nashville Men's Health Study (NMHS) was provided by the National Institutes of Health Grant numbers: RO1CA121060. PCaP only data: The North Carolina - Louisiana Prostate Cancer Project (PCaP) is carried out as a collaborative study supported by the Department of Defense contract DAMD 17-03-2-0052. For HCaP-NC follow-up data: The Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study is carried out as a collaborative study supported by the American Cancer Society award RSGT-08-008-01-CPHPS. For studies using both PCaP and HCaP-NC follow-up data please use: The North Carolina - Louisiana Prostate Cancer Project (PCaP) and the Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study are carried out as collaborative studies supported by the Department of Defense contract DAMD 17-03-2-0052 and the American Cancer Society award RSGT-08-008-01-CPHPS, respectively. For any PCaP data, please include: The authors thank the staff, advisory committees and research subjects participating in the PCaP study for their important contributions. For studies using PCaP DNA/genotyping data, please include: We would like to acknowledge the UNC BioSpecimen Facility and LSUHSC Pathology Lab for our DNA extractions, blood processing, storage and sample disbursement (https://genome.unc.edu/bsp). For studies using PCaP tissue, please include: We would like to acknowledge the RPCI Department of Urology Tissue Microarray and Immunoanalysis Core for our tissue processing, storage and sample disbursement. For studies using HCaP-NC follow-up data, please use: The Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study is carried out as a collaborative study supported by the American Cancer Society award RSGT-08-008-01-CPHPS. The authors thank the staff, advisory committees and research subjects participating in the HCaP-NC study for their important contributions. For studies that use both PCaP and HCaP-NC, please use: The authors thank the staff, advisory committees and research subjects participating in the PCaP and HCaP-NC studies for their important contributions. PCMUS: The PCMUS study was supported by the Bulgarian National Science Fund, Ministry of Education and Science (contract DOO-119/2009; DUNK01/2-2009; DFNI-B01/28/2012) with additional support from the Science Fund of Medical University - Sofia (contract 51/2009; 8I/2009; 28/2010). PHS: The Physicians' Health Study was supported by grants CA34944, CA40360, CA097193, HL26490, and HL34595. PHS members are grateful to the participants and staff of the Physicians' Health Study and Health Professionals Follow-Up Study for their valuable contributions, as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, and WY. PLCO: This PLCO study was supported by the Intramural Research Program of the Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIHPLCO thanks Drs. Christine Berg and Philip Prorok, Division of Cancer Prevention at the National Cancer Institute, the screening center investigators and staff of the PLCO Cancer Screening Trial for their contributions to the PLCO Cancer Screening Trial. We thank Mr. Thomas Riley, Mr. Craig Williams, Mr. Matthew Moore, and Ms. Shannon Merkle at Information Management Services, Inc., for their management of the data and Ms. Barbara O'Brien and staff at Westat, Inc. for their contributions to the PLCO Cancer Screening Trial. We also thank the PLCO study participants for their contributions to making this study possible. Poland: None reported PROCAP: PROCAP was supported by the Swedish Cancer Foundation (08-708, 09-0677). PROCAP thanks and acknowledges all of the participants in the PROCAP study. We thank Carin Cavalli-Björkman and Ami Rönnberg Karlsson for their dedicated work in the collection of data. Michael Broms is acknowledged for his skilful work with the databases. KI Biobank is acknowledged for handling the samples and for DNA extraction. We acknowledge The NPCR steering group: Pär Stattin (chair), Anders Widmark, Stefan Karlsson, Magnus Törnblom, Jan Adolfsson, Anna Bill-Axelson, Ove Andrén, David Robinson, Bill Pettersson, Jonas Hugosson, Jan-Erik Damber, Ola Bratt, Göran Ahlgren, Lars Egevad, and Roy Ehrnström. PROGReSS: The PROGReSS study is founded by grants from the Spanish Ministry of Health (INT15/00070; INT16/00154; FIS PI10/00164, FIS PI13/02030; FIS PI16/00046); the Spanish Ministry of Economy and Competitiveness (PTA2014-10228-I), and Fondo Europeo de Desarrollo Regional (FEDER 2007-2013). ProMPT: Founded by CRUK, NIHR, MRC, Cambride Biomedical Research Centre ProtecT: Founded by NIHR. ProtecT and ProMPT would like to acknowledge the support of The University of Cambridge, Cancer Research UK. Cancer Research UK grants (C8197/A10123) and (C8197/A10865) supported the genotyping team. We would also like to acknowledge the support of the National Institute for Health Research which funds the Cambridge Bio-medical Research Centre, Cambridge, UK. We would also like to acknowledge the support of the National Cancer Research Prostate Cancer: Mechanisms of Progression and Treatment (PROMPT) collaborative (grant code G0500966/75466) which has funded tissue and urine collections in Cambridge. We are grateful to staff at the Welcome Trust Clinical Research Facility, Addenbrooke's Clinical Research Centre, Cambridge, UK for their help in conducting the ProtecT study. We also acknowledge the support of the NIHR Cambridge Biomedical Research Centre, the DOH HTA (ProtecT grant), and the NCRI/MRC (ProMPT grant) for help with the bio-repository. The UK Department of Health funded the ProtecT study through the NIHR Health Technology Assessment Programme (projects 96/20/06, 96/20/99). The ProtecT trial and its linked ProMPT and CAP (Comparison Arm for ProtecT) studies are supported by Department of Health, England; Cancer Research UK grant number C522/A8649, Medical Research Council of England grant number G0500966, ID 75466, and The NCRI, UK. The epidemiological data for ProtecT were generated though funding from the Southwest National Health Service Research and Development. DNA extraction in ProtecT was supported by USA Dept of Defense award W81XWH-04-1-0280, Yorkshire Cancer Research and Cancer Research UK. The authors would like to acknowledge the contribution of all members of the ProtecT study research group. The views and opinions expressed therein are those of the authors and do not necessarily reflect those of the Department of Health of England. The bio-repository from ProtecT is supported by the NCRI (ProMPT) Prostate Cancer Collaborative and the Cambridge BMRC grant from NIHR. We thank the National Institute for Health Research, Hutchison Whampoa Limited, the Human Research Tissue Bank (Addenbrooke's Hospital), and Cancer Research UK. PROtEuS: PROtEuS was supported financially through grants from the Canadian Cancer Society (13149, 19500, 19864, 19865) and the Cancer Research Society, in partnership with the Ministère de l'enseignement supérieur, de la recherche, de la science et de la technologie du Québec, and the Fonds de la recherche du Québec - Santé.PROtEuS would like to thank its collaborators and research personnel, and the urologists involved in subjects recruitment. We also wish to acknowledge the special contribution made by Ann Hsing and Anand Chokkalingam to the conception of the genetic component of PROtEuS. QLD: The QLD research is supported by The National Health and Medical Research Council (NHMRC) Australia Project Grants (390130, 1009458) and NHMRC Career Development Fellowship and Cancer Australia PdCCRS funding to J Batra. The QLD team would like to acknowledge and sincerely thank the urologists, pathologists, data managers and patient participants who have generously and altruistically supported the QLD cohort. RAPPER: RAPPER is funded by Cancer Research UK (C1094/A11728; C1094/A18504) and Experimental Cancer Medicine Centre funding (C1467/A7286). The RAPPER group thank Rebecca Elliott for project management. SABOR: The SABOR research is supported by NIH/NCI Early Detection Research Network, grant U01 CA0866402-12. Also supported by the Cancer Center Support Grant to the Cancer Therapy and Research Center from the National Cancer Institute (US) P30 CA054174. SCCS: SCCS is funded by NIH grant R01 CA092447, and SCCS sample preparation was conducted at the Epidemiology Biospecimen Core Lab that is supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA68485). Data on SCCS cancer cases used in this publication were provided by the Alabama Statewide Cancer Registry; Kentucky Cancer Registry, Lexington, KY; Tennessee Department of Health, Office of Cancer Surveillance; Florida Cancer Data System; North Carolina Central Cancer Registry, North Carolina Division of Public Health; Georgia Comprehensive Cancer Registry; Louisiana Tumor Registry; Mississippi Cancer Registry; South Carolina Central Cancer Registry; Virginia Department of Health, Virginia Cancer Registry; Arkansas Department of Health, Cancer Registry, 4815 W. Markham, Little Rock, AR 72205. The Arkansas Central Cancer Registry is fully funded by a grant from National Program of Cancer Registries, Centers for Disease Control and Prevention (CDC). Data on SCCS cancer cases from Mississippi were collected by the Mississippi Cancer Registry which participates in the National Program of Cancer Registries (NPCR) of the Centers for Disease Control and Prevention (CDC). The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the CDC or the Mississippi Cancer Registry. SCPCS: SCPCS is funded by CDC grant S1135-19/19, and SCPCS sample preparation was conducted at the Epidemiology Biospecimen Core Lab that is supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA68485). SEARCH: SEARCH is funded by a program grant from Cancer Research UK (C490/A10124) and supported by the UK National Institute for Health Research Biomedical Research Centre at the University of Cambridge. SNP_Prostate_Ghent: The study was supported by the National Cancer Plan, financed by the Federal Office of Health and Social Affairs, Belgium. SPAG: Wessex Medical ResearchHope for Guernsey, MUG, HSSD, MSG, Roger Allsopp STHM2: STHM2 was supported by grants from The Strategic Research Programme on Cancer (StratCan), Karolinska Institutet; the Linné Centre for Breast and Prostate Cancer (CRISP, number 70867901), Karolinska Institutet; The Swedish Research Council (number K2010-70X-20430-04-3) and The Swedish Cancer Society (numbers 11-0287 and 11-0624); Stiftelsen Johanna Hagstrand och Sigfrid Linnérs minne; Swedish Council for Working Life and Social Research (FAS), number 2012-0073STHM2 acknowledges the Karolinska University Laboratory, Aleris Medilab, Unilabs and the Regional Prostate Cancer Registry for performing analyses and help to retrieve data. Carin Cavalli-Björkman and Britt-Marie Hune for their enthusiastic work as research nurses. Astrid Björklund for skilful data management. We wish to thank the BBMRI.se biobank facility at Karolinska Institutet for biobank services. PCPT & SELECT are funded by Public Health Service grants U10CA37429 and 5UM1CA182883 from the National Cancer Institute. SWOG and SELECT thank the site investigators and staff and, most importantly, the participants who donated their time to this trial. TAMPERE: The Tampere (Finland) study was supported by the Academy of Finland (251074), The Finnish Cancer Organisations, Sigrid Juselius Foundation, and the Competitive Research Funding of the Tampere University Hospital (X51003). The PSA screening samples were collected by the Finnish part of ERSPC (European Study of Screening for Prostate Cancer). TAMPERE would like to thank Riina Liikanen, Liisa Maeaettaenen and Kirsi Talala for their work on samples and databases. UGANDA: None reported UKGPCS: UKGPCS would also like to thank the following for funding support: The Institute of Cancer Research and The Everyman Campaign, The Prostate Cancer Research Foundation, Prostate Research Campaign UK (now Prostate Action), The Orchid Cancer Appeal, The National Cancer Research Network UK, The National Cancer Research Institute (NCRI) UK. We are grateful for support of NIHR funding to the NIHR Biomedical Research Centre at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust. UKGPCS should also like to acknowledge the NCRN nurses, data managers, and consultants for their work in the UKGPCS study. UKGPCS would like to thank all urologists and other persons involved in the planning, coordination, and data collection of the study. ULM: The Ulm group received funds from the German Cancer Aid (Deutsche Krebshilfe). WUGS/WUPCS: WUGS would like to thank the following for funding support: The Anthony DeNovi Fund, the Donald C. McGraw Foundation, and the St. Louis Men's Group Against Cancer.
PLCG1 R707Q mutation is counter selected under targeted therapy in a patient with a hepatic angiosarcoma
An important potential enabling resource for Personalized Medicine is the combination of a DNA repository with Electronic Medical Record (EMR) systems sufficiently robust to provide excellence in clinical care and to serve as resources for analysis of disease susceptibility and therapeutic outcomes across patient populations. The Vanderbilt EMR is a state of the art clinical and research tool (that includes >1.7 million records), and is associated with a DNA repository which has been in development for over 3 years; these are the key components of VGER, the Vanderbilt Genome-Electronic Records project, a part of NHGRI's eMERGE network. The VGER model acquires DNA from discarded blood samples collected from routine patient care, and can link these to de-identified data extracted and readily updated from the EMR. The phenotype we analyze here is the QRS duration on the electrocardiogram, since slow conduction (indicated by longer QRS duration) is a marker of arrhythmia susceptibility.
Studies of long-lived individuals have revealed few genetic mechanisms for protection against age-associated disease. Therefore, we pursued genome sequencing of a related phenotype-healthy aging-to understand the genetics of disease-free aging without medical intervention. In contrast with studies of exceptional longevity, usually focused on centenarians, healthy aging is not associated with known longevity variants, but is associated with reduced genetic susceptibility to Alzheimer and coronary artery disease. Additionally, healthy aging is not associated with a decreased rate of rare pathogenic variants, potentially indicating the presence of disease-resistance factors. In keeping with this possibility, we identify suggestive common and rare variant genetic associations implying that protection against cognitive decline is a genetic component of healthy aging. These findings, based on a relatively small cohort, require independent replication. Overall, our results suggest healthy aging is an overlapping but distinct phenotype from exceptional longevity that may be enriched with disease-protective genetic factors.
Follicular lymphoma (FL) is usually an indolent malignant B-cell lymphoma, but at least 30% of the patients experience transformation to Diffuse Large B-cell Lymphoma (DLBCL) with inferior survival. Early identification of patients with poor prognosis is essential for tailoring treatment. Here, we describe a network-based integrative analysis of 168 FL-biopsies from 127 patients with long clinical follow-up and transformation status. A distinct shift in expression towards a DLBCL expression phenotype is observed in BTK associated genes in cases with transformation. This shift is present years prior to transformation. An integrated score combining the BTK score and a clinical index is proposed, termed the BTK-FLIPI score. It identifies patients (16%) with very poor prognosis, with median time to progression 21.7 months (95% CI: 14.3-42.6) versus 70.9 months (95% CI: 41.5-95.0) for the remainder. The results suggest BTK as a potential target for treatment of high-risk FL-patients.
Gliosarcoma is a variant of glioblastoma with equally poor prognosis and characterized by mixed glial and mesenchymal pathology. Metastasis is not uncommon but involvement of the spinal cord is rare, and comprehensive genetic characterization of spinal gliosarcoma is lacking. We describe a patient initially diagnosed with a low-grade brain glioma via biopsy, followed by adjuvant radiation and temozolomide treatment. Nearly two years after diagnosis, she developed neurological deficits from an intradural, extramedullary tumor anterior to the spinal cord at T4, which was resected and diagnosed as gliosarcoma. Whole-exome sequencing (WES) of this tumor revealed a hypermutated phenotype, characterized by somatic mutations in key DNA mismatch repair (MMR) pathway genes, an abundance of C>T transitions within the identified somatic SNVs, and microsatellite stability, together consistent with temozolomide-mediated hypermutagenesis. This is the first report of a hypermutator phenotype in gliosarcoma, which may represent a novel genomic mechanism of progression from lower-grade glioma.
Acute myeloid leukemia (AML) is an aggressive blood cancer with poor prognosis. We report a comprehensive proteogenomic analysis of bone-marrow biopsies from 252 uniformly treated AML patients to elucidate the molecular pathophysiology of AML in order to inform future diagnostic and therapeutic approaches. In addition to in-depth quantitative proteomics, our analysis includes cytogenetic profiling and DNA/RNA sequencing. We identify five proteomic AML subtypes, each reflecting specific biological features spanning genomic boundaries. Two of these proteomic subtypes correlate with patient outcome, but none are exclusively associated with specific genomic aberrations. Remarkably, one subtype (Mito-AML), which is only captured in the proteome, is characterized by high expression of mitochondrial proteins and confers poor outcome, with reduced remission rate and shorter overall survival upon treatment with intensive induction chemotherapy. Functional analyses reveal that Mito-AML is metabolically wired towards stronger complex I dependent respiration and is more responsive to treatment with the BCL2 inhibitor venetoclax.
Retinoblastoma (Rb) is a deadly childhood eye cancer that is classically initiated by inactivation of the RB1 tumor suppressor. Clinical management continues to rely on nonspecific chemotherapeutic agents that are associated with treatment resistance and toxicity. Here, we analyzed whole exomes from primary Rb tumors to identify novel Rb dependencies. Several recurrent genomic aberrations implicate estrogen-related receptor gamma (ESRRG) in Rb pathogenesis. ESRRG is an essential mediator of hypoxic adaptation and cell survival in retinoblastoma, representing a promising new therapeutic target.
Methylmercury is a potent neurotoxin, and the fetal period is the most vulnerable exposure period. There is significant variability in methylmercury metabolism, which has been attributed to differences in the structure and function of the gut microbiome. Our main objective was to better understand the interplay between gut microorganisms and methylmercury metabolism during pregnancy. To address this aim, associations were investigated between maternal biomarkers (blood, hair, stool) for prenatal methylmercury exposure and maternal gut microbiota during early and late gestation.
Urothelial carcinoma (UC) is the most prevalent cancer of kidney transplant recipients in some regions. The integration of BK polyomavirus (BKPyV) genome into the human genome is observed in some cases of UC. The prevalence of aristolochic acid (AA) mutational signatures in urothelial carcinoma is high in the general population of Taiwan. To better understand the mechanisms of UC carcinogenesis in kidney transplant recipients, 19 UC specimens were analyzed by whole genome sequencing (WGS) and Hybrid Sequencing.
Glioblastoma is a highly aggressive type of brain tumour for which there is no curative treatment available. Immunotherapies have shown limited responses in unselected glioblastoma patients, and it is currently not well understood what separates the few responders from the non-responders at the cellular and molecular level. Here we investigate the phenotypic and transcriptional evolutionary dynamics at single-cell resolution during nivolumab immune checkpoint treatment of eight glioblastoma patients enrolled in a translational phase I/II clinical trial.
Background: Metastatic castration-resistant prostate cancer (mCRPC) is a heterogeneous disease in which molecular stratification is needed to improve clinical outcomes. The identification of predictive biomarkers can have a major impact on the care of these patients, but the availability of metastatic tissue samples for research in this setting is limited. Objective: To study the prevalence of immune biomarkers of potential clinical utility to immunotherapy in mCRPC by tumour RNA sequencing, and to determine their association with overall survival (OS).
Barrett’s oesophagus is a precursor of oesophageal adenocarcinoma. In this common condition, squamous epithelium in the oesophagus is replaced by columnar epithelium in response to acid reflux. Barrett’s oesophagus is highly heterogeneous and its relationships to normal tissues are unclear. We investigated the cellular complexity of Barrett’s oesophagus and the upper gastrointestinal tract using RNA-sequencing of single cells from multiple biopsies from six patients with Barrett’s oesophagus and two patients without oesophageal pathology.
Immune memory is key to effective antimicrobial responses, but the impact of mRNA vaccines on this process is not fully understood. Our research shows that SARS-CoV-2 mRNA vaccines alter the epigenetic profile of human macrophages, specifically enhancing histone acetylation, which is linked to immune training. Significant epigenetic changes, along with increased cytokine release, require two vaccine doses. However, these effects diminish over time but can be restored with a booster dose six months later, maintaining a strong pro-inflammatory response.
Gallbladder carcinoma is the most common cancer of the biliary tract with dismal survival largely due to delayed diagnosis. Biliary tract intraepithelial neoplasia (BilIN) is the common benign tumor that is suspected to be precancerous lesions. However, the genetic and evolutionary relationships between BilIN and carcinoma remain unclear. Here we performed whole-exome sequencing of coexisting low-grade BilIN (adenoma), high-grade BilIN, and carcinoma lesions, and normal tissues from the same patients.
Endometrial carcinoma, the most common gynecologic cancer, develops from endometrial epithelium which is composed of secretory and ciliated cells. Pathologic classification is unreliable and there is a need for prognostic tools. We used single cell sequencing to study organoid model systems derived from normal endometrial endometrium to discover novel markers specific for endometrial ciliated or secretory cells. We performed single cell sequencing on endometrial and ovarian tumors and found both secretory-like and ciliated-like tumor cells.
The inhibitor of DNA-binding 3 (ID3) is a transcriptional repressor protein that limits the binding of basic helixloop-helix (bHLH) transcription factors to DNA. Preliminary results from our lab confirm that cellular depletion of ID3 is associated with an impaired DNA damage response. we hypothesize that ID3 has the potential to epigenetically affect DNA repair. This is based on our observation that upon cellular ID3 knockdown, a reduction of chromatin-bound H4K16ac and H3K36me3.
The IMBA stem cell bank, iPSC Biobank, is a collection of induced pluripotent stem cell lines (iPSC lines) produced from skin and blood cells. The iPSC Biobank is integrated into the IMBA Stem Cell Core Facility. It stores and provides high-quality reference control panels of iPSC clones as a tool to the research community. The resource is available to all scientists and commercial institutions. The ethical guidelines of the iPSC Biobank are always based on prevailing laws and internal regulations.
Low-grade serous ovarian cancer (LGSOC) is a rare disease that occurs more frequently in younger women compared with high-grade disease. The current treatment is suboptimal, and better understanding of molecular pathogenesis of this disease is needed. In this study, we compared the whole genome sequences of LGSOCs from short-term and long-term survivors (defined as <40 and >60 months, respectively). Our goal was to identify novel genomic aberrations in LGSOC, especially in short-term survivors.
Thyroid cancer is the most common endocrine malignancy. This dataset encompasses two types of thyroid cancer : anaplastic which is the most de-differentiated and aggressive one, and papillary which is the most common one. We profiled 14 patients, including 10 papillary and 4 anaplastic thyroid carcinomas, using both single nuclei RNA sequencing and spatial transcriptomics to link single cell resolution RNA sequencing with tissue morphology and better understand inter and intratumoral thyroid cancer heterogeneity.
Targeted panel sequencing data from PanNEN samples. Sample ID is annotated in the following manner: each patient is given a number and "P" is appended to the patient number if it is a primary tumor, "M” if it is metastasis and "N" if it is normal (healthy tissue) sample. All NETG1 and NETG2 samples underwent panel sequencing using a custom panel (in-house PanNEN panel). All NEC and NETG3 samples (except PNET2, PNET77P and PNET77M) underwent panel sequencing using a commercial CCP panel.
Smith-Lemli-Opitz syndrome (SLOS) is a disorder of cholesterol production by the body. It is caused by changes in the DHCR7 gene, which is the blueprint for an enzyme called 7-dehydrocholesterol- delta7-reductase. This enzyme is necessary for the production of cholesterol by all cells in the body. People with SLOS often have malformations of major organs, slow growth, feeding difficulties and intellectual disability or learning problems. Because patients with SLOS cannot make enough cholesterol, it has been proposed that cholesterol supplementation (either with egg yolk or liquid suspensions of cholesterol) could help improve the symptoms of the disease. However, despite the widespread use of cholesterol supplementation, it is still not known whether it works or not. The study will try to provide an answer to this question by studying the disease and its progression while patients are receiving cholesterol. The clinical features of SLOS are thought to be related to low cholesterol and buildup of toxic cholesterol precursors (substances from which cholesterol is formed). But how exactly low cholesterol and toxic precursors contribute to the disease is poorly understood. This knowledge is critically important because it should help discover new therapeutic targets and develop treatments of the disease in the long run. The study will try to fill this gap with a comprehensive clinical and biochemical testing of the study participants over the course of several years. Last, a limitation of previous SLOS research studies has been their low number of participants. This is understandable because SLOS is a rare diseases and few research groups are working on it. However, in order to fully understand the disease, researchers need to study as many patients as possible. This study is unique in that it is run by a network of several highly specialized clinical research sites across the country. Having several sites involved increases the researcher's ability to recruit and study large number of patients, and centralize patients' information in a comprehensive SLOS clinical registry. This registry will be key to identify markers for diagnostic testing, screening and measuring outcomes in future studies of treatment. The purpose of this study is to learn as much as possible about Smith-Lemli-Opitz Syndrome (SLOS) by following a large group of affected children and adults over time. In this study, we will measure cholesterol and other similar chemicals in blood and urine, evaluate development and behavior, do limited medical evaluation, and carry out brain imaging studies.This study will help researchers: learn more about what causes SLOS and how SLOS changes with age, note differences in features of SLOS among those affected, evaluate the effect of giving extra cholesterol in this condition, and develop ways to evaluate whether treatments developed in the future will be helpful.
Individuals with a past diagnosis of severe combined immune deficiency (including many cases of "leaky SCID", Omenn syndrome, and reticular dysgenesis) who have undergone blood and marrow transplant, gene therapy, or enzyme replacement in the past may be eligible for 6902. The purpose of 6902 is very similar to 6901, except 6902 is looking backwards at what has already been done in the past (compared to 6901 which is looking into the future). Over 800 patients with SCID are expected to be enrolled on 6902. This makes 6902 the largest study ever to describe outcomes for patients with SCID treated at many different hospitals around North America. One of the most important components of the 6902 study is the "cross sectional" study. Patients who have received their treatments (BMT, gene therapy, enzyme replacement) many years ago are asked to come back to the hospital where they were treated. During this visit, additional research blood work is drawn and information is gathered regarding long-term transplant outcomes such as infections, graft-versus-host disease, autoimmune diseases, and quality of life. This will allow PIDTC researchers to better understand long-term outcomes from procedures that occurred many years ago (sometimes over 30 years ago) - something that is not possible at the present time with 6901. This will help researchers to best design new treatments and clinical trials in the future for children with SCID.
This study is part 1 of 2 for Schumacher et al, Nat Commun. 2015 Jul 7;6:7138. Part 1 is the original GWAS data for 9,259 participants on the Affymetrix Axiom platform of the studies listed below in the Inclusion Criteria.
Prostate cancer is readily curable if detected early. The goal of this research is to assess the molecular alterations that contribute to prostate cancer development, and how the interaction of tumor-cell autonomous alterations and immune cell proportions contribute to the predictability of recurrence.
This study is from a phase I clinical trial of neoantigen DNA vaccines in triple negative breast cancer patients with persistent disease following neoadjuvant chemotherapy. The exome and RNA sequences used to identify somatic mutations, predict neoantigens, and design vaccines is within.
The goal of the study is to evaluate response of PitNET to drug therapy before surgery. Patients with and without drug therapy are compared in RNA level. The RNA is extracted from surgery material, sequenced and deferentially expressed genes (DEG) determined.
This submission contains single-cell genome and transcriptome data that is used for the following manuscript: https://doi.org/10.1101/2023.01.13.521174. All Illumina data deposited here is retrieved from HCC38(BL) cell lines and a melanoma PDX model as described in the paper.
Nonalcoholic fatty liver disease (NAFLD) is the most common cause of chronic liver disease in children in the United States. NAFLD is associated with obesity and metabolic syndrome; however, there is limited understanding of the development and progression of NAFLD. There is evidence of abnormalities of bacterial colonization, and intestinal bacterial product-induced inflammation associated with NAFLD and disease progression. The goal of this study was to characterize the intestinal microbiome in pediatric participants with NAFLD and in both obese and normal weight controls to determine the relationship between alterations in the intestinal microbiome, inflammation, and the development of NAFLD. We hypothesized that alterations in the intestinal microbiome are associated with increased systemic inflammation and the development and severity of NAFLD.
Niemann-Pick disease, type C1 (NPC1) is neurodegenerative disorder due to pathological variants of NPC1. NPC1 is predominately a pediatric/adolescent disorder, although adult onset cases have been described. The NPC1 protein functions to move unesterified cholesterol from the endolysosomal lumen to make it bioavailable for cellular function. Impaired NPC1 function results in endolysosomal accumulation of unesterified cholesterol and other lipids. The NPC1 phenotype is extremely heterogeneous, both with respect to age of onset and sign/symptom complex. Comparison of individuals homozygous for the common p.I1061T variant and affected siblings suggests that other genes can significantly modify the clinical phenotype. The goal of this project is to obtain genome sequencing on ~200 individuals with NPC1 in order to facilitate identifying potential genetic modifiers.