Paired PCR-free whole genome sequencing data of a matched metastatic melanoma cell line (COLO829) and normal across three lineages and across separate institutions, with independent library preparations, sequencing, and analysis. The data was generated with mean mapped coverages of 99X for COLO829 and 103X for the paired normal across three institutions. Overall, common events include >35,000 point mutations, 446 small insertion/deletions, and >6,000 genes affected by copy number changes. We present this reference to the community as an initial standard for enabling quantitative evaluation of somatic mutation pipelines across institutions.
The Cancer Immune Monitoring and Analysis Centers (CIMACs) and the Cancer Immunologic Data Commons (CIDC) has engaged in efforts to harmonize Whole-exome (WES) and RNA-sequencing (RNA-seq) data from three different experimental platforms (MD Anderson, NCI MoCha lab, and Broad Institute) and are using a series of developed pipelines to process the early trial samples. To evaluate the consistency of tumor WES and RNA-seq profiling platforms across different centers, the CIMACs-CIDC conducted a systematic harmonization study. DNA and RNA were centrally extracted from fresh frozen (FF) and formalin-fixed paraffin-embedded (FFPE) non-small cell lung carcinoma (NSCLC) tumors and distributed to three centers for WES and RNA-seq profiling
Submission FAQ Before Submission Is the EGA the right archive for my data? The most suitable archive for your data is dependent on the type of data you are wishing to submit and whether the data require public or controlled access. Public access is defined as complete and open access to all submitted data. On the contrary, controlled access, exerted by the EGA, requires formal applications to be made to access the submitted data files and metadata. EGA only accepts human-derived data subject to controlled access. If your submission contains other types of data, please choose the appropriate repository for it (see image below): ENA, EVA, ArrayExpress, BioSD and GWAS catalog. Should your submission be subjected to controlled access? Data access conditions are normally defined in the original informed consent agreements signed by the participants involved in your study. All data submitted to the EGA is subject to controlled access. These consents prevent the derived data files, potentially identifiable, from being dispersed by open and public access. Controlled access data often refers to human data derived from medical research and consortium projects. If in doubt, consult the informed consent agreements that apply to your study The EGA enables you to hold a submission before publication. What data types can be submitted to the EGA? Data types accepted by the EGA can be split into three categories: Sequences: both in generic and platform-specific formats. Array-based: from raw signal files to processed matrices. Phenotypes: all possible phenotype formats are accepted. All manufacturer-specific raw data formats derived from major next generation sequencing platforms are accepted. Also generic sequence formats: flat reads in a FASTQ file, aligned sequences (BAM or CRAM files) as well as sequence variation files in VCF format. All array-based technologies are accepted, including raw data, intensity and analysis files, without any restriction on data formats accepted. We also accept and distribute phenotype data (associated to the samples) in almost any format: from an image to a README file. How long does a submission take? Submissions to EGA come in a variety of formats and sizes, thus it is difficult for us to exactly predict how long a submission will take. We, therefore, advise all of our submitters to allow as much time as possible to make a submission. Based on previous records, we anticipate that the submission process may take at least one month. The submitter’s familiarity with the procedures, possible technical issues that may arise during submission and the amount of data that is being submitted are the main factors that will affect the length of the submission process. However, each step of a regular submission should be considered when estimating the time it would take: Encryption of the files Upload of the files Metadata submission Archival of the files Release of the study and datasets to EGA webpage For example, the upload of the files depends on the submission size, while metadata submission mainly relies on each submitter’s expertise. Further, some steps (e.g. answering to inquiries) depend on the EGA Helpdesk team, which may take some days to be processed during busy times Is data deposited in the EGA secure? The EGA set-up consists of a secure computing facility for data processing, a shared EBI set-up for data submissions and distribution of data via data requests made through the EGA website. Data is also copied in the Barcelona Supercomputing Center (BSC) infrastructure, where all stored and distributed data is encrypted Data is encrypted along the submission process and stored securely, granting its access to authorised users exclusively. During the download process, through our Python Client or Aspera, all requested data is downloaded over secure https connections. All data at the EGA is encrypted, and only accessible (for log-in and download) through secure protocols. For further information please, visit our security overview. What documentation do I need to provide? All submissions require policy’s documentation: 'Data Access Agreement (DAA)', 'Data Processing Agreement (DPA)' and 'Authorized Submitters Formulary'. The data processors (EGA) and the data owners will also sign the DPA. Will all metadata be public? Among the submitted metadata we need to make the distinction between identifiable and unidentifiable metadata: (1) the former may allow the identification of the human the sample derived from (e.g. detailed geographical providence, personal name, family ancestry…); (2) while the latter can be used to interpret the data without compromising the anonymity of the patients. The majority of the metadata submitted to the EGA corresponds to the unidentifiable category (e.g. sequencer's model). This type of metadata is publicly available on the EGA website and other EBI resources/partners’ websites. On the other hand, some parts samples’ metadata are subject to being identifiable, and thus only accessible by authorized data requesters, with the exception of: 5 submitter-defined attributes of the sample: alias, title, subject_id, gender and phenotype. It is the submitter’s responsibility not to submit sensitive metadata in these public fields. 3 anonymised fields that pinpoint the sample record in archivals: sample’s EGA stable ID (EGAN…), BioSample ID (SAMEA…) and submitter’s center name. During Submission Are there any sample specific requirements for EGA? All samples submitted to the EGA must include the attributes of biological sex, subject ID (anonymised individual identifier) and phenotype information. These are critical for data findability and its analysis, and we highly recommend using controlled ontology terms where applicable. For example: defining tumour and non-tumour samples and/or defining disease state. The EGA recommends using the Experimental Factor Ontology Database to find ontologized terms that describe your sample phenotypes. How do I get an accession number to use in my publication? You will receive your study accession number (EGAS…) upon complete your submission, either: Programmatically. As soon as the metadata is submitted and validated your study will be assigned an accession number that will be given in the submission’s response. Manually registering your study and relevant metadata using the online metadata submission tool: the EGA submitter Portal. How are files uploaded to the EGA? Data files are uploaded into private submission drop boxes (i.e. environments to which you are granted access and where you can transfer your files) using INBOX or FTP. These spaces are provided as part of the submission procedure. Before uploading any file, you must encrypt your files, . Only encrypted files shall be uploaded to the drop boxes. Why does data need to to encrypted for my submitted files? It is one of the security steps the EGA has implemented. In case of a security breach, people without the proper encryption key will not be able to read or use the information that could have been leaked. This measure is essential when working with sensitive data, such as controlled access human data. All submitters must use crypt4gh to create EGA compliant files prior to uploading them. This encryption is GPG-based, using EGA’s public key. Why are my files not available if I see them in the INBOX? There exists a time window between the data upload and the availability of such files via the Submitter Portal. For this reason, some metadata (run and analysis objects) cannot be registered until at least 24 hours after the files have been uploaded to your box. Why are MD5 sum values generated for my submitted files? We require pre- and post- encryption MD5 (message-digest) checksum values to be provided for all submitted files. These 128-bit values are computed using the content of each file, creating unique sequences that allow us to ensure that file integrity has been maintained during the transfer process. In other words, if the MD5 checksums we generate and those you generated match, we infer that the content of the transferred files is correct (i.e. files are not corrupted or truncated). MD5 checksums are computed automatically using the crypt4gh tool provided. Your submission will not be accepted and may be significantly delayed if you do not provide MD5 checksum values for all data files in the required format. How can I check if my files are correctly uploaded to the inbox? It is important to check the status pf your file so you know whether your files are in the inbox, being processed, or if there is any issue with one of them. In order to check this, you should: Look for the file locally. Drag and drop it to the file table. Then bars will appear on the table, which means that we are processing it. Green: The files checksum are correct and your file will move to “ingested files”. No further actions are needed from your end.. Red: The files checksum does not match and your file needs to be re-uploaded. Please re-upload relevant files to your inbox using the same path. After Submission How do I use my accession number in my publication? We suggest the use of the below template, using your study accession ID (EGAS…) : Data has been deposited at the European Genome-phenome Archive (EGA), which is hosted by the EBI and the CRG, under accession number EGASXXXXXXXXXXX. Further information about EGA can be found at https://ega-archive.org and "The European Genome-phenome Archive of human data consented for biomedical research" Your study ID will be the one that groups your whole submission, and thus its usage is recommended as such. Nevertheless, all metadata submitted to EGA hold a unique and persistent identifier (starting with EGA…) that can be used to identify specific records. For example, you could reference a specific dataset (EGAD…) or sample (EGAN…) in your publications (see full list of identifiers). How do I make my data searchable? Once you have finalized your submission, you can schedule the data release. Please take into account that the release process needs time for the files to be archived in our system, and for the Helpdesk team to validate your submission. Can I withdraw (meta)data from the EGA? We have methods in place for the secure removal of deposited (meta)data. Contact EGA-helpdesk for further details. EGA complies with FAIRness of (meta)data, and thus, even when the data is removed we keep an entry for their identifiers in our system. In other words, we execute a soft delete on canceled objects (e.g. a study): metadata is still stored in our systems, but it loses all links, cannot be queried and data files cannot be retrieved anymore. The reason behind this behaviour is so that queries using withdrawn data properly respond back (see example of a canceled study). What happens to the data once it has been submitted to the EGA? When the data is submitted, the submitter can choose either keep their data private or schedule the release of their data.
Forty years ago, the first measurements of cell free DNA (cfDNA) from cancer patients revealed that its total concentration was often elevated compared to healthy controls. The basis for this increase, and in particular the source of the extra cfDNA in such patients, has never been determined. To address this issue, we assessed methylation patterns of cfDNA from 57 individuals, 34 of whom had unequivocally elevated cfDNA concentrations (i.e., >15 ng/mL of plasma). We found that the excess cfDNA in cancer patients did not come from either neoplastic cells or surrounding cells from the tumor's tissue of origin. Instead, 80% of the cfDNA was derived from neutrophils, B cells or T cells, with neutrophils accounting for ~2/3. B cells contributed more than T cells despite the great excess of T cells over B cells in the blood. To see if this result was generalizable to other clinical situations in which elevated cfDNA concentrations have been described, we evaluated nine patients before and one day following surgery. After surgical resection, cfDNA concentration increased by an average of 16.1-fold, as expected. The majority of this increase (55%) was contributed by leukocytes, though damaged hepatocytes contributed a substantial minority (31% of increase). These data suggest that the elevated concentrations of cfDNA in cancer patients, as with patients after surgery, arises largely from dying leukocytes—not from the neoplastic cells or the tissue of origin of the cancer.
For ENOC cohorts, OvCaRe cases were reviewed, including frozen material, by at least two expert gynecopathologists prior to inclusion in the sequencing cohort who provided the confirmation on final selected cohort. Frozen H&E from Tokyo were also used for evaluation along with representative H&E photos and review done at the Jikei School of Medicine. For ENOC, DAH985 and DG1288 are recurrent and both were treated with chemotherapy after their first surgery. DAH123 is a untreated sample, metastasis from an primary endometrial tumour. All HGSC, GCT, CCOC and the rest ENOC tumours are primary tumour samples. Library construction and sequencing Frozen specimens with >50% tumour cellularity (based on initial slide review) were used for cryosectioning and subsequent nucleic acid extraction. Patient tumour and normal blood samples derived from primary, untreated fresh frozen tumour specimens harvested at diagnosis during standard of care debulking surgery. Germline DNA was provided from peripheral blood buffy coat on all specimens except 13 from Tokyo, where non-cancer frozen tissue was used as a germline source. DNA extraction from both matched normal (blood) and tumour samples (frozen tissue) were performed using the QIAamp Blood and Tissue DNA kit (Qiagen) and quantified using a Qbit fluorometer and reagents (high-sensitivity assay). Three lanes of Illumina HiSeq 2500 v4 chemistry for normal samples and five lanes for tumour samples were obtained. The PCR-free protocol was adopted to eliminate the PCR-induced bias and improve coverage across the genome.
BACKGROUND TRACERx (TRAcking Cancer Evolution through therapy (Rx)) is a prospective cohort study designed to investigate intratumor heterogeneity (ITH) in relation to clinical outcome, and to determine the clonal nature of driver events and evolutionary processes in early stage non-small cell lung cancer (NSCLC). METHODS Multiregion high-depth whole-exome sequencing (M-seq) was performed on 100 early stage NSCLC tumors resected prior to systemic therapy. A total of 327 tumor regions were sequenced and analyzed to define evolutionary histories, obtain a census of clonal and subclonal events, and assess the relationship between ITH and recurrence-free survival (RFS). RESULTS Widespread ITH was observed for both somatic copy number alterations (median 48% [0.03-88%]) and mutations (median 30% [0.5-93%]). Driver mutations in EGFR, MET, BRAF and TP53 were almost always clonal. However, heterogeneous driver alterations occurring later in evolution were found in over 75% of tumors and were common in PIK3CA, NF1 and genes involved in chromatin modification and DNA response and repair. Genome doubling and ongoing dynamic chromosomal instability (CIN), illustrated by mirrored subclonal allelic imbalance, were identified as causes of ITH resulting in parallel evolution of driver copy number events, including amplifications of CDK4, FOXA1, and BCL11A. Elevated copy number heterogeneity was associated with shorter RFS (HR=4.9, P=0.00044), which remained significant in a multivariate analysis. CONCLUSIONS ITH mediated through CIN, rather than point mutational heterogeneity, was associated with increased risk of relapse, supporting its value as a prognostic predictor, and the need to target this high-risk phenotype.
Data Access NOTE: Please refer to the "Authorized Access" section below for information about how access to the data from this accession differs from many other dbGaP accessions. BiospecimensAccess to Biospecimens is through the NHLBI Biologic Specimen and Data Repository Information Coordinating Center (BioLINCC). Biospecimens from ARDSNet-ALVEOLI include Plasma and DNA. Please note that use of biospecimens in genetic research is subject to a tiered consent. Available Data Outcome data regarding organ failure free days are not available. Objectives The ARDS Network is a consortium of clinical centers and a coordinating center to design and test novel therapies for the treatment of Acute Lung Injury (ALI) and Acute Respiratory Distress Syndrome (ARDS). The ARDS Network 01/03 trials included an investigation of the efficacy and safety of Ketoconazole and Respiratory Management in the treatment of ALI and ARDS (KARMA). The Ketoconazole arm of the KARMA study was later stopped due to an inability to show efficacy. Participants continued to be randomized to the respiratory management arms of the study (ARMA), which compared two ventilator strategies: a tidal volume of 6 mL/kg versus 12 mL/kg. The LARMA phase of the study investigated the efficacy of Lisofylline and Respiratory Management. The objective of the ALVEOLI study was to compare clinical outcomes of participants with ALI and ARDS treated with a higher end-expiratory lung volume/lower FiO2 versus a lower end-expiratory lung volume/higher FiO2 ventilation strategy. The ALVEOLI study tested the hypothesis that mortality from ALI and ARDS would be reduced with a mechanical ventilation strategy designed to prevent lung injury from repeated collapse of bronchioles and alveoli at end-expiration. Background Most participants requiring mechanical ventilation for ALI and ARDS receive positive end-expiratory pressure (PEEP) of 5 to 12 cm of water. Higher PEEP levels may improve oxygenation and reduce ventilator-induced lung injury but may also cause circulatory depression and lung injury from overdistention. PEEP levels higher than traditional levels may reduce ventilator-induced lung injury by decreasing the proportion of nonaerated lung and higher PEEP levels may allow arterial-oxygenation goals to be met at a lower level of inspired oxygen (FiO2). Participants A total of 550 participants were randomized to receive mechanical ventilation with either lower or higher PEEP levels, which were set according to different tables of predetermined combinations of PEEP and fraction of inspired oxygen. Conclusions Participants with acute lung injury and ARDS who receive mechanical ventilation with a tidal-volume goal of 6 ml per kilogram of predicted body weight and an end-inspiratory plateau-pressure limit of 30 cm of water, clinical outcomes were statistically similar whether lower or higher PEEP levels are used. (Brower, et al., 1004; PMID: 15269312).
The Cleveland Family Study (CFS) is a family-based study of sleep apnea, consisting of 2,284 individuals (46% African American) from 361 families studied on up to 4 occasions over a period of 16 years. The study began in 1990 with the initial aims of quantifying the familial aggregation of sleep apnea. National Institutes of Health (NIH) renewals provided expansion of the original cohort, including increased minority recruitment, and longitudinal follow-up, with the last exam occurring in February 2006. The CFS was designed to provide fundamental epidemiological data on risk factors for sleep disordered breathing (SDB). The sample was selected by identifying affected probands who had laboratory diagnosed obstructive sleep apnea. All first-degree relatives, spouses and available second-degree relatives of affected probands were studied. In addition, during the first 5 study years, neighborhood control families were identified through a neighborhood proband, and his/her spouses and first-degree relatives. Each exam, occurring at approximately 4-year intervals, included new enrollment as well as follow up exams for previously enrolled subjects. For the first three visits, data, including an overnight sleep study, were collected in participants' homes while the last visit occurred in a general clinical research center (GCRC). Phenotypic characterization of the entire cohort included overnight sleep apnea studies, blood pressure, spirometry, anthropometry and questionnaires. Currently, data of 710 individuals are available for use through BioData Catalyst (with genotype data available through dbGaP).The National Sleep Research Resource (NSRR) is a NIH-supported sleep data repository that offers free access to large collections of de-identified physiological signals and related clinical data from a large range of cohort studies, clinical trials and other data sources from children and adults, including healthy individuals from the community and individuals with known sleep or other health disorders. The goals of NSRR are to facilitate rigorous research that requires access to large or more diverse data sets, including raw physiological signals, to promote a better understanding of risk factors for sleep and circadian disorders and/or the impact of sleep disturbances on health-related outcomes. Data from over 15 data sources and more than 40,000 individual sleep studies, many linked to dozens if not hundreds of clinical data elements, are available (as of Feb. 2022). Query tools are available to identify variables of interest, and their meta-data and provenance.
The ALCHEMIST study will accrue patients that are potentially eligible for the Intergroup adjuvant studies and perform central EGFR and ALK genotyping using a central reference laboratory certified by the Clinical Laboratory Improvement Amendments of 1988 (CLIA). Patients may either present prior to surgery with resectable NSCLC, may present following complete resection (before or after adjuvant chemotherapy). Eligibility is limited to those with NSCLC of a non-squamous histological subtype and those with adequate performance status and organ function for future trial eligibility. All subjects must submit tissue for central EGFR and ALK genotyping, as well as additional tissue for advanced genomics at the CCG. Subjects may have had local genotyping done prior to registration, however if these results shows no targetable EGFR or ALK alterations (or if it shows a KRAS mutation) then the patient will not be eligible for this screening protocol given the primary aim is to facilitate accrual to the adjuvant studies. All subjects will provide peripheral blood for matched normal DNA. All subjects (including those known to have EGFR or ALK alterations with local genotyping) will have formalin-fixed tissue collected for central genotyping. The testing will be performed at Response Genetics (Los Angeles, CA), a commercial CLIA-certified laboratory. ALK FISH will be performed using the Vysis break-apart probe and EGFR sequencing will be performed of exons 18-21. It is preferred that a full tumor block be submitted, but if unavailable, 15 unstained slides can be submitted. Genotyping results are expected to be provided to the treating clinician within 14 business days of submission so they can be used to determine eligibility for the randomized adjuvant studies, or to confirm the local results. Results will also be reported at intervals to the study team for update into the Alliance database. In addition to the commercial genotyping at Response Genetics, tissue will be collected for research genomics by CCG. For those patients with a block available, this will be forwarded to the CCG after clinical testing at Response Genetics. For patients without a block available, 15 unstained slides should be submitted to the CCG BCR for exploratory analysis. A peripheral blood specimen will also be collected and sent to the CCG BCR to use as matched normal. Specimens will be coded. Over the course of the study, the CCG will perform advanced genomic analysis of the resected lung cancer specimens in a research, non-CLIA environment. Following completion of the genomic analysis, the results can be matched with the clinical follow-up results using a link between the samples codes and the patient identifiers for correlative analyses. Subjects participating in the follow-up portion of the ALCHEMIST study, as well as those participating in the adjuvant therapeutic studies, are expected to undergo a standard-of-care diagnostic biopsy to confirm recurrence. At least two core biopsies, minimum, should be obtained as part of this recurrence biopsy. Paraffin embedded tissue from this biopsy will be used for confirmation of recurrence, for CLIA genomics to plan subsequent clinical trials, and for additional research genomics by the CCG. Plasma will additionally be collected at recurrence and sent to CCG for study of circulating free DNA (cfDNA). Data for the ALCHEMIST study is not yet available to be accessed by the public at this time; when ready it will be made available at the GDC Data Portal.
Data Access NOTE: Please refer to the “Authorized Access” section below for information about how access to the data from this accession differs from many other dbGaP accessions. Biospecimens Access to Biospecimens is through the NHLBI Biologic Specimen and Data Repository Information Coordinating Center (BioLINCC). Biospecimens from ARDSNet-ARMA/KARMA/LARMA include plasma, serum and urine. Please note that use of biospecimens in genetic research is subject to a tiered consent. Objectives The ARDS Network is a consortium of clinical centers and a coordinating center to design and test novel therapies for the treatment of Acute Lung Injury (ALI) and Acute Respiratory Distress Syndrome (ARDS). The primary objective of the KARMA trial was to investigate the efficacy and safety of Ketoconazole and Respiratory Management in the treatment of ALI and ARDS. The Ketoconazole arm of the study was later stopped due to an inability to show efficacy. Participants continued to be randomized to the respiratory management arms of the study (ARMA), which compared two ventilator strategies: a tidal volume of 6 mL/kg versus 12 mL/kg. The LARMA phase of the study investigated the efficacy of Lisofylline and Respiratory Management. Background Participants suffering from ARDS are extremely ill, require mechanical ventilation and, despite improvements in medical care and technology, had a mortality rate as high as 50 percent. An excessive inflammatory response is characteristic of ALI of which ARDS represents the most severe end of the pathophysiologic spectrum. The inflammatory response includes increased numbers of neutrophils activated to produce cytokines, proteases, and reactive oxygen intermediates. Pulmonary injury may also be enhanced by alveolar and tissue macrophages as a producer of vasoactive substances, neutrophil chemoattractants, and procoagulant substances. Ketoconazole, a synthetic antifungal imidazole, also has anti-inflammatory activities and may inhibit neutrophil recruitment via several different pathways known to be involved in the development of ALI and ARDS. Lisofylline causes a marked decrease in the circulating levels of the major oxidizable species of free fatty acids and also inhibits proinflammatory intracellular signaling. Mechanical ventilation in participants with ALI and ARDS have traditionally used tidal volumes of 10 to 15 ml per kilogram of body weight. These large tidal volumes are often necessary to achieve normal partial pressure of arterial carbon dioxide and pH, but may induce inflammatory responses through disruption of pulmonary epithelium and endothelium. Mechanical ventilation at lower tidal volumes may reduce injurious lung stretch and decrease the inflammatory response. Participants The Ketoconazole and Lisofylline trials were designed as 2 x 2 factorials and included 220 participants in each trial. A total of 860 participants were randomized into the ventilator management trial. Participants enrolled in the Lisofylline or Ketoconozole studies had to be concurrently enrolled in the ventilator management study and were first randomized into a ventilator strategy and then to drug or placebo. Conclusions Ketoconazole was found to be safe but did not reduce mortality, duration of mechanical ventilation, or improve lung function. Lisofylline was also found to be safe and to have no beneficial effect for participants with ALI or ARDS. Ventilation at lower tidal volumes resulted in reduced mortality and an increase in the number of days without ventilator support. (PMIDs: 10789668, 11902249, 10793162).
This project used NGS (next generation sequencing), using Illumina NOVASEQ 6000 and Illumina DRAGEN aligner. The dataset includes BAM and BAM.BAI files from Whole Genome Sequencing of 15 samples (8 female, 7 male of Warmian-Mazurian Voivodeship, Poland from POPULOUS collection). Library Construction Protocol: Illumina DNA PCR-Free Prep, Tagmentation. Reference Genome: GRCh37.
This project used NGS (next generation sequencing), using Illumina NOVASEQ 6000 and Illumina DRAGEN aligner. The dataset includes BAM and BAM.BAI files from Whole Genome Sequencing of 17 samples (7 female, 10 male of West Pomeranian Voivodeship, Poland from POPULOUS collection). Library Construction Protocol: Illumina DNA PCR-Free Prep, Tagmentation. Reference Genome: GRCh37.
Subpopulations and Intermediate Outcome Measures in COPD Study Description Subpopulations and intermediate outcome measures in COPD study (SPIROMICS) supports the prospective collection and analysis of phenotypic, biomarker, genetic, genomic, and clinical data from subjects with COPD for the purpose of identifying subpopulations and intermediate outcome measures. It is funded by the National Heart, Lung, and Blood Institute and is coordinated by the University of North Carolina at Chapel Hill. Molecular fingerprinting and extensive subject phenotyping will be performed to identify disease subpopulations and to identify and validate surrogate markers of disease severity, which will be useful as intermediate outcome measures for future clinical trials. Secondary aims are to clarify the natural history of COPD, to develop bioinformatic resources that will enable the utilization and sharing of data in studies of COPD and related diseases, and to create a collection of clinical, biomarker, radiographic, and genetic data that can be used by external investigators for other studies of COPD. Participating Institutions: Research participants for SPIROMICS will be enrolled, phenotyped, and followed at twelve SPIROMICS Clinical Centers: Columbia University, Temple University, Johns Hopkins University, Wake Forest University, University of Michigan, University of Illinois at Chicago, University of Iowa, University of Utah, National Jewish Health, University of California at San Francisco, and University of California at Los Angeles. The University of North Carolina at Chapel Hill serves as the Genomics and Informatics Center. The Radiology Reading Center is based at the University of Iowa. The PFT Reading Center is based at the University of California at Los Angeles. Study Design: SPIROMICS is a prospective cohort study that enrolled approximately 2,981 participants at twelve clinical centers over five years. Participants are distributed across four enrollment strata (i.e., Never-smokers, Smokers without COPD, Mild/Moderate COPD, and Severe COPD). Study Visits: Participants have up to four in-person visits (Baseline and Annual Clinic Visits at years 1, 2, 3 after Baseline). Study questionnaires and spirometry are completed at all main study visits. Blood and urine samples are collected at visits 1, 2, and 4. Sputum samples are collected at Visit 1. The CT scans are completed and Baseline and Visit 2. Participants also receive quarterly follow-up calls to assess health status and determine if an exacerbation has occurred. Substudies Bronchoscopy and Immunophenotyping Substudy The Bronchoscopy Substudy will enroll 50 subjects per site, for a total of 300 participants. These participants will be recruited across all four study strata. This substudy includes two study visits. During the first visit, sputum samples are collected for Immunophenotyping analysis at the Immunophenotyping Core Lab based at the University of Michigan. Participants then return for a second visit during which samples are collected via bronchoscopy, including bronchoalveolar lavage, epithelial brushings and bronchial biopsies. Immunophenotyping analysis is also conducted on BAL and blood collected during the bronchoscopy study visit. Repeatability Substudy The entire baseline clinic visit was repeated on 98 volunteers to determine reliability of measurement procedures. All baseline study-related procedures and questionnaires, including the CT scan, were re-administered and new samples of blood, urine, saliva, and sputum were collected. Field center staff processed these biospecimen samples according to the same protocol. Exacerbation Substudy The Exacerbation Substudy is a prospective, longitudinal observational study of up to 400 participants, which will allow the assessment of participant-driven health care utilization (HCU) events and symptom-defined exacerbation events over time. HCU-driven events are defined by change in medical treatment in response to a perceived COPD Exacerbation. Symptom-based events will be defined by using EXACT-PRO (EXacerbations of Chronic Pulmonary Disease Tool - Patient Reported Outcome), a daily symptom diary developed to measure the frequency, severity, and duration of exacerbations in clinical trials. All participants are provided with a PDA on which to complete the diary. Participants reporting a possible COPD exacerbation will be brought into the study clinic for a study visit to collect biological specimens and questionnaire data. The overall objectives of the Exacerbation Substudy are to: Obtain clinical data and specimens on SPIROMICS participants before and during an acute COPD exacerbation defined by Health care utilization triggered by the subject, or Symptomatic change (triggered by an EXACT defined threshold) Describe symptomatic changes occurring around HCU and symptom-defined (EXACT) events and their relationship to clinical and specimen data, Characterize the non-exacerbation "stable" state in COPD using the EXACT, including: Symptom variability (EXACT), Clinical data and specimen parameters during a stable state (baseline), The relationship between clinical and specimen data and symptom severity and variability. Explore the characteristics of stable patients, defined as those who do not experience HCU or symptom-defined (EXACT) events during the sub-study period, using baseline clinical data and specimens, and compare these characteristics with patients who experience HCU and/or symptom-defined events. Examine the relationship between HCU and symptom-defined exacerbation frequency during the first one-year period of follow-up for exacerbations and health outcomes, including 12-month change in lung function and COPD health status, and longer-term morbidity and mortality, with the latter derived from long-term data from the larger SPIROMICS study.
This study (10-C-0086) collected and analyzed tumor and circulating tumor omics of pediatric and young adult patients with relapsed rhabdomyosarcoma who were co-enrolled in and receiving treatment on the interventional study 17-C-0049. We sought to describe the summary genomic findings of tumors, and report on the detection of circulating tumor DNA (ctDNA) in serial samples. Tumor tissue and blood were collected and analyzed. Tumor samples were subjected to whole exome and/or whole genome sequencing paired with RNA-Seq. Cell free DNA (cfDNA) was extracted from serially collected blood samples and sequenced. Fusion status, mutations, and IGF1R and YES1 expression from tumor samples revealed the presence of a PAX3 fusion in most samples, a number of common mutations, and universal but heterogeneous expression of IGF-1R and YES1. ctDNA was detected above the 3% threshold in a majority of patients and analysis of cfDNA demonstrated an ability to monitor tumor clonal evolution.
In this study, lymphoblastoid cell lines (LCLs) derived from women with past perimenopausal depression (PMD) (n = 8) and asymptomatic controls (n = 9) were compared via poly(A)-selected transcriptome sequencing (RNA-seq). LCLs were cultured in three experimental conditions: at vehicle baseline (ovarian steroid-free), during estradiol (E2) treatment, and following E2-withdrawal. Transcriptomic analysis revealed significant differences in transcript expression in PMD LCLs compared to controls in all experimental conditions, and significant overlap in genes that were changed in PMD regardless of experimental condition. Among these, chemokine CXCL10 was upregulated in women with PMD, but most so after E2-WD and CYP7B1, an enzyme intrinsic to DHEA metabolism, was upregulated in PMD across experimental conditions. Finally, this in vitro model identified gene networks dysregulated in PMD which included inflammatory response, early/late E2-response, and cholesterol homeostasis. Data available will include the normalized counts for each subject in the study.
Lung-MAP (S1400, NCT02785952) was a multicenter, open-label, phase III randomized clinical trial. The Lung-MAP-I substudy (S1400I) was conducted from December 2015 to April 2018, through the National Clinical Trials Network and led by the SWOG Cancer Research Network. The trial compared nivolumab plus ipilimumab (nivo+ipi) with nivolumab monotherapy (nivo) in patients with chemotherapy-pretreated, immunotherapy-naïve, advanced squamous non-small cell lung carcinoma (sqNSCLC). Two hundred fifty-two patients were randomly assigned to receive nivo+ipi (n = 125) or nivo (n = 127). The clinical efficacy endpoints were overall survival, progression free survival, duration of response, and best objective response by RECIST 1.1. Multi-omic translational analysis was performed in collaboration with the the CIMAC–Cancer Immunologic Data Commons (CIDC) Network using available tumor tissue samples and blood samples (n = 160) submitted for Lung-MAP screening and provided by the SWOG tissue bank.
Induced Pluripotent Stem Cells (iPSC) derived from healthy individuals are important controls for disease modeling studies. To create a resource of genetically annotated iPSC, we reprogrammed footprint-free lines from four volunteers in the Personal Genome Project Canada (PGPC). Multilineage directed differentiation efficiently produced functional cortical neurons, cardiomyocytes and hepatocytes. Pilot users further demonstrated line versatility by generating kidney organoids, T-cells and sensory neurons. A frameshift knockout was introduced into MYBPC3 and these cardiomyocytes exhibited a hypertrophic phenotype as expected. WGS annotation of PGPC lines revealed on average 20 coding variants. Importantly, nearly all annotated PGPC and HipSci lines harboured at least one pre-existing or acquired variant with cardiac, neurological or other disease associations. Overall, PGPC lines were efficiently differentiated by multiple users into cell types found in six tissues for disease modelling, and clinical annotation highlighted variant-preferred lines for use as unaffected controls in specific disease settings.
Adjuvant nivolumab demonstrated a significant improvement in disease-free survival (DFS; primary endpoint) versus placebo in patients with resected esophageal or gastroesophageal junction cancer and residual pathologic disease following neoadjuvant chemoradiotherapy (CRT) in the phase III CheckMate 577 study, leading to global approval and widespread adoption. Exploratory biomarker analyses were performed using whole-exome sequencing, RNA sequencing, and immunohistochemistry. Hazard ratios for DFS favored nivolumab versus placebo in patients with higher inflammation and proliferation gene expression signature scores; lower M2 macrophage, endothelial, and stromal GES scores; and higher densities of CD3+/CD8- T cells and natural killer cells. Assessments of pre-CRT and post-CRT tumor tissue found post-CRT increases in programmed death ligand 1 combined positive score in 51% of patients, which appeared to be associated with greater DFS benefit. Through these analyses, we identified patient subpopulations that appear to derive improved DFS benefit from adjuvant nivolumab in this setting.
The analysis of cell-free DNA (cfDNA) in plasma represents a rapidly advancing field in medicine. cfDNA consists predominantly of nucleosome-protected DNA shed into the bloodstream by cells undergoing apoptosis. We performed whole-genome sequencing (WGS) of plasma DNA and identified two discrete regions at transcription start sites (TSS) where the nucleosome occupancy results in different read-depth coverage patterns in expressed and silent genes. By employing machine learning for gene classification we found that the plasma DNA read depth patterns from healthy donors reflected the expression signature of hematopoietic cells. In cancer patients with metastatic disease, we were able to classify expressed cancer driver genes in regions with somatic copy number gains with high accuracy. We could even determine the expressed isoform of genes with several TSSs as confirmed by RNA-Seq of the matching primary tumor. Our analyses provide functional information about the cells releasing their DNA into the circulation.
Here we analyzed the molecular profiles and clinical outcomes of 1,310 patients (proportion of rare cancers, 75%) enrolled in a prospective observational study by the German Cancer Consortium that applies whole-genome/exome and RNA sequencing to inform the care of adults with incurable cancers. Based on 472 single and six composite biomarkers, a cross-institutional molecular tumor board provided evidence-based management recommendations, including diagnostic reevaluation, genetic counseling, and experimental treatment, in 88% of cases. Recommended therapies were administered in 362 of 1,138 patients (31.8%) and resulted in significantly improved response and disease control rates (23.9% and 55.3%, respectively) compared to previous therapies, translating into a progression-free survival ratio >1.3 in 35.7% of patients.
Activating BRAF mutations are rare events in multiple myeloma but have been shown to be promising therapeutic targets in small case studies. This multicenter phase II trial assessed the efficacy and safety of the BRAF/MEK inhibitors, encorafenib and binimetinib, in patients with relapsed/refractory multiple myeloma (RRMM) carrying an activating BRAFV600E mutation. Twelve RRMM patients with a median of five prior lines of therapy were enrolled. The study met its primary endpoint with 10 patients achieving at least a partial response according to IMWG criteria (overall response rate 83.3%). Responses occurred rapidly within the first cycle and deepened over time. The median progression-free survival (PFS) was 5.6 months and overall survival was 55% at 24 months. Genomic profiling revealed RAS mutations and structural variants involving the BRAF locus to be drivers of resistance to BRAF/MEK inhibition in RRMM. This trial demonstrates the value of translationally driven clinical trials in selected patient populations.
We report herein an extensive exploratory biomarker analysis of refractory tumors taken from pediatric patients prior to receiving atezolizumab monotherapy in the phase 1-2 iMATRIX-atezolizumab trial (NCT02541604). A high percentage of CD8+ T cells and elevated protein levels of programmed cell death ligand 1 (PD-L1) were associated with progression-free survival (PFS). T-cell receptor (TCR) sequencing revealed that diverse infiltrating TCR repertoire at baseline was prognostic. We found no associations between panel-based tumor mutation burden (TMB) or specific genetic aberrations with PFS in this study. Through a pan-cancer gene co-expression network analysis, we developed a novel tumor-agnostic Pediatric Cytotoxicity and Antigen Presentation (PedCAP) signature that was associated with improved PFS in the iMATRIX-atezo study. Our study highlights features of immune response in pediatric cancers when treated with immune checkpoint inhibitors and provides a multi-biomarker pediatric immunogram framework to guide prospective clinical trials in pediatric cancers.
Original description of the study: From ELLIPSE (linked to the PRACTICAL consortium), we contributed ~78,000 SNPs to the OncoArray. A large fraction of the content was derived from the GWAS meta-analyses in European ancestry populations (overall and aggressive disease; ~27K SNPs). We also selected just over 10,000 SNPs from the meta-analyses in the non-European populations, with a majority of these SNPs coming from the analysis of overall prostate cancer in African ancestry populations as well as from the multiethnic meta-analysis. A substantial fraction of SNPs (~28,000) were also selected for fine-mapping of 53 loci not included in the common fine-mapping regions (tagging at r2>0.9 across ±500kb regions). We also selected a few thousand SNPs related with PSA levels and/or disease survival as well as SNPs from candidate lists provided by study collaborators, as well as from meta-analyses of exome SNP chip data from the Multiethnic Cohort and UK studies. The Contributing Studies: Aarhus: Hospital-based, Retrospective, Observational. Source of cases: Patients treated for prostate adenocarcinoma at Department of Urology, Aarhus University Hospital, Skejby (Aarhus, Denmark). Source of controls: Age-matched males treated for myocardial infarction or undergoing coronary angioplasty, but with no prostate cancer diagnosis based on information retrieved from the Danish Cancer Register and the Danish Cause of Death Register. AHS: Nested case-control study within prospective cohort. Source of cases: linkage to cancer registries in study states. Source of controls: matched controls from cohort ATBC: Prospective, nested case-control. Source of cases: Finnish male smokers aged 50-69 years at baseline. Source of controls: Finnish male smokers aged 50-69 years at baseline BioVu: Cases identified in a biobank linked to electronic health records. Source of cases: A total of 214 cases were identified in the VUMC de-identified electronic health records database (the Synthetic Derivative) and shipped to USC for genotyping in April 2014. The following criteria were used to identify cases: Age 18 or greater; male; African Americans (Black) only. Note that African ancestry is not self-identified, it is administratively or third-party assigned (which has been shown to be highly correlated with genetic ancestry for African Americans in BioVU; see references). Source of controls: Controls were identified in the de-identified electronic health record. Unfortunately, they were not age matched to the cases, and therefore cannot be used for this study. Canary PASS: Prospective, Multi-site, Observational Active Surveillance Study. Source of cases: clinic based from Beth Israel Deaconness Medical Center, Eastern Virginia Medical School, University of California at San Francisco, University of Texas Health Sciences Center San Antonio, University of Washington, VA Puget Sound. Source of controls: N/A CCI: Case series, Hospital-based. Source of cases: Cases identified through clinics at the Cross Cancer Institute. Source of controls: N/A CerePP French Prostate Cancer Case-Control Study (ProGene): Case-Control, Prospective, Observational, Hospital-based. Source of cases: Patients, treated in French departments of Urology, who had histologically confirmed prostate cancer. Source of controls: Controls were recruited as participating in a systematic health screening program and found unaffected (normal digital rectal examination and total PSA < 4 ng/ml, or negative biopsy if PSA > 4 ng/ml). COH: hospital-based cases and controls from outside. Source of cases: Consented prostate cancer cases at City of Hope. Source of controls: Consented unaffected males that were part of other studies where they consented to have their DNA used for other research studies. COSM: Population-based cohort. Source of cases: General population. Source of controls: General population CPCS1: Case-control - Denmark. Source of cases: Hospital referrals. Source of controls: Copenhagen General Population Study CPCS2: Source of cases: Hospital referrals. Source of controls: Copenhagen General Population Study CPDR: Retrospective cohort. Source of cases: Walter Reed National Military Medical Center. Source of controls: Walter Reed National Military Medical Center ACS_CPS-II: Nested case-control derived from a prospective cohort study. Source of cases: Identified through self-report on follow-up questionnaires and verified through medical records or cancer registries, identified through cancer registries or the National Death Index (with prostate cancer as the primary cause of death). Source of controls: Cohort participants who were cancer-free at the time of diagnosis of the matched case, also matched on age (±6 mo) and date of biospecimen donation (±6 mo). EPIC: Case-control - Germany, Greece, Italy, Netherlands, Spain, Sweden, UK. Source of cases: Identified through record linkage with population-based cancer registries in Italy, the Netherlands, Spain, Sweden and UK. In Germany and Greece, follow-up is active and achieved through checks of insurance records and cancer and pathology registries as well as via self-reported questionnaires; self-reported incident cancers are verified through medical records. Source of controls: Cohort participants without a diagnosis of cancer EPICAP: Case-control, Population-based, ages less than 75 years at diagnosis, Hérault, France. Source of cases: Prostate cancer cases in all public hospitals and private urology clinics of département of Hérault in France. Cases validation by the Hérault Cancer Registry. Source of controls: Population-based controls, frequency age matched (5-year groups). Quotas by socio-economic status (SES) in order to obtain a distribution by SES among controls identical to the SES distribution among general population men, conditionally to age. ERSPC: Population-based randomized trial. Source of cases: Men with PrCa from screening arm ERSPC Rotterdam. Source of controls: Men without PrCa from screening arm ERSPC Rotterdam ESTHER: Case-control, Prospective, Observational, Population-based. Source of cases: Prostate cancer cases in all hospitals in the state of Saarland, from 2001-2003. Source of controls: Random sample of participants from routine health check-up in Saarland, in 2000-2002 FHCRC: Population-based, case-control, ages 35-74 years at diagnosis, King County, WA, USA. Source of cases: Identified through the Seattle-Puget Sound SEER cancer registry. Source of controls: Randomly selected, age-frequency matched residents from the same county as cases Gene-PARE: Hospital-based. Source of cases: Patients that received radiotherapy for treatment of prostate cancer. Source of controls: n/a Hamburg-Zagreb: Hospital-based, Prospective. Source of cases: Prostate cancer cases seen at the Department of Oncology, University Hospital Center Zagreb, Croatia. Source of controls: Population-based (Croatia), healthy men, older than 50, with no medical record of cancer, and no family history of cancer (1st & 2nd degree relatives) HPFS: Nested case-control. Source of cases: Participants of the HPFS cohort. Source of controls: Participants of the HPFS cohort IMPACT: Observational. Source of cases: Carriers and non-carriers (with a known mutation in the family) of the BRCA1 and BRCA2 genes, aged between 40 and 69, who are undergoing prostate screening with annual PSA testing. This cohort has been diagnosed with prostate cancer during the study. Source of controls: Carriers and non-carriers (with a known mutation in the family) of the BRCA1 and BRCA2 genes, aged between 40 and 69, who are undergoing prostate screening with annual PSA testing. This cohort has not been diagnosed with prostate cancer during the study. IPO-Porto: Hospital-based. Source of cases: Early onset and/or familial prostate cancer. Source of controls: Blood donors Karuprostate: Case-control, Retrospective, Population-based. Source of cases: From FWI (Guadeloupe): 237 consecutive incident patients with histologically confirmed prostate cancer attending public and private urology clinics; From Democratic Republic of Congo: 148 consecutive incident patients with histologically confirmed prostate cancer attending the University Clinic of Kinshasa. Source of controls: From FWI (Guadeloupe): 277 controls recruited from men participating in a free systematic health screening program open to the general population; From Democratic Republic of Congo: 134 controls recruited from subjects attending the University Clinic of Kinshasa KULEUVEN: Hospital-based, Prospective, Observational. Source of cases: Prostate cancer cases recruited at the University Hospital Leuven. Source of controls: Healthy males with no history of prostate cancer recruited at the University Hospitals, Leuven. LAAPC: Subjects were participants in a population-based case-control study of aggressive prostate cancer conducted in Los Angeles County. Cases were identified through the Los Angeles County Cancer Surveillance Program rapid case ascertainment system. Eligible cases included African American, Hispanic, and non-Hispanic White men diagnosed with a first primary prostate cancer between January 1, 1999 and December 31, 2003. Eligible cases also had (a) prostatectomy with documented tumor extension outside the prostate, (b) metastatic prostate cancer in sites other than prostate, (c) needle biopsy of the prostate with Gleason grade ≥8, or (d) needle biopsy with Gleason grade 7 and tumor in more than two thirds of the biopsy cores. Eligible controls were men never diagnosed with prostate cancer, living in the same neighborhood as a case, and were frequency matched to cases on age (± 5 y) and race/ethnicity. Controls were identified by a neighborhood walk algorithm, which proceeds through an obligatory sequence of adjacent houses or residential units beginning at a specific residence that has a specific geographic relationship to the residence where the case lived at diagnosis. Malaysia: Case-control. Source of cases: Patients attended the outpatient urology or uro-onco clinic at University Malaya Medical Center. Source of controls: Population-based, age matched (5-year groups), ascertained through electoral register, Subang Jaya, Selangor, Malaysia MCC-Spain: Case-control. Source of cases: Identified through the urology departments of the participating hospitals. Source of controls: Population-based, frequency age and region matched, ascertained through the rosters of the primary health care centers MCCS: Nested case-control, Melbourne, Victoria. Source of cases: Identified by linkage to the Victorian Cancer Registry. Source of controls: Cohort participants without a diagnosis of cancer MD Anderson: Participants in this study were identified from epidemiological prostate cancer studies conducted at the University of Texas MD Anderson Cancer Center in the Houston Metropolitan area. Cases were accrued in the Houston Medical Center and were not restricted with respect to Gleason score, stage or PSA. Controls were identified via random-digit-dialing or among hospital visitors and they were frequency matched to cases on age and race. Lifestyle, demographic, and family history data were collected using a standardized questionnaire. MDACC_AS: A prospective cohort study. Source of cases: Men with clinically organ-confined prostate cancer meeting eligibility criteria for a prospective cohort study of active surveillance at MD Anderson Cancer Center. Source of controls: N/A MEC: The Multiethnic Cohort (MEC) is comprised of over 215,000 men and women recruited from Hawaii and the Los Angeles area between 1993 and 1996. Between 1995 and 2006, over 65,000 blood samples were collected from participants for genetic analyses. To identify incident cancer cases, the MEC was cross-linked with the population-based Surveillance, Epidemiology and End Results (SEER) registries in California and Hawaii, and unaffected cohort participants with blood samples were selected as controls MIAMI (WFPCS): Prostate cancer cases and controls were recruited from the Departments of Urology and Internal Medicine of the Wake Forest University School of Medicine using sequential patient populations as described previously (PMID:15342424). All study subjects received a detailed description of the study protocol and signed their informed consent, as approved by the medical center's Institutional Review Board. The general eligibility criteria were (i) able to comprehend informed consent and (ii) without previously diagnosed cancer. The exclusion criteria were (i) clinical diagnosis of autoimmune diseases; (ii) chronic inflammatory conditions; and (iii) infections within the past 6 weeks. Blood samples were collected from all subjects. MOFFITT: Hospital-based. Source of cases: clinic based from Moffitt Cancer Center. Source of controls: Moffitt Cancer Center affiliated Lifetime cancer screening center NMHS: Case-control, clinic based, Nashville TN. Source of cases: All urology clinics in Nashville, TN. Source of controls: Men without prostate cancer at prostate biopsy. PCaP: The North Carolina-Louisiana Prostate Cancer Project (PCaP) is a multidisciplinary population-based case-only study designed to address racial differences in prostate cancer through a comprehensive evaluation of social, individual and tumor level influences on prostate cancer aggressiveness. PCaP enrolled approximately equal numbers of African Americans and Caucasian Americans with newly-diagnosed prostate cancer from North Carolina (42 counties) and Louisiana (30 parishes) identified through state tumor registries. African American PCaP subjects with DNA, who agreed to future use of specimens for research, participated in OncoArray analysis. PCMUS: Case-control - Sofia, Bulgaria. Source of cases: Patients of Clinic of Urology, Alexandrovska University Hospital, Sofia, Bulgaria, PrCa histopathologically confirmed. Source of controls: 72 patients with verified BPH and PSA<3,5; 78 healthy controls from the MMC Biobank, no history of PrCa PHS: Nested case-control. Source of cases: Participants of the PHS1 trial/cohort. Source of controls: Participants of the PHS1 trial/cohort PLCO: Nested case-control. Source of cases: Men with a confirmed diagnosis of prostate cancer from the PLCO Cancer Screening Trial. Source of controls: Controls were men enrolled in the PLCO Cancer Screening Trial without a diagnosis of cancer at the time of case ascertainment. Poland: Case-control. Source of cases: men with unselected prostate cancer, diagnosed in north-western Poland at the University Hospital in Szczecin. Source of controls: cancer-free men from the same population, taken from the healthy adult patients of family doctors in the Szczecin region PROCAP: Population-based, Retrospective, Observational. Source of cases: Cases were ascertained from the National Prostate Cancer Register of Sweden Follow-Up Study, a retrospective nationwide cohort study of patients with localized prostate cancer. Source of controls: Controls were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. PROGReSS: Hospital-based, Prospective, Observational. Source of cases: Prostate cancer cases from the Hospital Clínico Universitario de Santiago de Compostela, Galicia, Spain. Source of controls: Cancer-free men from the same population ProMPT: A study to collect samples and data from subjects with and without prostate cancer. Retrospective, Experimental. Source of cases: Subjects attending outpatient clinics in hospitals. Source of controls: Subjects attending outpatient clinics in hospitals ProtecT: Trial of treatment. Samples taken from subjects invited for PSA testing from the community at nine centers across United Kingdom. Source of cases: Subjects who have a proven diagnosis of prostate cancer following testing. Source of controls: Identified through invitation of subjects in the community. PROtEuS: Case-control, population-based. Source of cases: All new histologically-confirmed cases, aged less or equal to 75 years, diagnosed between 2005 and 2009, actively ascertained across Montreal French hospitals. Source of controls: Randomly selected from the Provincial electoral list of French-speaking men between 2005 and 2009, from the same area of residence as cases and frequency-matched on age. QLD: Case-control. Source of cases: A longitudinal cohort study (Prostate Cancer Supportive Care and Patient Outcomes Project: ProsCan) conducted in Queensland, through which men newly diagnosed with prostate cancer from 26 private practices and 10 public hospitals were directly referred to ProsCan at the time of diagnosis by their treating clinician (age range 43-88 years). All cases had histopathologically confirmed prostate cancer, following presentation with an abnormal serum PSA and/or lower urinary tract symptoms. Source of controls: Controls comprised healthy male blood donors with no personal history of prostate cancer, recruited through (i) the Australian Red Cross Blood Services in Brisbane (age range 19-76 years) and (ii) the Australian Electoral Commission (AEC) (age and post-code/ area matched to ProsCan, age range 54-90 years). RAPPER: Multi-centre, hospital based blood sample collection study in patients enrolled in clinical trials with prospective collection of radiotherapy toxicity data. Source of cases: Prostate cancer patients enrolled in radiotherapy trials: CHHiP, RT01, Dose Escalation, RADICALS, Pelvic IMRT, PIVOTAL. Source of controls: N/A SABOR: Prostate Cancer Screening Cohort. Source of cases: Men >45 yrs of age participating in annual PSA screening. Source of controls: Males participating in annual PSA prostate cancer risk evaluations (funded by NCI biomarkers discovery and validation grant), recruited through University of Texas Health Science Center at San Antonio and affiliated sites or through study advertisements, enrolment open to the community SCCS: Case-control in cohort, Southeastern USA. Prospective, Observational, Population-based. Source of cases: SCCS entry population. Source of controls: SCCS entry population SCPCS: Population-based, Retrospective, Observational. Source of cases: South Carolina Central Cancer Registry. Source of controls: Health Care Financing Administration beneficiary file SEARCH: Case-control - East Anglia, UK. Source of cases: Men < 70 years of age registered with prostate cancer at the population-based cancer registry, Eastern Cancer Registration and Information Centre, East Anglia, UK. Source of controls: Men attending general practice in East Anglia with no known prostate cancer diagnosis, frequency matched to cases by age and geographic region SNP_Prostate_Ghent: Hospital-based, Retrospective, Observational. Source of cases: Men treated with IMRT as primary or postoperative treatment for prostate cancer at the Ghent University Hospital between 2000 and 2010. Source of controls: Employees of the University hospital and members of social activity clubs, without a history of any cancer. SPAG: Hospital-based, Retrospective, Observational. Source of cases: Guernsey. Source of controls: Guernsey STHM2: Population-based, Retrospective, Observational. Source of cases: Cases were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. Source of controls: Controls were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. PCPT: Case-control from a randomized clinical trial. Source of cases: Randomized clinical trial. Source of controls: Randomized clinical trial SELECT: Case-cohort from a randomized clinical trial. Source of cases: Randomized clinical trial. Source of controls: Randomized clinical trial TAMPERE: Case-control - Finland, Retrospective, Observational, Population-based. Source of cases: Identified through linkage to the Finnish Cancer Registry and patient records; and the Finnish arm of the ERSPC study. Source of controls: Cohort participants without a diagnosis of cancer UGANDA: Uganda Prostate Cancer Study: Uganda is a case-control study of prostate cancer in Kampala Uganda that was initiated in 2011. Men with prostate cancer were enrolled from the Urology unit at Mulago Hospital and men without prostate cancer (i.e. controls) were enrolled from other clinics (i.e. surgery) at the hospital. UKGPCS: ICR, UK. Source of cases: Cases identified through clinics at the Royal Marsden hospital and nationwide NCRN hospitals. Source of controls: Ken Muir's control- 2000 ULM: Case-control - Germany. Source of cases: familial cases (n=162): identified through questionnaires for family history by collaborating urologists all over Germany; sporadic cases (n=308): prostatectomy series performed in the Clinic of Urology Ulm between 2012 and 2014. Source of controls: age-matched controls (n=188): age-matched men without prostate cancer and negative family history collected in hospitals of Ulm WUGS/WUPCS: Cases Series, USA. Source of cases: Identified through clinics at Washington University in St. Louis. Source of controls: Men diagnosed and managed with prostate cancer in University based clinic. Acknowledgement Statements: Aarhus: This study was supported by the Danish Strategic Research Council (now Innovation Fund Denmark) and the Danish Cancer Society. The Danish Cancer Biobank (DCB) is acknowledged for biological material. AHS: This work was supported by the Intramural Research Program of the NIH, National Cancer Institute, Division of Cancer Epidemiology and Genetics (Z01CP010119). ATBC: This research was supported in part by the Intramural Research Program of the NIH and the National Cancer Institute. Additionally, this research was supported by U.S. Public Health Service contracts N01-CN-45165, N01-RC-45035, N01-RC-37004, HHSN261201000006C, and HHSN261201500005C from the National Cancer Institute, Department of Health and Human Services. BioVu: The dataset(s) used for the analyses described were obtained from Vanderbilt University Medical Center's BioVU which is supported by institutional funding and by the National Center for Research Resources, Grant UL1 RR024975-01 (which is now at the National Center for Advancing Translational Sciences, Grant 2 UL1 TR000445-06). Canary PASS: PASS was supported by Canary Foundation and the National Cancer Institute's Early Detection Research Network (U01 CA086402) CCI: This work was awarded by Prostate Cancer Canada and is proudly funded by the Movember Foundation - Grant # D2013-36.The CCI group would like to thank David Murray, Razmik Mirzayans, and April Scott for their contribution to this work. CerePP French Prostate Cancer Case-Control Study (ProGene): None reported COH: SLN is partially supported by the Morris and Horowitz Families Endowed Professorship COSM: The Swedish Research Council, the Swedish Cancer Foundation CPCS1 & CPCS2: Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev Ringvej 75, DK-2730 Herlev, DenmarkCPCS1 would like to thank the participants and staff of the Copenhagen General Population Study for their important contributions. CPDR: Uniformed Services University for the Health Sciences HU0001-10-2-0002 (PI: David G. McLeod, MD) CPS-II: The American Cancer Society funds the creation, maintenance, and updating of the Cancer Prevention Study II cohort. CPS-II thanks the participants and Study Management Group for their invaluable contributions to this research. We would also like to acknowledge the contribution to this study from central cancer registries supported through the Centers for Disease Control and Prevention National Program of Cancer Registries, and cancer registries supported by the National Cancer Institute Surveillance Epidemiology and End Results program. EPIC: The coordination of EPIC is financially supported by the European Commission (DG-SANCO) and the International Agency for Research on Cancer. The national cohorts are supported by the Danish Cancer Society (Denmark); the Deutsche Krebshilfe, Deutsches Krebsforschungszentrum and Federal Ministry of Education and Research (Germany); the Hellenic Health Foundation, Greek Ministry of Health; Greek Ministry of Education (Greece); the Italian Association for Research on Cancer (AIRC) and National Research Council (Italy); the Dutch Ministry of Public Health, Welfare and Sports (VWS), Netherlands Cancer Registry (NKR), LK Research Funds, Dutch Prevention Funds, Dutch ZON (Zorg Onderzoek Nederland), World Cancer Research Fund (WCRF); the Statistics Netherlands (The Netherlands); the Health Research Fund (FIS), Regional Governments of Andalucía, Asturias, Basque Country, Murcia and Navarra, Spanish Ministry of Health ISCIII RETIC (RD06/0020), Red de Centros RCESP, C03/09 (Spain); the Swedish Cancer Society, Swedish Scientific Council and Regional Government of Skåne and Västerbotten, Fundacion Federico SA (Sweden); the Cancer Research UK, Medical Research Council (United Kingdom). EPICAP: The EPICAP study was supported by grants from Ligue Nationale Contre le Cancer, Ligue départementale du Val de Marne; Fondation de France; Agence Nationale de sécurité sanitaire de l'alimentation, de l'environnement et du travail (ANSES). The EPICAP study group would like to thank all urologists, Antoinette Anger and Hasina Randrianasolo (study monitors), Anne-Laure Astolfi, Coline Bernard, Oriane Noyer, Marie-Hélène De Campo, Sandrine Margaroline, Louise N'Diaye, and Sabine Perrier-Bonnet (Clinical Research nurses). ERSPC: This study was supported by the DutchCancerSociety (KWF94-869,98-1657,2002-277,2006-3518, 2010-4800), The Netherlands Organisation for Health Research and Development (ZonMW-002822820, 22000106, 50-50110-98-311, 62300035), The Dutch Cancer Research Foundation (SWOP), and an unconditional grant from Beckman-Coulter-HybritechInc. ESTHER: The ESTHER study was supported by a grant from the Baden Württemberg Ministry of Science, Research and Arts. The ESTHER group would like to thank Hartwig Ziegler, Sonja Wolf, Volker Hermann, Heiko Müller, Karina Dieffenbach, Katja Butterbach for valuable contributions to the study. FHCRC: The FHCRC studies were supported by grants R01-CA056678, R01-CA082664, and R01-CA092579 from the US National Cancer Institute, National Institutes of Health, with additional support from the Fred Hutchinson Cancer Research Center. FHCRC would like to thank all the men who participated in these studies. Gene-PARE: The Gene-PARE study was supported by grants 1R01CA134444 from the U.S. National Institutes of Health, PC074201 and W81XWH-15-1-0680 from the Prostate Cancer Research Program of the Department of Defense and RSGT-05-200-01-CCE from the American Cancer Society. Hamburg-Zagreb: None reported HPFS: The Health Professionals Follow-up Study was supported by grants UM1CA167552, CA133891, CA141298, and P01CA055075. HPFS are grateful to the participants and staff of the Physicians' Health Study and Health Professionals Follow-Up Study for their valuable contributions, as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, and WY. IMPACT: The IMPACT study was funded by The Ronald and Rita McAulay Foundation, CR-UK Project grant (C5047/A1232), Cancer Australia, AICR Netherlands A10-0227, Cancer Australia and Cancer Council Tasmania, NIHR, EU Framework 6, Cancer Councils of Victoria and South Australia, and Philanthropic donation to Northshore University Health System. We acknowledge support from the National Institute for Health Research (NIHR) to the Biomedical Research Centre at The Institute of Cancer Research and Royal Marsden Foundation NHS Trust. IMPACT acknowledges the IMPACT study steering committee, collaborating centres, and participants. IPO-Porto: The IPO-Porto study was funded by Fundaçäo para a Ciência e a Tecnologia (FCT; UID/DTP/00776/2013 and PTDC/DTP-PIC/1308/2014) and by IPO-Porto Research Center (CI-IPOP-16-2012 and CI-IPOP-24-2015). MC and MPS are research fellows from Liga Portuguesa Contra o Cancro, Núcleo Regional do Norte. SM is a research fellow from FCT (SFRH/BD/71397/2010). IPO-Porto would like to express our gratitude to all patients and families who have participated in this study. Karuprostate: The Karuprostate study was supported by the the Frech National Health Directorate and by the Association pour la Recherche sur les Tumeurs de la ProstateKarusprostate thanks Séverine Ferdinand. KULEUVEN: F.C. and S.J. are holders of grants from FWO Vlaanderen (G.0684.12N and G.0830.13N), the Belgian federal government (National Cancer Plan KPC_29_023), and a Concerted Research Action of the KU Leuven (GOA/15/017). TVDB is holder of a doctoral fellowship of the FWO. LAAPC: This study was funded by grant R01CA84979 (to S.A. Ingles) from the National Cancer Institute, National Institutes of Health. Malaysia: The study was funded by the University Malaya High Impact Research Grant (HIR/MOHE/MED/35). Malaysia thanks all associates in the Urology Unit, University of Malaya, Cancer Research Initiatives Foundation (CARIF) and the Malaysian Men's Health Initiative (MMHI). MCCS: MCCS cohort recruitment was funded by VicHealth and Cancer Council Victoria. The MCCS was further supported by Australian NHMRC grants 209057, 251553, and 504711, and by infrastructure provided by Cancer Council Victoria. Cases and their vital status were ascertained through the Victorian Cancer Registry (VCR) and the Australian Institute of Health and Welfare (AIHW), including the National Death Index and the Australian Cancer Database. MCC-Spain: The study was partially funded by the Accion Transversal del Cancer, approved on the Spanish Ministry Council on the 11th October 2007, by the Instituto de Salud Carlos III-FEDER (PI08/1770, PI09/00773-Cantabria, PI11/01889-FEDER, PI12/00265, PI12/01270, and PI12/00715), by the Fundación Marqués de Valdecilla (API 10/09), by the Spanish Association Against Cancer (AECC) Scientific Foundation and by the Catalan Government DURSI grant 2009SGR1489. Samples: Biological samples were stored at the Parc de Salut MAR Biobank (MARBiobanc; Barcelona) which is supported by Instituto de Salud Carlos III FEDER (RD09/0076/00036). Also sample collection was supported by the Xarxa de Bancs de Tumors de Catalunya sponsored by Pla Director d'Oncologia de Catalunya (XBTC). MCC-Spain acknowledges the contribution from Esther Gracia-Lavedan in preparing the data. We thank all the subjects who participated in the study and all MCC-Spain collaborators. MD Anderson: Prostate Cancer Case-Control Studies at MD Anderson (MDA) supported by grants CA68578, ES007784, DAMD W81XWH-07-1-0645, and CA140388. MDACC_AS: None reported MEC: Funding provided by NIH grant U19CA148537 and grant U01CA164973. MIAMI (WFPCS): ACS MOFFITT: The Moffitt group was supported by the US National Cancer Institute (R01CA128813, PI: J.Y. Park). NMHS: Funding for the Nashville Men's Health Study (NMHS) was provided by the National Institutes of Health Grant numbers: RO1CA121060. PCaP only data: The North Carolina - Louisiana Prostate Cancer Project (PCaP) is carried out as a collaborative study supported by the Department of Defense contract DAMD 17-03-2-0052. For HCaP-NC follow-up data: The Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study is carried out as a collaborative study supported by the American Cancer Society award RSGT-08-008-01-CPHPS. For studies using both PCaP and HCaP-NC follow-up data please use: The North Carolina - Louisiana Prostate Cancer Project (PCaP) and the Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study are carried out as collaborative studies supported by the Department of Defense contract DAMD 17-03-2-0052 and the American Cancer Society award RSGT-08-008-01-CPHPS, respectively. For any PCaP data, please include: The authors thank the staff, advisory committees and research subjects participating in the PCaP study for their important contributions. For studies using PCaP DNA/genotyping data, please include: We would like to acknowledge the UNC BioSpecimen Facility and LSUHSC Pathology Lab for our DNA extractions, blood processing, storage and sample disbursement (https://genome.unc.edu/bsp). For studies using PCaP tissue, please include: We would like to acknowledge the RPCI Department of Urology Tissue Microarray and Immunoanalysis Core for our tissue processing, storage and sample disbursement. For studies using HCaP-NC follow-up data, please use: The Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study is carried out as a collaborative study supported by the American Cancer Society award RSGT-08-008-01-CPHPS. The authors thank the staff, advisory committees and research subjects participating in the HCaP-NC study for their important contributions. For studies that use both PCaP and HCaP-NC, please use: The authors thank the staff, advisory committees and research subjects participating in the PCaP and HCaP-NC studies for their important contributions. PCMUS: The PCMUS study was supported by the Bulgarian National Science Fund, Ministry of Education and Science (contract DOO-119/2009; DUNK01/2-2009; DFNI-B01/28/2012) with additional support from the Science Fund of Medical University - Sofia (contract 51/2009; 8I/2009; 28/2010). PHS: The Physicians' Health Study was supported by grants CA34944, CA40360, CA097193, HL26490, and HL34595. PHS members are grateful to the participants and staff of the Physicians' Health Study and Health Professionals Follow-Up Study for their valuable contributions, as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, and WY. PLCO: This PLCO study was supported by the Intramural Research Program of the Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIHPLCO thanks Drs. Christine Berg and Philip Prorok, Division of Cancer Prevention at the National Cancer Institute, the screening center investigators and staff of the PLCO Cancer Screening Trial for their contributions to the PLCO Cancer Screening Trial. We thank Mr. Thomas Riley, Mr. Craig Williams, Mr. Matthew Moore, and Ms. Shannon Merkle at Information Management Services, Inc., for their management of the data and Ms. Barbara O'Brien and staff at Westat, Inc. for their contributions to the PLCO Cancer Screening Trial. We also thank the PLCO study participants for their contributions to making this study possible. Poland: None reported PROCAP: PROCAP was supported by the Swedish Cancer Foundation (08-708, 09-0677). PROCAP thanks and acknowledges all of the participants in the PROCAP study. We thank Carin Cavalli-Björkman and Ami Rönnberg Karlsson for their dedicated work in the collection of data. Michael Broms is acknowledged for his skilful work with the databases. KI Biobank is acknowledged for handling the samples and for DNA extraction. We acknowledge The NPCR steering group: Pär Stattin (chair), Anders Widmark, Stefan Karlsson, Magnus Törnblom, Jan Adolfsson, Anna Bill-Axelson, Ove Andrén, David Robinson, Bill Pettersson, Jonas Hugosson, Jan-Erik Damber, Ola Bratt, Göran Ahlgren, Lars Egevad, and Roy Ehrnström. PROGReSS: The PROGReSS study is founded by grants from the Spanish Ministry of Health (INT15/00070; INT16/00154; FIS PI10/00164, FIS PI13/02030; FIS PI16/00046); the Spanish Ministry of Economy and Competitiveness (PTA2014-10228-I), and Fondo Europeo de Desarrollo Regional (FEDER 2007-2013). ProMPT: Founded by CRUK, NIHR, MRC, Cambride Biomedical Research Centre ProtecT: Founded by NIHR. ProtecT and ProMPT would like to acknowledge the support of The University of Cambridge, Cancer Research UK. Cancer Research UK grants (C8197/A10123) and (C8197/A10865) supported the genotyping team. We would also like to acknowledge the support of the National Institute for Health Research which funds the Cambridge Bio-medical Research Centre, Cambridge, UK. We would also like to acknowledge the support of the National Cancer Research Prostate Cancer: Mechanisms of Progression and Treatment (PROMPT) collaborative (grant code G0500966/75466) which has funded tissue and urine collections in Cambridge. We are grateful to staff at the Welcome Trust Clinical Research Facility, Addenbrooke's Clinical Research Centre, Cambridge, UK for their help in conducting the ProtecT study. We also acknowledge the support of the NIHR Cambridge Biomedical Research Centre, the DOH HTA (ProtecT grant), and the NCRI/MRC (ProMPT grant) for help with the bio-repository. The UK Department of Health funded the ProtecT study through the NIHR Health Technology Assessment Programme (projects 96/20/06, 96/20/99). The ProtecT trial and its linked ProMPT and CAP (Comparison Arm for ProtecT) studies are supported by Department of Health, England; Cancer Research UK grant number C522/A8649, Medical Research Council of England grant number G0500966, ID 75466, and The NCRI, UK. The epidemiological data for ProtecT were generated though funding from the Southwest National Health Service Research and Development. DNA extraction in ProtecT was supported by USA Dept of Defense award W81XWH-04-1-0280, Yorkshire Cancer Research and Cancer Research UK. The authors would like to acknowledge the contribution of all members of the ProtecT study research group. The views and opinions expressed therein are those of the authors and do not necessarily reflect those of the Department of Health of England. The bio-repository from ProtecT is supported by the NCRI (ProMPT) Prostate Cancer Collaborative and the Cambridge BMRC grant from NIHR. We thank the National Institute for Health Research, Hutchison Whampoa Limited, the Human Research Tissue Bank (Addenbrooke's Hospital), and Cancer Research UK. PROtEuS: PROtEuS was supported financially through grants from the Canadian Cancer Society (13149, 19500, 19864, 19865) and the Cancer Research Society, in partnership with the Ministère de l'enseignement supérieur, de la recherche, de la science et de la technologie du Québec, and the Fonds de la recherche du Québec - Santé.PROtEuS would like to thank its collaborators and research personnel, and the urologists involved in subjects recruitment. We also wish to acknowledge the special contribution made by Ann Hsing and Anand Chokkalingam to the conception of the genetic component of PROtEuS. QLD: The QLD research is supported by The National Health and Medical Research Council (NHMRC) Australia Project Grants (390130, 1009458) and NHMRC Career Development Fellowship and Cancer Australia PdCCRS funding to J Batra. The QLD team would like to acknowledge and sincerely thank the urologists, pathologists, data managers and patient participants who have generously and altruistically supported the QLD cohort. RAPPER: RAPPER is funded by Cancer Research UK (C1094/A11728; C1094/A18504) and Experimental Cancer Medicine Centre funding (C1467/A7286). The RAPPER group thank Rebecca Elliott for project management. SABOR: The SABOR research is supported by NIH/NCI Early Detection Research Network, grant U01 CA0866402-12. Also supported by the Cancer Center Support Grant to the Cancer Therapy and Research Center from the National Cancer Institute (US) P30 CA054174. SCCS: SCCS is funded by NIH grant R01 CA092447, and SCCS sample preparation was conducted at the Epidemiology Biospecimen Core Lab that is supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA68485). Data on SCCS cancer cases used in this publication were provided by the Alabama Statewide Cancer Registry; Kentucky Cancer Registry, Lexington, KY; Tennessee Department of Health, Office of Cancer Surveillance; Florida Cancer Data System; North Carolina Central Cancer Registry, North Carolina Division of Public Health; Georgia Comprehensive Cancer Registry; Louisiana Tumor Registry; Mississippi Cancer Registry; South Carolina Central Cancer Registry; Virginia Department of Health, Virginia Cancer Registry; Arkansas Department of Health, Cancer Registry, 4815 W. Markham, Little Rock, AR 72205. The Arkansas Central Cancer Registry is fully funded by a grant from National Program of Cancer Registries, Centers for Disease Control and Prevention (CDC). Data on SCCS cancer cases from Mississippi were collected by the Mississippi Cancer Registry which participates in the National Program of Cancer Registries (NPCR) of the Centers for Disease Control and Prevention (CDC). The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the CDC or the Mississippi Cancer Registry. SCPCS: SCPCS is funded by CDC grant S1135-19/19, and SCPCS sample preparation was conducted at the Epidemiology Biospecimen Core Lab that is supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA68485). SEARCH: SEARCH is funded by a program grant from Cancer Research UK (C490/A10124) and supported by the UK National Institute for Health Research Biomedical Research Centre at the University of Cambridge. SNP_Prostate_Ghent: The study was supported by the National Cancer Plan, financed by the Federal Office of Health and Social Affairs, Belgium. SPAG: Wessex Medical ResearchHope for Guernsey, MUG, HSSD, MSG, Roger Allsopp STHM2: STHM2 was supported by grants from The Strategic Research Programme on Cancer (StratCan), Karolinska Institutet; the Linné Centre for Breast and Prostate Cancer (CRISP, number 70867901), Karolinska Institutet; The Swedish Research Council (number K2010-70X-20430-04-3) and The Swedish Cancer Society (numbers 11-0287 and 11-0624); Stiftelsen Johanna Hagstrand och Sigfrid Linnérs minne; Swedish Council for Working Life and Social Research (FAS), number 2012-0073STHM2 acknowledges the Karolinska University Laboratory, Aleris Medilab, Unilabs and the Regional Prostate Cancer Registry for performing analyses and help to retrieve data. Carin Cavalli-Björkman and Britt-Marie Hune for their enthusiastic work as research nurses. Astrid Björklund for skilful data management. We wish to thank the BBMRI.se biobank facility at Karolinska Institutet for biobank services. PCPT & SELECT are funded by Public Health Service grants U10CA37429 and 5UM1CA182883 from the National Cancer Institute. SWOG and SELECT thank the site investigators and staff and, most importantly, the participants who donated their time to this trial. TAMPERE: The Tampere (Finland) study was supported by the Academy of Finland (251074), The Finnish Cancer Organisations, Sigrid Juselius Foundation, and the Competitive Research Funding of the Tampere University Hospital (X51003). The PSA screening samples were collected by the Finnish part of ERSPC (European Study of Screening for Prostate Cancer). TAMPERE would like to thank Riina Liikanen, Liisa Maeaettaenen and Kirsi Talala for their work on samples and databases. UGANDA: None reported UKGPCS: UKGPCS would also like to thank the following for funding support: The Institute of Cancer Research and The Everyman Campaign, The Prostate Cancer Research Foundation, Prostate Research Campaign UK (now Prostate Action), The Orchid Cancer Appeal, The National Cancer Research Network UK, The National Cancer Research Institute (NCRI) UK. We are grateful for support of NIHR funding to the NIHR Biomedical Research Centre at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust. UKGPCS should also like to acknowledge the NCRN nurses, data managers, and consultants for their work in the UKGPCS study. UKGPCS would like to thank all urologists and other persons involved in the planning, coordination, and data collection of the study. ULM: The Ulm group received funds from the German Cancer Aid (Deutsche Krebshilfe). WUGS/WUPCS: WUGS would like to thank the following for funding support: The Anthony DeNovi Fund, the Donald C. McGraw Foundation, and the St. Louis Men's Group Against Cancer.
The Age-Related Eye Disease Study (AREDS) was initially designed as a long-term multi-center, prospective study of the clinical course of age-related macular degeneration (AMD) and age-related cataract. In addition to collecting natural history data, AREDS included a clinical trial of high-dose vitamin and mineral supplements for AMD and a clinical trial of high-dose vitamin supplements for cataract. AREDS participants were 55 to 80 years of age at enrollment and had to be free of any illness or condition that would make long-term follow-up or compliance with study medications unlikely or difficult. On the basis of fundus photographs graded by a central reading center, best-corrected visual acuity and ophthalmologic evaluations, 4,757 participants were enrolled in one of several AMD categories, including persons with no AMD. The clinical trials for AMD and cataract were conducted concurrently. AREDS participants were followed on the clinical trials for a median time of 6.5 years. Subsequent to the conclusion of the clinical trials, participants were followed for an additional 5 years and natural history data were collected. The AREDS research design is detailed in AREDS Report 1. AREDS Report 8 contains the mainline results from the AMD trial; AREDS Report 9 contains the results of the cataract trial. Blood samples were also collected from 3,700+ AREDS participants for genetic research. Genetic samples from 600 AREDS participants (200 controls, 200 Neovascular AMD cases, and 200 Geographic Atrophy cases) were selected using data available in March 2005 and then were evaluated with a genome-wide scan. These data, as well as selected phenotypic data, were made available in the dbGaP. DNA from AREDS participants, which is currently being stored in the AREDS Genetic Repository, is available for research purposes. However, not all of the 3,700+ AREDS participants who submitted a blood sample currently have DNA available. In addition to including the data from the genome-wide scan on the 600 original samples, this second version of the AREDS dbGaP database provides a comprehensive set of data tables with extensive clinical information collected for the 4,757 participants who participated in AREDS. The tables include information collected at enrollment/baseline, during study follow-up, fundus and lens pathology, nutritional estimates, quality of life measures and measures of morbidity and mortality. In November 2010, over 72,000 high quality fundus and lens photographs of 595 AREDS participants (of the original 600 selected for the genome-wide scan) were made available in the AREDS dbGaP. In addition to the genome-wide scan data, the fundus and lens grading data for these participants are also available through the AREDS dbGaP. Details about the ocular photographs that are available may be found in the document "Age-Related Eye Disease Study (AREDS) Ocular Photographs". In January 2012, a measure of daily sunlight exposure was added in a separate "sunlight" table. Furthermore, the "followup" table has been revised. The visual acuity for the right eye was inadvertently missing at odd-numbered visits (01, 03, 05, etc.). This data is now part of the table. In February 2014 over 134,500 high-quality fundus photographs (macular field F2) of 4613 AREDS participants were added to the existing AREDS dbGaP resource. The AREDS dbGaP image archive already contains over 72,000 high quality fundus and lens photographs of 595 AREDS participants for whom dbGaP-accessible genotype data exist. Information about the available ocular photographs found in the document "Age-Related Eye Disease Study (AREDS) Ocular Photographs" has been updated with an addendum. It is hoped that this resource will better help researchers understand two important diseases that affect an aging population. These data may be applied to examination and inference on genetic and genetic-environmental bases for age-related diseases of public health significance and may also help elucidate the clinical course of both conditions, generate hypotheses, and aid in the design of clinical trials of preventive interventions. Definitions of Final AMD Phenotype Categories Please see phd001138.1 for a detailed description of how AREDS participants' final AMD phenotype was categorized. User's Guide for AREDS Phenotype Data A detailed User's Guide for the AREDS phenotype data is available. This User's Guide is meant to be a comprehensive document which explains the complexities of the AREDS data. It is recommended that all researchers using AREDS phenotype data make use of this User's Guide.
Background: Circulating cell free (ccf) fetal DNA has enabled non-invasive prenatal fetal aneuploidy testing without direct discrimination of the genetically distinct maternal and fetal DNA. Current testing may be improved by specifically enriching the sample material for fetal DNA. DNA methylation may allow for such a separation of DNA and thus support additional clinical opportunities; however, this depends on knowledge of the methylomes of ccf DNA and its cellular contributors. Results: Whole genome bisulfite sequencing was performed on a set of unmatched samples including ccf DNA from 8 non-pregnant (NP) and 7 pregnant female donors and genomic DNA from 7 buffy coat and 5 placenta samples. We found CpG cytosines within longer fragments were more likely to be methylated, linking DNA methylation and fragment size in ccf DNA. Comparison of the methylomes of placenta and NP ccf DNA revealed many of the 51,259 identified differentially methylated regions (DMRs) were located in domains exhibiting consistent placenta hypomethylation across millions of consecutive bases, regions we termed placenta hypomethylated domains. DMRs identified when comparing placenta to NP ccf DNA were recapitulated in pregnant ccf DNA, confirming the ability to detect differential methylation in ccf DNA mixtures. Conclusions: We generated methylome maps for four sample types at single base resolution, identified a link between DNA methylation and fragment length in ccf DNA, identified DMRs between sample groups, and uncovered the presence of megabase-size placenta hypomethylated domains. Furthermore, we anticipate these results to provide a foundation to which future studies using discriminatory DNA methylation may be compared.