The cerebrospinal fluid (CSF) is thought to be a main route for immune surveillance in the central nervous system (CNS). Understanding the composition and state of CSF in healthy individuals may provide insight into perturbations present during neuroinflammation. Here, we have used single-cell RNA and TCR sequencing to profile CSF and peripheral blood mononuclear cells (PBMCs) from healthy individuals and patients with relapsing-remitting multiple sclerosis (MS), a chronic autoimmmune disease. In addition, with the goal of understanding how transcriptional artefacts are introduced in transcriptomics experiments, we have performed single-cell and TCR sequencing to profile PBMCs from a healthy individual that were processed with the same enzymatic digestion used for tissues (such as the brain) in the presence or absence of transcription inhibitors.
Repeat expansions in FGF14 cause autosomal dominant late-onset cerebellar ataxia (SCA27B) with estimated pathogenic thresholds of 250 (incomplete penetrance) and 300 AAG repeats (full penetrance), but the sequence of pathogenic and non-pathogenic expansions remains unexplored. Here, we demonstrate that STRling and ExpansionHunter accurately detect FGF14 expansions from short-read genomic data using outlier approaches. By combining long-range PCR and nanopore sequencing in 169 patients with cerebellar ataxia and 802 controls, we compare FGF14 expansion alleles, including interruptions and flanking regions. Uninterrupted AAG expansions are significantly enriched in patients with ataxia from a lower threshold (180-200 repeats) than previously reported based on expansion size alone. Conversely, AAGGAG hexameric expansions are equally frequent in patients and controls. Distinct 5' flanking regions, interruptions and pre-repeat sequences correlate with repeat size. Furthermore, pure AAG (pathogenic) and AAGGAG (non-pathogenic) repeats form different secondary structures. Regardless of expansion size, SCA27B is a recognizable clinical entity characterized by frequent episodic ataxia and downbeat nystagmus, similar to the presentation observed in a family with a previously unreported nonsense variant (SCA27A). Overall, this study suggests that SCA27B is a major overlooked cause of adult-onset ataxia, accounting for 23-31% of unsolved patients. We strongly recommend re-evaluating pathogenic thresholds and integrating expansion sequencing into the molecular diagnostic process.
The Women's Interagency HIV Study (WIHS), a multicenter, prospective study, was established in August 1993 to carry out comprehensive investigations of the impact of HIV infection and its clinical, laboratory, and psychosocial effects in women. The purpose of this study is to collect and evaluate these data from HIV infected and at-risk women to better understand and provide support for women whom are currently HIV infected or who are at risk for HIV infection. Participants will have study visits every 6 months. Questionnaires regarding sexual behavior, health care utilization, medical and obstetric/gynecological history, psychosocial factors, and sociodemographics will be completed by participants. Physical, gynecological, and lipodystrophy examinations will also be conducted at each visit, and current medication regimen will be noted. Blood and other bodily fluid samples will be collected and registered in both local and national repositories of the WIHS in conjunction with NIAID.
Metastatic disease to the central nervous system (CNS) is devastating, with a poor prognosis. There are limited treatment options for patients with CNS metastases, and in particular with leptomeningeal carcinomatosis (LMD). In this study, we evaluated the molecular and cellular features of parenchymal brain metastases and LMD. We used single-cell RNA and cell-free DNA profiling of CSF from patients with LMD receiving immunotherapy to demonstrate shifts in the liquid LMD tumor microenvironment. We also characterize the tumor microenvironment of parenchymal brain metastases using single cell sequencing, before and after treatment with immunotherapy.
The aim of this study is to identify the specific bacteria that are responsible for pathological bacterial translocation and subsequent Th17 priming in the liver in primary sclerosing cholangitis (PSC). To clarify the mechanism of how microbiota interacts with the intestinal epithelial barrier, monolayered human intestinal organoids were cultured with the specific bacteria. Gene expression analysis using RNA sequencing in epithelial cells cultured with Klebsiella pneumoniae derived from a PSC patient (KP-P1) or commercially obtained Klebsiella pneumoniae strain (KP JCM1662) was performed in this study.
Pancreatic Neuroendocrine Tumors (PanNETs) most probably arise from the islets of Langerhans, although direct evidence is still lacking. About 40% of non-functioning (NF) PanNETs harbour mutations in MEN1, often in conjunction with alterations in DAXX/ATRX. Despite the homogeneous genetic background, the ADM (mutation in DAXX/ATRX and MEN1) group is heterogeneous regarding molecular features and response to treatment. To explore the molecular and clinical heterogeneity within this group, we integrated transcriptomic and DNA methylation data from 36 and 93 samples, respectively. Using a multi-omic approach we described a two-level hierarchy separating PanNET subtypes mutated in MEN1, DAXX and ATRX. First, DNA methylation discriminates ADM-PanNET from PanNET mutated in MEN1 only (α-like). Compared to α-like, all ADM tumors exhibit DNA methylation changes at enhancer and lowly methylated regions as well as at peri-centromeric and telomeric regions, associated with alternative lengthening of telomeres, increased chromosomal instability, and enhanced proliferative capacity. At a second level, transcriptomic analysis revealed three distinct ADM subtypes: ADM hypoxic, ADM NST (No Special Type), and ADM immunosuppressive. The ADM hypoxic subtype is characterized by strong hypoxia signature, likely regulated epigenetically. The ADM NST subtype appears to be primarily driven by epigenetic changes that promote proliferation. The ADM immunosuppressive is highly enriched in immune component and strong metastasis-like signature. While all the three subtypes are highly associated with liver metastasis formation and invasion, the ADM immunosuppressive are significantly smaller (<2.5 cm p.value=0.023) despite displaying aggressive molecular features. By defining these three novel ADM subtypes, our study provides a refined framework for understanding PanNET heterogeneity. This classification underscores potential diagnostic markers and highlights distinct biological vulnerabilities that may inform the development of subtype-tailored therapeutic strategies.
Salivary gland cancer (SGC) is a rare cancer for which systemic treatment options are limited. Therefore, it is important to characterize its genetic landscape in search for actionable aberrations. This research aimed to identify these actionable aberrations by NGS-based analysis of DNA (single and multiple nucleotide variants, copy number variants, microsatellite instability and tumor mutational burden) in a large cohort of SGC patients. DNA was extracted from archival tissue of 119 patients with various SGC subtypes and sequenced using a targeted NGS panel encompassing 523 cancer related genes (TruSight Oncology 500, TSO500). The 119 patients included 46 AdCC patients, 43 SDC patients, 15 MEC patients, 9 AciCC patients and 6 patients with other subtypes (1 secretory carcinoma, 1 polymorphic adenocarcinoma (PAC), 1 adenocarcinoma NOS, 1 myoepithelial carcinoma, 1 epithelial/myoepithelial carcinoma and 1 mixed PAC/myoepithelial carcinoma). Median exon coverage ranged from 80-904 (median 308) unique reads and the median percentage of exon coverage with at least 100 unique reads was median 96.6% in these 119 samples.
Interpretation of the function of non-coding risk loci for neuropsychiatric disorders and brain-relevant traits via gene expression and alternative splicing quantitative trait locus (e/sQTL) analyses is generally performed in bulk post-mortem adult tissue. However, genetic risk loci are enriched in regulatory elements active during neocortical differentiation, and regulatory effects of risk variants may be masked by heterogeneity in bulk tissue. Here, we map e/sQTLs, and allele-specific expression in cultured cells representing two major developmental stages, primary human neural progenitors (n=85) and their sorted neuronal progeny (n=74), identifying numerous loci not detected in either bulk developing cortical wall or adult cortex. Using colocalization and genetic imputation via transcriptome-wide association, we uncover cell-type specific regulatory mechanisms underlying risk for brain-relevant traits that are active during neocortical differentiation. Specifically, we identified a progenitor-specific eQTL for CENPW co-localized with common variant associations for cortical surface area and educational attainment.
The Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) is a collaborative effort comprised of a coordinating center and scientific researchers from well-characterized cohort and case-control studies conducted in North America and Europe. This international consortium aims to accelerate the discovery of common and rare genetic risk variants for colorectal cancer by conducting large-scale meta-analyses of existing and newly generated genome-wide association study (GWAS) data, replicating and fine-mapping of GWAS discoveries, and investigating how genetic risk variants are modified by environmental risk factors. To expand these efforts, we assembled case-control sets or nested case-control sets from 20 different North American or European studies. Summary descriptions and study participant inclusions/exclusion criteria for each of these studies are detailed below. The Black Women's Health Study (BWHS): Is the largest follow-up study of the health of African-American women (Cozier et al., 2004; Rosenberg et al., 1995) [PMID: 15018884; PMID: 7722208]. The purpose is to identify and evaluate causes and preventives of cancers and other serious illnesses in African-American women. Among the diseases being studied are breast cancer, colorectal cancer, type 2 diabetes, uterine fibroids, systemic lupus erythematosus, and cardiovascular disease. The study began in 1995, when 59,000 black women from all parts of the United States enrolled through postal questionnaires. The women provided demographic and health data on the 1995 baseline questionnaire, including information on weight, height, smoking, drinking, contraceptive use, use of other selected medications, illnesses, reproductive history, physical activity, diet, use of health care, and other factors. The participants are followed through biennial questionnaires to determine the occurrence of cancers and other illnesses and to update information on risk factors. Self-reports of cancer are confirmed through medical records and state cancer registry records. Mouthwash-swish samples, as a source of DNA, were obtained from ~26,000 BWHS participants in 2002-2007. DNA was isolated from the mouthwash-swish samples at the Boston University Molecular Core Genetics Laboratory using the QIAAMP DNA Mini Kit (Qiagen). All incident colorectal cancer cases with a DNA sample were included in the present analysis. Two controls per case, selected from among BWHS participants free of colorectal cancer at end of follow-up, were matched to cases on year of birth (+/- 2 years) and geographical region of residence (Northeast, South, Midwest, and West). A total 209 colorectal cancer cases and 423 controls were sent for genotyping. Campaign Against Cancer and Heart Disease (CLUE II): The Campaign Against Cancer and Heart Disease, is a prospective cohort designed to identify biomarkers and other factors associated with risk of cancer, heart disease, and other conditions (Kakourou et al., 2015) [PMID: 26220152]. 32,894 participants were recruited from May through October 1989 from Washington County, Maryland and surrounding communities. Colorectal cancer cases (n = 297) and matched controls (n = 296) were identified between 1989 and 2000 among participants in the CLUE II cohort of Washington County, Maryland. Colorectal Cancer Study of Austria (CORSA): In the ongoing colorectal cancer study of Austria (CORSA), more than 13,000 Caucasian participants have been recruited within the province-wide screening project "Burgenland Prevention Trial of Colorectal Disease with Immunological Testing" (B-PREDICT) since 2003 (Hofer et al., 2011) [PMID: 21422235]. All inhabitants of the Austrian province Burgenland aged between 40 and 80 years are annually invited to participate in fecal immunochemical testing and haemoccult positive screening participants are invited for colonoscopy. CORSA includes genomic DNA and plasma of colorectal cancer cases, low-risk and high-risk adenomas, and colonoscopy-negative controls. Controls received a complete colonoscopy and were free of colorectal cancer or polyps. CORSA participants have been recruited in the four KRAGES hospitals in Burgenland, Austria, and additionally, at the Medical University of Vienna (Department of Surgery), the Viennese hospitals "Rudolfstiftung" and the "Sozialmedizinisches Zentrum Sud", and at the Medical University of Graz (Department of Internal Medicine). 1403 colorectal cancer and advanced colorectal adenoma cases, and 1404 matched controls were selected for the study. Distribution of factors sex and age (5 year strata) were evenly matched between cases and controls. Cancer Prevention Study II (CPS II): The CPS II Nutrition cohort is a prospective study of cancer incidence and mortality in the United States, established in 1992 and described in detail elsewhere (Calle et al., 2002; Campbell et al., 2014) [PMID: 12015775; PMID: 25472679]. At enrollment, participants completed a mailed self-administered questionnaire including information on demographic, medical, diet, and lifestyle factors. Follow-up questionnaires to update exposure information and to ascertain newly diagnosed cancers were sent biennially starting in 1997. Reported cancers were verified through medical records, state cancer registry linkage, or death certificates. The Emory University Institutional Review Board approves all aspects of the CPS II Nutrition Cohort. A total of 360 cases and 359 controls were selected for this study. Czech Republic Colorectal Cancer Study (Czech Republic CCS): Cases with positive colonoscopy results for malignancy, confirmed by histology as colon or rectal carcinomas, were recruited between September 2003 and May 2012 in several oncological departments in the Czech Republic (Prague, Pilsen, Benesov, Brno, Liberec, Ples, Pribram, Usti and Labem, and Zlin). Two control groups, sampled at the same time of cases recruitment, were included in the study. The first group consisted of hospital-based individuals with a negative colonoscopy result for malignancy or idiopathic bowel diseases. The reasons for the colonoscopy were: i) positive fecal occult blood test, ii) hemorrhoids, iii) abdominal pain of unknown origin, and iv) macroscopic bleeding. The second control group consisted of healthy blood donor volunteers from a blood donor center in Prague. All individuals were subjected to standard examinations to verify the health status for blood donation and were cancer-free at the time of the sampling. Details of CRC cases and controls have been reported previously (Vymetalkova et al., 2014; Naccarati et al., 2016; Vymetalkova et al., 2016) [PMID: 24755277; PMID: 26735576; PMID: 27803053]. All subjects were informed and provided written consent to participate in the study. They approved the use of their biological samples for genetic analyses, according to the Declaration of Helsinki. The design of the study was approved by the Ethics Committee of the Institute of Experimental Medicine, Prague, Czech Republic. All subjects included in the study were Caucasians and comprised 1792 cases and 1764 matched controls. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age and sex. Age was matched on +-5 years, whereas sex was matched exactly. For the cases without matched controls, matching was done only on sex. Early Detection Research Network (EDRN): The aim of the EDRN initiative is to develop and sustain a biorepository for support of translational research (Amin et al., 2010) [PMID: 21031013]. High-quality biospecimens were accrued and annotated with pertinent clinical, epidemiologic, molecular and genomic information. A user-friendly annotation tool and query tool was developed for this purpose. The various components of this annotation tool include: CDEs are developed from the College of American Pathologists (CAP) Cancer Checklists and North American Association of Central Cancer Registries (NAACR) standards. The CDEs provides semantic and syntactic interoperability of the data sets by describing them in the form of metadata or data descriptor. A total of 352 colorectal case samples and 399 controls were selected for this study. Controls were matched to CRC cases based on age and sex. The EPICOLON Consortium (EPICOLON): The EPICOLON Consortium comprises a prospective, multicentre and population-based epidemiology survey of the incidence and features of CRC in the Spanish population (Fernandez-Rozadilla et al., 2013) [PMID: 23350875]. Cases were selected as patients with de novo histologically confirmed diagnosis of colorectal adenocarcinoma. Patients with familial adenomatous polyposis, Lynch syndrome or inflammatory bowel disease-related CRC, and cases where patients or family refused to participate in the study were excluded. Hospital-based controls were recruited through the blood collection unit of each hospital, together with cases. All of the controls were confirmed to have no history of cancer or other neoplasm and no reported family history of CRC. Controls were randomly selected and matched with cases for hospital, sex and age (+- 5 years). A total of 370 cases and 370 controls were selected for genotyping. Hawaii Adenoma Study: For this adenoma study, two flexible-sigmoidoscopy screening clinics were first used to recruit participants on Oahu, Hawaii. Adenoma cases were identified either from the baseline examination at the Hawaii site of the Prostate Lung Colorectal and Ovarian cancer screening trial during 1996-2000 or at the Kaiser Permanente Hawaii's Gastroenterology Screening Clinic during 1995-2007. In addition, starting in 2002 and up to 2007, we also approached for recruitment all eligible patients who underwent a colonoscopy in the Kaiser Permanente Hawaii Gastroenterology Department. Cases were patients with histologically confirmed first-time adenoma(s) of the colorectum and were of Japanese, Caucasian or Hawaiian race/ethnicity. Controls were selected among patients with a normal colorectum and were individually matched to the cases on age at exam, sex, race/ethnicity, screening date (+-3 months) and clinic and type of examination (colonoscopy or flexible sigmoidoscopy). We recruited 1016 adenoma cases (67.8% of all eligible) and 1355 controls (69.2% of all eligible); 889 cases and 1169 controls agreed to give a blood and 29 cases and 34 controls, a mouthwash sample. A total of 989 cases and 1185 controls were genotyped for this study. Columbus-area HNPCC Study (HNPCC, OSUMC): Patients with colorectal adenocarcinoma diagnosed at six participating hospitals were eligible for this study, regardless of age at diagnosis or family history of cancer. Patients with a clinical diagnosis of familial adenomatous polyposis were not eligible for this study. These six hospitals perform the vast majority of all operations for CRC in the Columbus metropolitan area (population 1.7 million). The institutional review board at all participating hospitals approved the research protocol and consent form in accordance with assurances filed with and approved by the United States Department of Health and Human Services. Briefly, during the period of January 1999 through August 2004, 1,566 eligible patients with CRC were accrued to the study (Hampel et al., 2008) [PMID 18809606]. A total of 1472 colorectal cancer samples had enough blood DNA remaining to be sent for genotyping. Control samples were provided by the Ohio State University Medical Center%#39;s (OSUMC) Human Genetics Sample Bank. The Columbus Area Controls Sample Bank is a collection of control samples for use in human genetics research that includes both donors' anonymized biological specimens and linked phenotypic data. The data and samples are collected under the protocol "Collection and Storage of Controls for Genetics Research Studies", which is approved by the Biomedical Sciences Institutional Review Board at OSUMC. Recruitment takes place in OSUMC primary care and internal medicine clinics. If individuals agree to participate, they provide written informed consent, complete a questionnaire that includes demographic, medical and family history information, and donate a blood sample. 4-7 ml of blood is drawn into each of 3 ACD Solution A tubes and is used for genomic DNA extraction and the establishment of an EBV-transformed lymphoblastoid cell culture, cell pellet in Trizol, and plasma. Controls were matched to CRC cases as 1:1. Matching was done on age at reference time (age_ref), race, and sex. Age_ref was matched on +-5 years. Sex and race were matched exactly. For the cases without matched controls, matching was done only on sex and race with 1:1 ratio. Since controls are fewer than cases, one control is matched on 2 cases at most. Health Professionals Follow-up Study (HPFS): A parallel prospective study to the NHS (Nurses' Health Study). The HPFS cohort comprised 51,529 men aged 40-75 who, in 1986, responded to a mailed questionnaire (Rimm et al., 1990) [PMID: 2090285]. Participants provided information on health related exposures, including current and past smoking history, age, weight, height, diet, physical activity, aspirin use, and family history of colorectal cancer. Colorectal cancer and other outcomes were reported by participants or next-of-kin and were followed up through review of the medical and pathology record by physicians. Overall, more than 97% of self-reported colorectal cancers were confirmed by medical record review. Information was abstracted on histology and primary location. Incident cases were defined as those occurring after the subject provided the blood sample. Prevalent cases were defined as those occurring after enrollment in the study but before the subject provided the blood sample. Follow-up evaluation has been excellent, with 94% of the men responding to date. Colorectal cancer cases were ascertained through January 1, 2008. In 1993-1995, 18,825 men in the HPFS mailed blood samples by overnight courier, which were aliquoted into buffy coat and stored in liquid nitrogen. In 2001-2004, 13,956 men in the HPFS who had not provided a blood sample previously mailed in a swish-and-spit sample of buccal cells. Incident cases were defined as those occurring after the subject provided a blood or buccal sample. Prevalent cases were defined as those occurring after enrollment in the study in 1986, but before the subject provided either a blood or buccal sample. After excluding participants with histories of cancer (except nonmelanoma skin cancer), ulcerative colitis, or familial polyposis, case-control sets were previously constructed. In addition to colorectal cancer cases and controls, a set of adenoma cases and matched controls with available DNA from buffy coat were selected for genotyping. Over the follow-up period, data were collected on endoscopic screening practices and, if individuals had been diagnosed with a polyp, the polyps were confirmed to be adenomatous by medical record review. Adenoma cases were ascertained through January 1, 2008. A separate case-control set was constructed of participants diagnosed with advanced adenoma matched to control participants who underwent a lower endoscopy in the same time period and did not have an adenoma. Advanced adenoma was defined as an adenoma 1 cm or larger in diameter and/or with tubulovillous, villous, or highgrade dysplasia/carcinoma-in-situ histology. Matching criteria included year of birth (within 1 year) and month/ year of blood sampling (within 6 months), the reason for their lower endoscopy (screening, family history, or symptoms), and the time period of any prior endoscopy (within 2 years). Controls matched to cases with a distal adenoma either had a negative sigmoidoscopy or colonoscopy examination, and controls matched to cases with proximal adenoma all had a negative colonoscopy. In total, 159 advanced adenoma cases and 109 controls were selected for genotyping. Leeds Colorectal Cancer Study (LCCS): Following local ethical approval, colorectal cancer cases were recruited from 1997 until 2012 in Leeds, UK through surgical clinics. Initially, funding was provided by the UK Ministry of Agriculture, Farming and Fisheries (subsequently the Food Standards Agency) and Imperial Cancer Research Fund (subsequently Cancer Research UK). Recruitment also occurred similarly in Dundee, Perth and York between the periods of 1997 and 2001 using the same protocol and the data and samples were combined. Pathologically confirmed cases were consented at outpatient clinics, providing information on known and postulated risk factors for colorectal cancer (diet, lifestyle and family history) as well as providing a blood sample for DNA. Exclusion criteria included pre-existing diverticular disease and an inability to complete the questionnaire. The General Practitioners of cases (all UK residents have a nominated General Practitioner to whom to refer initial medical queries) and these GPs were asked to send letters to other persons on their patient list of the same gender and born within 5 years of the case. Subsequently to enhance the number of controls, we systematically invited patients from selected GP practices. Diet was assessed in cases and controls using an extensive dietary and lifestyle questionnaire modified by that produced by the European Prospective Investigation in Cancer (EPIC). The frequency that each specific food items were eaten was recorded and we also obtained average fruit and vegetable consumption as a cross-check. In total, 1591 cases and 739 controls provided a DNA sample. The North Carolina Colon Cancer Studies (NCCCS I/II): The North Carolina Colon Cancer Studies (NCCCS I- colon and NCCCS II-rectal) were population-based case-control studies conducted in 33 counties of North Carolina. Cases were identified using the rapid case ascertainment system of the North Carolina Central Cancer Registry. Patients with a first diagnosis of histologically confirmed invasive adenocarcinoma of the colon (cecum through sigmoid colon) between October 1996 and September 2000 were classified as potential cases in the NCCCS I. The NCCCS II included patients with a first diagnosis of histologically confirmed invasive adenocarcinoma of the sigmoid colon, rectosigmoid, or rectum (hereafter collectively referred to as rectal cancer) between May 2001 and September 2006. Additional eligibility requirements were: aged 40-80 years, residence in one of the 33 counties, ability to give informed consent and complete an interview, had a driver's license or identification card issued by the North Carolina Department of Motor Vehicles (if under the age of 65), and had no objections from the primary physician in regards to contacting the individual. Controls, identified and sampled during the respective study dates, were selected from two sources. Potential controls under the age of 65 were identified using the North Carolina Department of Motor Vehicles records. For those 65 years and older, records from the Center for Medicare and Medicaid Services were used. Controls were matched to cases using randomized recruitment strategies. Recruitment probabilities were done using strata of 5-year age, sex, and race groups. Dietary information was collected using a modified version of the semiquantitative food frequency questionnaire developed at the National Cancer Institute. In addition, participants were asked about vitamin and mineral supplementation, special diets, restaurant eating, sodium use, and fats used in cooking. In NCCCS I, 515 colorectal cases and 687 matched controls were sent for genotyping. In NCCCS II, 796 colorectal cases and 823 controls were sent from the NCCCS II for genotyping. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age, race, and sex. Age was matched on +-5 years. Race and sex was matched exactly. For the cases without matched controls, matching was done only on sex and race. Nurses Health Study (NHS): The NHS cohort began in 1976 when 121,700 married female registered nurses age 30-55 years returned the initial questionnaire that ascertained a variety of important health-related exposures (Belanger et al., 1978) [PMID: 248266]. Since 1976, follow-up questionnaires have been mailed every 2 years. Colorectal cancer and other outcomes were reported by participants or next-of-kin and followed up through review of the medical and pathology record by physicians. Overall, more than 97% of self-reported colorectal cancers were confirmed by medical-record review. Information was abstracted on histology and primary location. The rate of follow-up evaluation has been high: as a proportion of the total possible follow-up time, follow-up evaluation has been more than 92%. Colorectal cancer cases were ascertained through June 1, 2008. In 1989 -1990, 32,826 women in NHS I mailed blood samples by overnight courier, which were aliquoted into buffy coat and stored in liquid nitrogen. In 2001-2004, 29,684 women in NHS I who did not previously provide a blood sample mailed a swish-and-spit sample of buccal cells. Incident cases were defined as those occurring after the subject provided a blood or buccal sample. Prevalent cases were defined as those occurring after enrollment in the study in 1976 but before the subject provided either a blood or buccal sample. After excluding participants with histories of cancer (except nonmelanoma skin cancer), ulcerative colitis, or familial polyposis, case-control sets were previously constructed from which DNA was isolated from either buffy coat or buccal cells for genotyping. In addition to colorectal cancer cases and controls, a set of advanced adenoma cases and matched controls with available DNA from buffy coat were selected for genotyping. Over the follow-up period, data were collected on endoscopic screening practices and, if individuals had been diagnosed with a polyp, the polyps were confirmed to be adenomatous by medical record review. Adenoma cases were ascertained through June 1, 2011. A separate case-control set was constructed of participants diagnosed with advanced adenoma matched to control participants who underwent a lower endoscopy in the same time period and did not have an adenoma. Advanced adenoma was defined as an adenoma more than 1 cm in diameter and/or with tubulovillous, villous, or high-grade dysplasia/carcinoma-in-situ histology. Matching criteria included year of birth (within 1 year) and month/year of blood sampling (within 6 months), the reason for their lower endoscopy (screening, family history, or symptoms), and the time period of any prior endoscopy (within 2 years). Controls matched to cases with a distal adenoma either had a negative sigmoidoscopy or colonoscopy examination, and controls matched to cases with proximal adenoma all had a negative colonoscopy. A total of 272 cases and 236 matched controls were sent to CIDR for the advanced adenoma case-control set. Northern Swedish Health and Disease Study (NSHDS): Comprises over 110,000 participants, including approximately one third with repeated sampling occasions, from three population-based cohorts (Dahlin et al., 2010; Myte et al., 2016) [PMID: 20197478; PMID: 27367522]. The largest is the ongoing Vasterbotten Intervention Programme, in which all residents of Vasterbotten County are invited to a health examination upon turning 30 (some years), 40, 50 and 60 years of age. Extensive measured and self-reported health and lifestyle data, as well as blood samples for central biobanking in Umea, Sweden, are collected at the health exam. Leucocyte DNA samples for 1:1-matched CRC case-control sets from the NSHDS, of which 878 samples are included in this study, have been selected for genotyping. This is in addition to 354 samples from the NSHDS previously analyzed as part of the multicenter EPIC cohort. Cancer-specific and overall survival data are available for all patients. For at least 425 patients, archival tumor tissue has been analyzed for the BRAF V600E mutation and by sequencing codon 12 and 13 for KRAS mutations, as well as for MSI screening status by immunohistochemistry and for an eight-gene CIMP panel using quantitative real-time PCR (MethyLight). Ohio Colorectal Cancer Prevention Initiative (OCCPI, OSUMC): OCCPI (ClinicalTrials.gov identifier: NCT01850654) is a population-based study of colorectal cancer patients diagnosed in one of 51 hospitals throughout the state of Ohio from January 1, 2013 through December 31, 2016. The OCCPI was created to decrease CRC incidence in Ohio by identifying patients with hereditary predisposition (statewide universal tumor screening for newly diagnosed CRC patients), increase colonoscopy compliance for first-degree relatives of CRC patients, and encourage future research through the creation of a biorepository. The 51 Ohio hospitals participating in the OCCPI were selected to represent a cross-section of clinical centers in the state based on high reported volume of CRC patients, affiliation with a high volume hospital, or interest in participation. Institutional Review Board (IRB) approval was obtained by the individual hospitals, Community Oncology Programs, or by ceding review to the OSU IRB. Written informed consent was obtained. A total of 2139 colorectal cases were genotyped. Patients were considered eligible for this study if they were age 18 or older at the time of enrollment, if they had a surgical resection (or biopsy if unresectable) in the state of Ohio demonstrating an adenocarcinoma of the colorectum from 1/1/13 - 12/31/16. Matched control samples were selected from the Ohio State University Medical Center's (OSUMC) Human Genetics Sample Bank in an identical way to the selection for the Columbus-area HNPCC Study (please refer to the description for the Columbus-area HNPCC Study). Prostate, Lung, Colorectal and Ovarian Cancer Screening Trail (PLCO): PLCO enrolled 154,934 participants (men and women, aged between 55 and 74 years) at ten centers into a large, randomized, two-arm trial to determine the effectiveness of screening to reduce cancer mortality. Sequential blood samples were collected from participants assigned to the screening arm. Participation was 93% at the baseline blood draw. In the observational (control) arm, buccal cells were collected via mail using the "swish-and-spit" protocol and participation rate was 65%. Details of this study have been previously described (Huang et al., 2016) [PMID: 27673363] and are available online (http://dcp.cancer.gov/plco). For this study 1651 advanced adenoma cases and 1392 controls were selected for genotyping. Selenium and Vitamin E Prevention Trial (SELECT): The Selenium and Vitamin E Cancer Prevention Trial (SELECT) was a double-blind, placebo controlled clinical trial which explored using selenium and vitamin E alone and in combination to prevent prostate cancer in healthy men (Lippman et al., 2009) [PMID: 19066370]. Secondary endpoints included the prevention of colorectal and lung cancers. SELECT was conducted at 427 sites and centers in the United States, Canada and Puerto Rico; 35,533 men 55 years and older (50 or older if African American) were randomized beginning August 22, 2001. Supplementation was discontinued on October 23, 2008 due to futility. 308 colorectal cancer cases and 308 matched controls were selected from the SELECT population and sent for genotyping. Screening Markers For Colorectal Disease Study and Colonoscopy and Health Study (SMS-REACH): Details on this study population were previously reported (Burnett-Hartman et al., 2014) [PMID: 24875374]. Participants were enrollees in an integrated health-care delivery system in western Washington State (Group Health Cooperative, Seattle, Washington) aged 24-79 years who underwent an index colonoscopy for any indication between 1998 and 2007 and donated a buccal-cell or blood sample for genotyping analysis. Study recruitment took place in 2 phases, with phase 1 occurring in 1998-2003 and phase 2 occurring in 2004-2007. Persons who had undergone a colonoscopy less than 1 year prior to the index colonoscopy, persons with inadequate bowel preparation for the index colonoscopy, and persons with a prior or new diagnosis of colorectal cancer, a familial colorectal cancer syndrome (such as familial adenomatous polyposis), or another colorectal disease were ineligible. Patients diagnosed with adenomas or serrated polyps and persons who were polyp-free at the index colonoscopy (controls) were systematically recruited during both phases of recruitment. Approximately 75% agreed to participate and provided written informed consent. Based on medical records, persons who agreed to participate and those who refused study participation were similar with respect to age, sex, and colorectal polyp status. Study protocols were approved by the institutional review boards of the Group Health Cooperative and the Fred Hutchinson Cancer Research Center (Seattle, Washington). A total of 575 cases and 508 matched were selected for the study. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age_ref, race, and sex. Age_ref was matched on +-5 years. The Women's Health Initiative (WHI): WHI is a long-term national health study that has focused on strategies for preventing heart disease, breast and colorectal cancer, and osteoporotic fractures in postmenopausal women. The original WHI study included 161,808 postmenopausal women enrolled between 1993 and 1998. The Fred Hutchinson Cancer Research Center in Seattle, WA serves as the WHI Clinical Coordinating Center for data collection, management, and analysis of the WHI. The WHI has two major parts: a partial factorial randomized Clinical Trial (CT) and an Observational Study (OS); both were conducted at 40 Clinical Centers nationwide. The CT enrolled 68,132 postmenopausal women between the ages of 50-79 into trials testing three prevention strategies. If eligible, women could choose to enroll in one, two, or all three of the trial components. The components are: Hormone Therapy Trials (HT): This double-blind component examined the effects of combined hormones or estrogen alone on the prevention of coronary heart disease and osteoporotic fractures, and associated risk for breast cancer. Women participating in this component with an intact uterus were randomized to estrogen plus progestin (conjugated equine estrogens [CEE], 0.625 mg/d plus medroxyprogesterone acetate [MPA] 2.5 mg/d] or a matching placebo. Women with prior hysterectomy were randomized to CEE or placebo. Both trials were stopped early, in July 2002 and March 2004, respectively, based on adverse effects. All HT participants continued to be followed without intervention until close-out. Dietary Modification Trial (DM): The Dietary Modification component evaluated the effect of a low-fat and high fruit, vegetable and grain diet on the prevention of breast and colorectal cancers and coronary heart disease. Study participants were randomized to either their usual eating pattern or a low-fat dietary pattern. Calcium/Vitamin D Trial (CaD): This double-blind component began 1 to 2 years after a woman joined one or both of the other clinical trial components. It evaluated the effect of calcium and vitamin D supplementation on the prevention of osteoporotic fractures and colorectal cancer. Women in this component were randomized to calcium (1000 mg/d) and vitamin D (400 IU/d) supplements or a matching placebo. The Observational Study (OS)examines the relationship between lifestyle, environmental, medical and molecular risk factors and specific measures of health or disease outcomes. This component involves tracking the medical history and health habits of 93,676 women not participating in the CT. Recruitment for the observational study was completed in 1998 and participants were followed annually for 8 to 12 years. All centrally confirmed cases of invasive colorectal cancers, or deaths from colorectal cancer were selected as potential cases from September 30, 2015 database. Controls were participants free of colorectal cancer (invasive or in situ) as of September 30, 2015. Potential cases and controls were excluded if they (1) were non-White; (2) had history of colorectal cancers at baseline; (3) lost to follow-up after enrollment; (4) DbGAP ineligible; (5) had <1.25ug of DNA; (6) selected for WHI study M26 Phase I or II; (7) selected for WHI study AS224 and also included in the imputation project. A total of 578 cases and 104,429 controls met the eligibility criteria. Each case was matched with 1 control (1:1) that exactly met the following matching criteria: age (+-5 years), 40 randomization centers (exact), WHI date (+-3 years), CaD date (+-3 years), OS flag (exact), HRT assignments (exact), DM assignments (exact), and CaD assignments (exact). Control selection was done in a time-forward manner, selecting one control for each case from the risk set at the time of the case's event. The matching algorithm was allowed to select the closest match based on a criteria to minimize an overall distance measure (Bergstralh EJ, Kosanke JL. Computerized matching of cases to controls. Technical Report #56, Department of Health Sciences Research, Mayo Clinic, Rochester MN. April 1995). Each matching factor was given the same weight. When exact matches could not be found, the matching criteria were gradually relaxed among unmatched cases and controls until all cases had found matched controls. Using the matching criteria specified above, 559 of the 578 eligible cases found exact matches. The matching criteria was then relaxed to : Age+-5, randomization centers, WHI date +- 3 years, CaD date +- 3 years, OS flag, HRT flag, DM flag, CaD flag. 17 of the remaining 19 unmatched cases found matched controls. By matching on Age+-5, randomization centers, WHI date +- 3 years, CaD date +- 3 years, OS flag, HRT flag, the remaining 2 unmatched cases found their matches.
A deeper understanding of the pathological mechanisms of SARS-CoV-2 infection is required to combat COVID-19. Through this dataset, we analyze postmortem lung cells from patients that are infected/uninfected with SARS-CoV-2 with snRNA-seq.
A common symptom during late stage breast cancer disease is pleural effusion, which is related to poor prognosis. In pleural effusions malignant cells can be detected, which indicates a metastatic spread from the primary tumor site. Beside the usefulness for investigation of the metastatic process, it has been a source to isolate cells with putative cancer stem cell (CSC) properties. For this study, pleural effusion aspirate from 25 metastatic breast cancer patients were processed to establish in vitro culture. Patient-derived aspirates were cultured under sphere forming conditions and isolated primary cultures were sorted for cancer stem cell populations ALDH1+ and CD44+CD24-/low and sphere forming efficiency of CSC and non-CSC subpopulation was determined. Unsorted cells had a higher potential to form spheres compared to CSC subpopulations. Subsequently, DNA was isolated from pleural effusions before and after cell sorting, to compare their copy number profiles generated by low-coverage whole genome sequencing to DNA from FFPE primary tumors or bone metastasis. Unbalanced profiles could be detected in four pleural samples before sorting, in two ALDH1+ sorted samples, in one CD44+CD24-/low and in all control samples. Additionally, spheroid culture from one pleural sample could be passaged on and was characterized. Therefore, pleural effusions are an appropriate media for enrichment of putative CSC. However, low-coverage sequencing is difficult due to low quantities of abnormal cells compared to normal cells.
Charcot Marie Tooth disease (CMT) is an umbrella term that covers any inherited peripheral neuropathy. People with CMT have a problem with the nerves that go to the feet and hands that cause muscle and sensation loss, as well as difficulty with balance. There are at least forty genes that, when mutated, cause CMT. The purpose of this study is to look at the natural history of CMT to see how it changes over time. Particular emphasis will be put on studying people with CMT1B, CMT2A, CMT4A, and CMT4C, though all people with CMT are encouraged to participate. Participants are invited back on a yearly basis to determine how the changes are occurring. Objectives: Determine the natural history and genotype-phenotype correlations of disease-causing mutations in CMT1B, CMT2A, CMT4A and CMT4C as well as other forms of CMT Determine the capability of the newly developed CMTPeds score and the Minimal Dataset to measure impairment and perform longitudinal measurements in patients with multiple forms of CMT over a ten-year window This is a longitudinal study of individuals with CMT. Study participants will be invited to be re-evaluated every year. Evaluations will consist of neurological histories and examinations, selected nerve conduction studies (NCS) as well as completion of assorted clinical outcome measures including the CMTNS, Minimal Dataset, and Peds CMT Scale. Selected CMT patients and controls will also receive glabrous skin biopsies.
Gynecologic malignancy is one of the leading causes of mortality in female adults worldwide. Comprehensive genomic analysis has revealed a list of molecular aberrations that are essential to tumorigenesis, progression, and metastasis of gynecologic tumors. However, targeting such alterations has frequently led to treatment failures due to underlying genomic complexity and activation of various tumor cell survival pathway molecules. A compilation of molecular characterization of tumors with pharmacological drug response is the next step towards clinical application of patient-tailored treatment regimens. Toward this goal, we have established a library of 139 gynecologic tumors including cervical, endometrial, and epithelial ovarian cancers (EOCs) and uterine sarcomas that were genomically and/or pharmacologically annotated and explored dynamic pharmacogenomic associations against 37 molecularly targeted drugs. We discovered lineage-specific drug sensitivities based on subcategorization of gynecologic tumors and identified TP53 mutation as a molecular determinant that elicit therapeutic response to poly (ADP-Ribose) polymerase (PARP) inhibitor. We further identified transcriptome expression of inhibitor of DNA biding 2 (ID2) as a potential predictive biomarker for treatment response to olaparib. Together, our results demonstrate potential utility of rapid drug screening combined with genomic profiling for precision treatment of gynecologic cancers.
Genomic profiling can provide prognostic and predictive information to guide clinical care. Biomarkers that reliably predict patient response to chemotherapy and immune checkpoint inhibition in gastric cancer are lacking. In this work, we used our machine learning algorithm NTriPath to identify a gastric-cancer specific 32-gene signature. Using unsupervised clustering on expression levels of these 32 genes in tumors from 567 patients, we identified four molecular subtypes that were prognostic for survival. We then built a support vector machine with linear kernel and the binary classifier to generate a risk score that is prognostic for 5-year overall survival and validated the risk score using three independent datasets. We also found that the molecular subtypes predicted response to adjuvant 5-fluorouracil and platinum after gastrectomy and immune checkpoint inhibitors in patients with metastatic or recurrent disease. The 32-gene signature is a promising prognostic and predictive biomarker to guide the clinical care of gastric cancer patients and should be validated in a prospective manner. This EGA archive includes both St.Mary hospital cohort samples (n=28) and Yonsei immunotherapy patient samples (n=17).
Tumors arise as a result of uncontrolled cell proliferation driven by mutations in the signaling pathways that regulate the balance between cell division and differentiation. Despite of that, in the tumor, cancer stem cells seem to retain their capability to differentiate, which can lead to differential chemotherapy response. We previously showed that intestinal stem cells and differentiated cells display specific metabolic features and that they interact through metabolites. Moreover, we found that altering mitochondrial metabolism is sufficient to lead and define stem cell differentiation. In cancer, cellular metabolism is highly derailed. However, how metabolic changes in the cancer cell contribute to cell fate decisions remains unknown. Recently, the crosstalk between metabolism and epigenetics has emerged as one of the players in tumor development. Here we aim to address the impact of the altered metabolism of cancer cells on their epigenome as a regulatory axis of cell identity and fate. We use human-derived tumor organoids where we introduce genetic encoded fluorescent-based reporters to visualize cell identity and metabolism at single cell level. We combine 4D live imaging with a recently developed machine learning method to re-construct cell-lineages, trace stem cell state and metabolic changes during tumor organoid development. We find that, mimicking the tumor metabolic microenvironment (Warburg effect-driven low-glucose high-lactate) leads to changes cellular metabolism and histone acetylation. Strikingly, these alterations provoke profound changes in tumor development by increasing the population of cancer stem cells and their proliferation and differentiation dynamics.
The Gabriella Miller Kids First Pediatric Research Program (Kids First) is a trans-NIH effort initiated in response to the 2014 Gabriella Miller Kids First Research Act and supported by the NIH Common Fund. This program focuses on gene discovery in pediatric cancers and structural birth defects and the development of the Gabriella Miller Kids First Pediatric Data Resource (Kids First Data Resource). All of the WGS and phenotypic data from this study are accessible through dbGaP and kidsfirstdrc.org, where other Kids First datasets can also be accessed. Nonsyndromic craniosynostosis (NCS) is a common, major structural birth defect - due to the premature fusion of one or more cranial sutures - that requires extensive surgical correction and is associated with considerable ongoing medical problems and health care costs. Because little is known about the causes of NCS, whole genome sequencing will help advance knowledge of genetic factors contributing to the etiology of NCS. Data from this project will lead to a better understanding of biological processes involved in the etiology of NCS and provide critical insights for development of early diagnostic tools and therapeutic strategies.
Identifying genetic risk loci for Age-Related Macular Degeneration is important. Most of the loci identified to date have been found through the interrogation of common variants. This study initiates the identification of rare variants in a founder population, the Amish, and will use genotyping of known risk loci and ultimately use the whole exome chip to assess the association of Age-related Macular Degeneration with coding variants. We will further refine the Age-Related Macular Degeneration phenotype through the use of modern imaging, the OCT, to visualize the early anatomic signs of Age-Related Macular Degeneration. We hypothesize there are endophenotypes associated with specific genotypes that can be used to determine Age-related Macular Degeneration progression. These endophenotypes are hypothesized to be influenced by a combination of common and rare variants. We will relate these endophenotypes to genotypes to define the role of genetics in the progression of Age-Related Macular Degeneration and a new risk profile incorporating genotypic information.
Neuroblastoma (NB) is one of the most lethal childhood cancers due its propensity to treatment resistance. By spatial mapping of subclone geographies before and after chemotherapy across 89 tumor regions from 12 NBs, we find that densely packed territories of closely related subclones present at diagnosis are replaced under effective treatment by islands of distantly related survivor subclones, originating from a different most recent ancestor compared to lineages dominating before treatment. Conversely, in tumors that progressed under treatment, ancestors of subclones dominating later in disease are present already at diagnosis. Chemotherapy treated xenografts and cell culture models replicates these two contrasting scenarios and shows branching evolution to be a constant feature of proliferating NB cells. Phylogenies based on whole genome sequencing of 505 individual NB cells indicate that a rich repertoire of parallel subclones, emerges already with the first oncogenic mutations and lays the foundation for clonal replacement under treatment.
Primary T cell immunodeficiency disorders are genetically heterogenous. Mutations in genes required for specific stages of T cell development within the thymus can lead to reduced numbers of circulating T cells. On the hand, T cells may not egress from the thymus normally leading to profound T cell lymphopenia. As a result, patients with these disorders succumb to a range of severe opportunisitic infections unless allogeneic transplantation is performed. We wish to whole exome sequence a Caucasian family in which the index case shows features of impaired T cell egress from the thymus. Importantly, known genes implicated in T cell immunodeficiency including CORO1A have been seqeuenced and found to normal in this individual.
How to manage your EGA account Welcome to the documentation page for creating an EGA user. The European Genome-Phenome Archive (EGA) is a platform for archiving, managing, and sharing human genomic and phenotypic data. To submit, access, and manage requests for the data hosted on the EGA platform, users are required to register for an account. This documentation page provides step-by-step instructions for creating an EGA user account, as well as information on the different types of EGA user accounts and their associated privileges. You can check the EGA data flow for a better understanding. Manage EGA account step-by-step guide Contact Information Public Keys Password Authenticator Federated Identity Sessions Request an EGA account step-by-step guide Fill in this form.You cannot use a generic email account (e.g., Gmail). Make sure you use your institutional email account.Your email is by default your username.The criteria for a strong password is: It should contain between 8 and 32 charactersIt should contain at least one uppercase letterIt should contain at least one numberIt should contain at least one special characterYou will need to wait for the Helpdesk team to check your identity. Please allow two business days for this step.After the Helpdesk validation step, you will receive an email with a link to validate your institutional email.Click the link to verify your email. It will redirect you to the EGA website.As soon as your account has been validated, you will be able to start adding more information to your EGA profile. For that, you first need to log in using your credentials from step 1. Manage EGA account step-by-step guide As a logged-in user, go to your profile by clicking on Welcome > My Profile. To modify it, click the Edit button.You will see the following options: Contact Information In this section, you can modify the personal information of your EGA account. Public Keys Adding a public key is necessary for downloading data via Live Outbox distribution. If you don’t know how to generate your key, you can follow the instructions available on How do I create a key? Password In the Password tab, you can change your password anytime to ensure the security of your account. However, if you receive an email notifying you that your password was modified and you did not initiate the change, it could be a sign that your account has been compromised. In this case, you should immediately contact our Helpdesk team to report the issue and receive assistance in restoring your account security. Authenticator You can set your two-factor authentication (2FA) system for your EGA account as an identity and access management security method that requires two forms of identification to access resources and data. To obtain your code, please follow the instructions below, also available in the Authenticator tab in your profile. Sessions The Sessions tab displays a list of all devices where you are currently logged in as an EGA user. This includes any open sessions you have on your computer, phone, or other devices. By reviewing the list of active sessions, you can monitor the devices that have access to your account and ensure that there are no unauthorised logins. If you notice any suspicious activity, you can close the sessions from this page to immediately terminate access to your account from the unauthorised device.
The AVENIO ctDNA Expanded Kit is a next-generation sequencing (NGS) liquid biopsy assay with a 77 gene panel (192 kb) containing genes in U.S. National Comprehensive Cancer Network (NCCN) Guidelines and emerging cancer biomarkers. This pan-cancer assay was applied to 100 plasma samples from patients with lung cancer undergoing treatment in the OSCILLATE trial. After 150 bp paired-end sequencing, reads were aligned to the human genome reference with the AVENIO Oncology Analysis Software. These files are the sorted non-deduplicated alignments generated by the analysis software used for subsequent variant, indel and CNV calling.
GABAergic interneurons are essential for neural circuit function and their loss or dysfunction is implicated in human neuropsychiatric disease. In vitro methods for interneuron generation hold promise for studying human cellular and functional properties and ultimately therapeutic cell replacement. We used a protocol for generating cortical interneurons from hESCs and analyzed the properties and maturation timecourse of cell types using single-cell RNAseq (data available at GEO under: GSE93802 on March 10 2017). Transcriptomic profiles of the hESC-derived interneurons were compared to several different populations of cells from mid-gestation human neocortex that showed differing levels of PAX6 and SOX2 expression. For this study, 104 samples of 100 human neocortical cells each, have been sorted based on SOX2 and PAX6 expression, mRNA recovered from the fixed cells by FRISCR, and transcriptomic profiles generated by SmartSeq2. The RNA-seq data from the 104 100-cell samples is included in this dbGaP study.
Ependymomas are common childhood brain tumours that occur throughout the nervous system, but are most common in the paediatric hindbrain. Current standard therapy comprises surgery and radiation, but not cytotoxic chemotherapy as it does not further increase survival. Whole-genome and whole-exome sequencing of 47 hindbrain ependymomas reveals an extremely low mutation rate, and zero significant recurrent somatic single nucleotide variants. Although devoid of recurrent single nucleotide variants and focal copy number aberrations, poor-prognosis hindbrain ependymomas exhibit a CpG island methylator phenotype. Transcriptional silencing driven by CpG methylation converges exclusively on targets of the Polycomb repressive complex 2 which represses expression of differentiation genes through trimethylation of H3K27. CpG island methylator phenotype-positive hindbrain ependymomas are responsive to clinical drugs that target either DNA or H3K27 methylation both in vitro and in vivo. We conclude that epigenetic modifiers are the first rational therapeutic candidates for this deadly malignancy, which is epigenetically deregulated but genetically bland.
The Mutographs project aims to advance our understanding of the causes of cancer through studies of mutational signatures. Led by Mike Stratton, together with Paul Brennan, Ludmil Alexandrov, Allan Balmain, David Phillips and Peter Campbell, this large-scale international research endeavour was awarded a Cancer Research UK Grand Challenge. Different patterns of somatic mutation are generated by the different environmental, lifestyle and genetic factors that cause cancer, many of them are still unknown. Within Mutographs, the International Agency for Research on Cancer is coordinating the recruitment of 5000 individuals with cancer (colorectal, renal, pancreatic, oesophageal adenocarcinoma or oesophageal squamous cancers) across 5 continents to explore whether different mutational signatures explain marked variation in incidence. In brief, through an international network of collaborators around the world, biological materials are collected, along with demographic, histological, clinical and questionnaire data. Whole genome sequences of tumour-germline DNA pairs are generated at the Wellcome Trust Sanger Institute (Illumina HiSeqX, 40X and 20X depth respectively). Somatic mutational signatures are subsequently extracted by non-negative matrix factorisation methods and correlated with risk factors data. Through an enhanced understanding of cancer aetiology, Mutographs unprecedented effort is anticipated to outline modifiable risk factors, lead to new approaches to prevent cancer, and provide opportunities to empower early detection, refine high-risk groups and contribute to further therapeutic development.