he Mutographs project aims to advance our understanding of the causes of cancer through studies of mutational signatures. Led by Mike Stratton, together with Paul Brennan, Ludmil Alexandrov, Allan Balmain, David Phillips and Peter Campbell, this large-scale international research endeavour was awarded a Cancer Research UK Grand Challenge. Within Mutographs, work lead by the Sanger Institute will investigate whether detection of somatic mutations and mutational signatures in circulating white blood cells can be developed into a practical, generic system for surveying and monitoring multiple different endogenous and exogenous exposures, providing an ‘observatory’ on somatic mutational processes in humans. Whole genome sequences are generated at the Wellcome Sanger Institute (Illumina HiSeqX). Somatic mutational signatures are subsequently extracted by non-negative matrix factorisation methods. Through an enhanced understanding of cancer aetiology, Mutographs unprecedented effort is anticipated to outline modifiable risk factors, lead to new approaches to prevent cancer, and provide opportunities to empower early detection, refine high-risk groups and contribute to further therapeutic development.
Normal breast luminal epithelial progenitors have been implicated as cell of origin in basal-like breast cancer, but their anatomical localization remains understudied. Here, we combine micro-collection of uncultured organoids and single cell mRNA sequencing (scRNA-seq) of FACS-sorted luminal epithelial cells with multicolor imaging to profile ducts and terminal ductal lobular units (TDLUs) and compare them with breast cancer subtypes. Unsupervised clustering reveals eleven distinct clusters and a differentiation trajectory starting with keratin 15+ (K15+) progenitors enriched in ducts. Spatial mapping of luminal progenitors is confirmed at the protein level by staining with critical duct markers. Comparison of the gene expression profiles of normal luminal cells with those of breast cancer subtypes reveals a strong correlation between normal breast K15+ ductal progenitors and basal-like breast cancer. We propose that K15+ basal-like breast cancers originate in ductal progenitors, which emphasizes the importance of not only lineages but also cellular position within the ductal-lobular tree.
The prostate gland produces prostatic fluid, high in zinc and citrate and essential for the maintenance of spermatozoa. Prostate cancer is a common condition with limited treatment efficacy in castration-resistant metastatic disease, including with immune checkpoint inhibitors. We used single-cell RNA-sequencing to perform an unbiased assessment of the cellular landscape of human prostate and identified a previously unappreciated subset of tumour-enriched androgen receptor-negative luminal epithelial cells, with increased expression of cancer-associated genes. We also found a variety of innate and adaptive immune cells in normal prostate that were transcriptionally perturbed in prostate cancer. An exception was a unique, prostate-specific, zinc transporter-expressing macrophage population (MAC-MT), that contributed to tissue zinc accumulation in homeostasis, but showed enhanced inflammatory gene expression in tumours, including T cell-recruiting chemokines. Remarkably, enrichment of the MAC-MT signature in cancer biopsies was associated with improved disease-free survival, suggesting beneficial anti-tumour functions.
Single cell transcriptomics is a powerful tool to map the complex tumor microenvironment, providing unprecedented resolution into cell types, cell states, cell interactions, and tumor heterogeneity. Depending on their tissue site and whether processing begins from a fresh or frozen sample, different tumors require different optimizations in order to obtain high quality single cell transcriptomic data. In this study, we profiled a range of tumor types, varying in cell-of-origin, solid and non-solid forms, patient ages, and transitions. The tumors also further varied in their tissue site and in their sample collection method. To enable profiling of such diverse samples, we developed a systematic toolbox for single cell RNA-Seq and single nucleus RNA-Seq of fresh and frozen clinical tumor samples. We validated this toolbox for 216,490 cells and nuclei extracted from 39 samples across 23 tumors (some tumors were split into multiple samples). The 23 tumors represented eight unique tumor types. This toolbox comprises both experimental and computational protocols as well as guidelines for comparing and selecting protocols for the biological problem of interest.
Overview: Our overall long-term goal is to determine risk factors for the complex (multifactorial) disease, venous thromboembolism (VTE), that will allow physicians to stratify individual patient risk and target VTE prophylaxis to those who would benefit most. In this genome-wide association case-control study (1300 cases and 1300 controls) we hope to identify susceptibility variants for VTE. Mutations within genes encoding for important components of the anticoagulant, procoagulant, fibrinolytic, and innate immunity pathways are risk factors for VTE. We hypothesize that other genes within these four pathways or within other pathways also are VTE disease-susceptibility genes. Therefore, we performed a genome wide association (GWA) screen and analysis using the Illumina 660W platform to identify SNPs within 1,300 clinic-based, non-cancer VTE cases primarily from Minnesota and the upper Midwest USA, and 1300 clinic-based, unrelated controls frequency-matched on patient age, gender, myocardial infarction/stroke status and state of residence. This is a subset of a slightly larger candidate gene study using 1500 case-control pairs to identify haplotype-tagging SNPs (ht-SNPs) in a large set of candidate genes (n~750) within the anticoagulant, procoagulant, fibrinolytic, and innate immunity pathways. Study Populations. Cases. VTE cases were consecutive Mayo Clinic outpatients with objectively-diagnosed deep vein thrombosis (DVT) and/or pulmonary embolism (PE) residing in the upper Midwest and referred by Mayo Clinic physician to the Mayo Clinic Special Coagulation Laboratory for clinical diagnostic testing to evaluate for an acquired or inherited thrombophilia, or to the Mayo Clinic Thrombophilia Center. Any person contacted to be a control but discovered to have had a VTE was evaluated for inclusion as a case. Cases were primarily residents from Minnesota, Wisconsin, Iowa, Michigan, Illinois, North or South Dakota, Nebraska, Kansas, Missouri and Indiana. A DVT or PE was categorized as objectively diagnosed when (a) confirmed by venography or pulmonary angiography, or pathology examination of thrombus removed at surgery, or (b) if at least one non-invasive test (compression duplex ultrasonography, lung scan, computed tomography scan, magnetic resonance imaging) was positive. A VTE was defined as: Proximal leg deep vein thrombosis (DVT), which includes the common iliac, internal iliac, external iliac, common femoral, superficial [now termed "femoral"] femoral, deep femoral [sometimes referred to as "profunda" femoral] and/or popliteal veins. (Note: greater and lesser saphenous veins, or other superficial or perforator veins, were not included as proximal or distal leg DVT). Distal leg DVT (or "isolated calf DVT"), which includes the anterior tibial, posterior tibial and/or peroneal veins. (Note: gastrocnemius, soleal and/or sural [e.g., "deep muscular veins" of the calf] vein thrombosis was not included as distal leg DVT). Arm DVT, which includes the axillary, subclavian and/or innominate (brachiocephalic) veins. (Note: jugular [internal or external], cephalic and brachial vein thrombosis was not included in "arm DVT"). Hepatic, portal, splenic, superior or inferior mesenteric, and/or renal vein thrombosis. (Note: ovarian, testicular, peri-prostatic and/or pelvic vein thrombosis was not included). Cerebral vein thrombosis (includes cerebral or dural sinus or vein, saggital sinus or vein, and/or transverse sinus or vein thrombosis). Inferior vena cava (IVC) thrombosis Superior vena cava (SVC) thrombosis Pulmonary embolism Patients with VTE related to active cancer, antiphospholipid syndrome, inflammatory bowel disease, vasculitis, a rheumatoid or other autoimmune disorder, a vascular anomaly (e.g., Klippel-Trénaunay syndrome, etc.), heparin-induced thrombocytopenia, or a mechanical cause for DVT (e.g., arm DVT or SVC thrombosis related to a central venous catheter or transvenous pacemaker, portal and/or splenic vein thrombosis related to liver cirrhosis, IVC thrombosis related to retroperitoneal fibrosis, etc.), with hemodialysis arteriovenous fistula thrombosis, or with prior liver or bone marrow transplantation were excluded. Controls. A Mayo Clinic outpatient control group was prospectively recruited for this study. Controls were frequency-matched on the age group (18-29, 30-39, 40-49, 50-59, 60-69, 70-79, and 80+ years), sex, myocardial infarction/stroke status, and state of residence distribution of the cases. We selected clinic-based controls using a controls' database of persons undergoing general medical examinations in the Mayo Clinic Departments of General Internal Medicine or Primary Care Internal Medicine. Additionally persons undergoing evaluation at the Mayo Clinic Sports Medicine Center, and the Department of Family Medicine were screened for inclusion as controls. This study is part of the Gene Environment Association Studies initiative (GENEVA, http://www.genevastudy.org) funded by the trans-NIH Genes, Environment, and Health Initiative (GEI). The overarching goal is to identify novel genetic factors that contribute to venous thrombosis through large-scale genome-wide association studies of 1,300 clinic-based, VTE cases and 1300 clinic-based, unrelated controls. Genotyping was performed at the Johns Hopkins University Center for Inherited Disease Research (CIDR). Data cleaning and harmonization were done at the GEI-funded GENEVA Coordinating Center at the University of Washington.
To characterise cfDNA methylome in metastatic prostate cancer
We performed whole genome sequencing using adaptive sampling with GridION on 33 patients with clinically suspected hereditary cancer syndromes to evaluate an efficient computational workflow.
We performed target capture sequencing of ROS1-rearranged non-small cell lung cancer patients to develop and examine the efficacy of new molecular targeted drug against ROS1 tyrosine kinase.
Error-corrected next-generation sequencing of rectal mucus samples from patients suspected to have colorectal cancer. This is 1 of 4 sequencing experiments on the same sample type.