Chordoma is a rare tumor that is believed to develop from notochord remnants. We previously identified germline duplication of the TBXT gene as a major susceptibility mechanism in several chordoma families. Currently, we are using whole exome sequencing to identify additional susceptibility genes in chordoma families without TBXT duplication as well as in sporadic cases.
The cerebrospinal fluid (CSF) is thought to be a main route for immune surveillance in the central nervous system (CNS). Understanding the composition and state of CSF in healthy individuals may provide insight into perturbations present during neuroinflammation. Here, we have used single-cell RNA and TCR sequencing to profile CSF and peripheral blood mononuclear cells (PBMCs) from healthy individuals and patients with relapsing-remitting multiple sclerosis (MS), a chronic autoimmmune disease. In addition, with the goal of understanding how transcriptional artefacts are introduced in transcriptomics experiments, we have performed single-cell and TCR sequencing to profile PBMCs from a healthy individual that were processed with the same enzymatic digestion used for tissues (such as the brain) in the presence or absence of transcription inhibitors.
Whole genome sequencing data of 26 high-grade serous carcinoma (HGSC) patients (87 samples) sequenced with MGISEQ-2000 and HiSeq X Ten.
The goal of this study was to identify SNPs and CNPs that are associated with development of normal tissue toxicities resulting from radiotherapy for prostate cancer. The study population includes approximately 1,400 men treated with brachytherapy, external beam radiation therapy, or a combination of the two treatments, and assessed for adverse effects at baseline and following radiotherapy. Three toxicity endpoints were investigated using a two-stage GWAS approach: urinary morbidity, rectal bleeding and erectile dysfunction. The study sample was split into a discovery set (N=367) and a replication set (N=417), and an additional 647 samples, which were not part of this original cohort, were also included as an independent replication set. The replication set was developed via collaboration developed under the framework of the Radiogenomics Consortium (RGC). The long-term goal of this project, and other radiogenomics projects lead by the RGC, is two-fold: 1) Develop an assay capable of predicting which cancer patients are most likely to develop radiation injuries resulting from treatment with a standard RT protocol, and 2) Obtain information to assist with the elucidation of the molecular pathways responsible for radiation-induced normal tissue toxicities. These studies focus on multiple cancer types including prostate, breast, lung, and head and neck cancers.
The opportunities for sequencing-based methylome analysis of clinical samples are increasing. To reduce its cost and the amount of genomic DNA required for library preparation, we aimed to establish a capture methyl-seq protocol, which adopts pre-pooling of multiple libraries before hybridization capture and TET2/APOBEC-mediated conversion of unmethylated cytosine to thymine. We compared a publicly available dataset generated by the standard Agilent protocol of SureSelect XT Human Methyl-Seq Kit and our dataset obtained by our modified protocol, EMCap, that adopted sample pre-pooling and enzymatic conversion. We confirmed that the quality of DNA methylation data was comparable between the two datasets. As our protocol, EMCap, is more cost-effective and reduces the amount of input genomic DNA, it would serve as a better choice for clinical methylome sequencing.
In Saudi Arabia, thyroid cancer ranked second only to breast cancer among females, eleventh among males and this increased incidence is prevalent in other Gulf Council Countries also. Differentiated thyroid cancer, which includes papillary thyroid cancer (PTC) and follicular subtypes, accounts for 90% of all thyroid malignancies. Currently, there are no clinical or molecular diagnostic tools to predict recurrence and aggressiveness of a subset of PTC. With the goal of better understanding the pathobiology of thyroid carcinogenesis we will use Whole Exome Sequencing to decipher the molecular and genetic signature of Saudi PTC based on International Cancer Genome Consortium (ICGC) Guidelines. The study will provide a better understanding of PTC and should have important clinical implications in the development of new and better strategies for targeted therapeutic intervention for the treatment of Saudi Arabian PTC
The Multi-Ethnic Study of Atherosclerosis (MESA) is a study of the characteristics of subclinical cardiovascular disease (disease detected non-invasively before it has produced clinical signs and symptoms) and the risk factors that predict progression to clinically overt cardiovascular disease or progression of the subclinical disease. MESA researchers study a diverse, population-based sample of 6,814 asymptomatic men and women aged 45-84. Thirty-eight percent of the recruited participants are white, 28 percent African-American, 22 percent Hispanic, and 12 percent Asian, predominantly of Chinese descent. Data available for request include phenotypic data previously released on BioLINCC as well as raw images (E.g. Echocardiogram) and raw tracing data files (e.g. Electrocardiogram). These new data are intended to be made available to interested researchers via BioData Catalyst, will include participants irrespective of participation in genetics (i.e. the full MESA cohort), and may be merged with currently posted (to dbGaP) phenotype and molecular datasets (WGS, GWA, RNA Seq, Methylation, Metabolomics, and Proteomics).
The Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) is a collaborative effort comprised of a coordinating center and scientific researchers from well-characterized cohort and case-control studies conducted in North America and Europe. This international consortium aims to accelerate the discovery of common and rare genetic risk variants for colorectal cancer by conducting large-scale meta-analyses of existing and newly generated genome-wide association study (GWAS) data, replicating and fine-mapping of GWAS discoveries, and investigating how genetic risk variants are modified by environmental risk factors. To expand these efforts, we assembled case-control sets or nested case-control sets from 20 different North American or European studies. Summary descriptions and study participant inclusions/exclusion criteria for each of these studies are detailed below. The Black Women's Health Study (BWHS): Is the largest follow-up study of the health of African-American women (Cozier et al., 2004; Rosenberg et al., 1995) [PMID: 15018884; PMID: 7722208]. The purpose is to identify and evaluate causes and preventives of cancers and other serious illnesses in African-American women. Among the diseases being studied are breast cancer, colorectal cancer, type 2 diabetes, uterine fibroids, systemic lupus erythematosus, and cardiovascular disease. The study began in 1995, when 59,000 black women from all parts of the United States enrolled through postal questionnaires. The women provided demographic and health data on the 1995 baseline questionnaire, including information on weight, height, smoking, drinking, contraceptive use, use of other selected medications, illnesses, reproductive history, physical activity, diet, use of health care, and other factors. The participants are followed through biennial questionnaires to determine the occurrence of cancers and other illnesses and to update information on risk factors. Self-reports of cancer are confirmed through medical records and state cancer registry records. Mouthwash-swish samples, as a source of DNA, were obtained from ~26,000 BWHS participants in 2002-2007. DNA was isolated from the mouthwash-swish samples at the Boston University Molecular Core Genetics Laboratory using the QIAAMP DNA Mini Kit (Qiagen). All incident colorectal cancer cases with a DNA sample were included in the present analysis. Two controls per case, selected from among BWHS participants free of colorectal cancer at end of follow-up, were matched to cases on year of birth (+/- 2 years) and geographical region of residence (Northeast, South, Midwest, and West). A total 209 colorectal cancer cases and 423 controls were sent for genotyping. Campaign Against Cancer and Heart Disease (CLUE II): The Campaign Against Cancer and Heart Disease, is a prospective cohort designed to identify biomarkers and other factors associated with risk of cancer, heart disease, and other conditions (Kakourou et al., 2015) [PMID: 26220152]. 32,894 participants were recruited from May through October 1989 from Washington County, Maryland and surrounding communities. Colorectal cancer cases (n = 297) and matched controls (n = 296) were identified between 1989 and 2000 among participants in the CLUE II cohort of Washington County, Maryland. Colorectal Cancer Study of Austria (CORSA): In the ongoing colorectal cancer study of Austria (CORSA), more than 13,000 Caucasian participants have been recruited within the province-wide screening project "Burgenland Prevention Trial of Colorectal Disease with Immunological Testing" (B-PREDICT) since 2003 (Hofer et al., 2011) [PMID: 21422235]. All inhabitants of the Austrian province Burgenland aged between 40 and 80 years are annually invited to participate in fecal immunochemical testing and haemoccult positive screening participants are invited for colonoscopy. CORSA includes genomic DNA and plasma of colorectal cancer cases, low-risk and high-risk adenomas, and colonoscopy-negative controls. Controls received a complete colonoscopy and were free of colorectal cancer or polyps. CORSA participants have been recruited in the four KRAGES hospitals in Burgenland, Austria, and additionally, at the Medical University of Vienna (Department of Surgery), the Viennese hospitals "Rudolfstiftung" and the "Sozialmedizinisches Zentrum Sud", and at the Medical University of Graz (Department of Internal Medicine). 1403 colorectal cancer and advanced colorectal adenoma cases, and 1404 matched controls were selected for the study. Distribution of factors sex and age (5 year strata) were evenly matched between cases and controls. Cancer Prevention Study II (CPS II): The CPS II Nutrition cohort is a prospective study of cancer incidence and mortality in the United States, established in 1992 and described in detail elsewhere (Calle et al., 2002; Campbell et al., 2014) [PMID: 12015775; PMID: 25472679]. At enrollment, participants completed a mailed self-administered questionnaire including information on demographic, medical, diet, and lifestyle factors. Follow-up questionnaires to update exposure information and to ascertain newly diagnosed cancers were sent biennially starting in 1997. Reported cancers were verified through medical records, state cancer registry linkage, or death certificates. The Emory University Institutional Review Board approves all aspects of the CPS II Nutrition Cohort. A total of 360 cases and 359 controls were selected for this study. Czech Republic Colorectal Cancer Study (Czech Republic CCS): Cases with positive colonoscopy results for malignancy, confirmed by histology as colon or rectal carcinomas, were recruited between September 2003 and May 2012 in several oncological departments in the Czech Republic (Prague, Pilsen, Benesov, Brno, Liberec, Ples, Pribram, Usti and Labem, and Zlin). Two control groups, sampled at the same time of cases recruitment, were included in the study. The first group consisted of hospital-based individuals with a negative colonoscopy result for malignancy or idiopathic bowel diseases. The reasons for the colonoscopy were: i) positive fecal occult blood test, ii) hemorrhoids, iii) abdominal pain of unknown origin, and iv) macroscopic bleeding. The second control group consisted of healthy blood donor volunteers from a blood donor center in Prague. All individuals were subjected to standard examinations to verify the health status for blood donation and were cancer-free at the time of the sampling. Details of CRC cases and controls have been reported previously (Vymetalkova et al., 2014; Naccarati et al., 2016; Vymetalkova et al., 2016) [PMID: 24755277; PMID: 26735576; PMID: 27803053]. All subjects were informed and provided written consent to participate in the study. They approved the use of their biological samples for genetic analyses, according to the Declaration of Helsinki. The design of the study was approved by the Ethics Committee of the Institute of Experimental Medicine, Prague, Czech Republic. All subjects included in the study were Caucasians and comprised 1792 cases and 1764 matched controls. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age and sex. Age was matched on +-5 years, whereas sex was matched exactly. For the cases without matched controls, matching was done only on sex. Early Detection Research Network (EDRN): The aim of the EDRN initiative is to develop and sustain a biorepository for support of translational research (Amin et al., 2010) [PMID: 21031013]. High-quality biospecimens were accrued and annotated with pertinent clinical, epidemiologic, molecular and genomic information. A user-friendly annotation tool and query tool was developed for this purpose. The various components of this annotation tool include: CDEs are developed from the College of American Pathologists (CAP) Cancer Checklists and North American Association of Central Cancer Registries (NAACR) standards. The CDEs provides semantic and syntactic interoperability of the data sets by describing them in the form of metadata or data descriptor. A total of 352 colorectal case samples and 399 controls were selected for this study. Controls were matched to CRC cases based on age and sex. The EPICOLON Consortium (EPICOLON): The EPICOLON Consortium comprises a prospective, multicentre and population-based epidemiology survey of the incidence and features of CRC in the Spanish population (Fernandez-Rozadilla et al., 2013) [PMID: 23350875]. Cases were selected as patients with de novo histologically confirmed diagnosis of colorectal adenocarcinoma. Patients with familial adenomatous polyposis, Lynch syndrome or inflammatory bowel disease-related CRC, and cases where patients or family refused to participate in the study were excluded. Hospital-based controls were recruited through the blood collection unit of each hospital, together with cases. All of the controls were confirmed to have no history of cancer or other neoplasm and no reported family history of CRC. Controls were randomly selected and matched with cases for hospital, sex and age (+- 5 years). A total of 370 cases and 370 controls were selected for genotyping. Hawaii Adenoma Study: For this adenoma study, two flexible-sigmoidoscopy screening clinics were first used to recruit participants on Oahu, Hawaii. Adenoma cases were identified either from the baseline examination at the Hawaii site of the Prostate Lung Colorectal and Ovarian cancer screening trial during 1996-2000 or at the Kaiser Permanente Hawaii's Gastroenterology Screening Clinic during 1995-2007. In addition, starting in 2002 and up to 2007, we also approached for recruitment all eligible patients who underwent a colonoscopy in the Kaiser Permanente Hawaii Gastroenterology Department. Cases were patients with histologically confirmed first-time adenoma(s) of the colorectum and were of Japanese, Caucasian or Hawaiian race/ethnicity. Controls were selected among patients with a normal colorectum and were individually matched to the cases on age at exam, sex, race/ethnicity, screening date (+-3 months) and clinic and type of examination (colonoscopy or flexible sigmoidoscopy). We recruited 1016 adenoma cases (67.8% of all eligible) and 1355 controls (69.2% of all eligible); 889 cases and 1169 controls agreed to give a blood and 29 cases and 34 controls, a mouthwash sample. A total of 989 cases and 1185 controls were genotyped for this study. Columbus-area HNPCC Study (HNPCC, OSUMC): Patients with colorectal adenocarcinoma diagnosed at six participating hospitals were eligible for this study, regardless of age at diagnosis or family history of cancer. Patients with a clinical diagnosis of familial adenomatous polyposis were not eligible for this study. These six hospitals perform the vast majority of all operations for CRC in the Columbus metropolitan area (population 1.7 million). The institutional review board at all participating hospitals approved the research protocol and consent form in accordance with assurances filed with and approved by the United States Department of Health and Human Services. Briefly, during the period of January 1999 through August 2004, 1,566 eligible patients with CRC were accrued to the study (Hampel et al., 2008) [PMID 18809606]. A total of 1472 colorectal cancer samples had enough blood DNA remaining to be sent for genotyping. Control samples were provided by the Ohio State University Medical Center%#39;s (OSUMC) Human Genetics Sample Bank. The Columbus Area Controls Sample Bank is a collection of control samples for use in human genetics research that includes both donors' anonymized biological specimens and linked phenotypic data. The data and samples are collected under the protocol "Collection and Storage of Controls for Genetics Research Studies", which is approved by the Biomedical Sciences Institutional Review Board at OSUMC. Recruitment takes place in OSUMC primary care and internal medicine clinics. If individuals agree to participate, they provide written informed consent, complete a questionnaire that includes demographic, medical and family history information, and donate a blood sample. 4-7 ml of blood is drawn into each of 3 ACD Solution A tubes and is used for genomic DNA extraction and the establishment of an EBV-transformed lymphoblastoid cell culture, cell pellet in Trizol, and plasma. Controls were matched to CRC cases as 1:1. Matching was done on age at reference time (age_ref), race, and sex. Age_ref was matched on +-5 years. Sex and race were matched exactly. For the cases without matched controls, matching was done only on sex and race with 1:1 ratio. Since controls are fewer than cases, one control is matched on 2 cases at most. Health Professionals Follow-up Study (HPFS): A parallel prospective study to the NHS (Nurses' Health Study). The HPFS cohort comprised 51,529 men aged 40-75 who, in 1986, responded to a mailed questionnaire (Rimm et al., 1990) [PMID: 2090285]. Participants provided information on health related exposures, including current and past smoking history, age, weight, height, diet, physical activity, aspirin use, and family history of colorectal cancer. Colorectal cancer and other outcomes were reported by participants or next-of-kin and were followed up through review of the medical and pathology record by physicians. Overall, more than 97% of self-reported colorectal cancers were confirmed by medical record review. Information was abstracted on histology and primary location. Incident cases were defined as those occurring after the subject provided the blood sample. Prevalent cases were defined as those occurring after enrollment in the study but before the subject provided the blood sample. Follow-up evaluation has been excellent, with 94% of the men responding to date. Colorectal cancer cases were ascertained through January 1, 2008. In 1993-1995, 18,825 men in the HPFS mailed blood samples by overnight courier, which were aliquoted into buffy coat and stored in liquid nitrogen. In 2001-2004, 13,956 men in the HPFS who had not provided a blood sample previously mailed in a swish-and-spit sample of buccal cells. Incident cases were defined as those occurring after the subject provided a blood or buccal sample. Prevalent cases were defined as those occurring after enrollment in the study in 1986, but before the subject provided either a blood or buccal sample. After excluding participants with histories of cancer (except nonmelanoma skin cancer), ulcerative colitis, or familial polyposis, case-control sets were previously constructed. In addition to colorectal cancer cases and controls, a set of adenoma cases and matched controls with available DNA from buffy coat were selected for genotyping. Over the follow-up period, data were collected on endoscopic screening practices and, if individuals had been diagnosed with a polyp, the polyps were confirmed to be adenomatous by medical record review. Adenoma cases were ascertained through January 1, 2008. A separate case-control set was constructed of participants diagnosed with advanced adenoma matched to control participants who underwent a lower endoscopy in the same time period and did not have an adenoma. Advanced adenoma was defined as an adenoma 1 cm or larger in diameter and/or with tubulovillous, villous, or highgrade dysplasia/carcinoma-in-situ histology. Matching criteria included year of birth (within 1 year) and month/ year of blood sampling (within 6 months), the reason for their lower endoscopy (screening, family history, or symptoms), and the time period of any prior endoscopy (within 2 years). Controls matched to cases with a distal adenoma either had a negative sigmoidoscopy or colonoscopy examination, and controls matched to cases with proximal adenoma all had a negative colonoscopy. In total, 159 advanced adenoma cases and 109 controls were selected for genotyping. Leeds Colorectal Cancer Study (LCCS): Following local ethical approval, colorectal cancer cases were recruited from 1997 until 2012 in Leeds, UK through surgical clinics. Initially, funding was provided by the UK Ministry of Agriculture, Farming and Fisheries (subsequently the Food Standards Agency) and Imperial Cancer Research Fund (subsequently Cancer Research UK). Recruitment also occurred similarly in Dundee, Perth and York between the periods of 1997 and 2001 using the same protocol and the data and samples were combined. Pathologically confirmed cases were consented at outpatient clinics, providing information on known and postulated risk factors for colorectal cancer (diet, lifestyle and family history) as well as providing a blood sample for DNA. Exclusion criteria included pre-existing diverticular disease and an inability to complete the questionnaire. The General Practitioners of cases (all UK residents have a nominated General Practitioner to whom to refer initial medical queries) and these GPs were asked to send letters to other persons on their patient list of the same gender and born within 5 years of the case. Subsequently to enhance the number of controls, we systematically invited patients from selected GP practices. Diet was assessed in cases and controls using an extensive dietary and lifestyle questionnaire modified by that produced by the European Prospective Investigation in Cancer (EPIC). The frequency that each specific food items were eaten was recorded and we also obtained average fruit and vegetable consumption as a cross-check. In total, 1591 cases and 739 controls provided a DNA sample. The North Carolina Colon Cancer Studies (NCCCS I/II): The North Carolina Colon Cancer Studies (NCCCS I- colon and NCCCS II-rectal) were population-based case-control studies conducted in 33 counties of North Carolina. Cases were identified using the rapid case ascertainment system of the North Carolina Central Cancer Registry. Patients with a first diagnosis of histologically confirmed invasive adenocarcinoma of the colon (cecum through sigmoid colon) between October 1996 and September 2000 were classified as potential cases in the NCCCS I. The NCCCS II included patients with a first diagnosis of histologically confirmed invasive adenocarcinoma of the sigmoid colon, rectosigmoid, or rectum (hereafter collectively referred to as rectal cancer) between May 2001 and September 2006. Additional eligibility requirements were: aged 40-80 years, residence in one of the 33 counties, ability to give informed consent and complete an interview, had a driver's license or identification card issued by the North Carolina Department of Motor Vehicles (if under the age of 65), and had no objections from the primary physician in regards to contacting the individual. Controls, identified and sampled during the respective study dates, were selected from two sources. Potential controls under the age of 65 were identified using the North Carolina Department of Motor Vehicles records. For those 65 years and older, records from the Center for Medicare and Medicaid Services were used. Controls were matched to cases using randomized recruitment strategies. Recruitment probabilities were done using strata of 5-year age, sex, and race groups. Dietary information was collected using a modified version of the semiquantitative food frequency questionnaire developed at the National Cancer Institute. In addition, participants were asked about vitamin and mineral supplementation, special diets, restaurant eating, sodium use, and fats used in cooking. In NCCCS I, 515 colorectal cases and 687 matched controls were sent for genotyping. In NCCCS II, 796 colorectal cases and 823 controls were sent from the NCCCS II for genotyping. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age, race, and sex. Age was matched on +-5 years. Race and sex was matched exactly. For the cases without matched controls, matching was done only on sex and race. Nurses Health Study (NHS): The NHS cohort began in 1976 when 121,700 married female registered nurses age 30-55 years returned the initial questionnaire that ascertained a variety of important health-related exposures (Belanger et al., 1978) [PMID: 248266]. Since 1976, follow-up questionnaires have been mailed every 2 years. Colorectal cancer and other outcomes were reported by participants or next-of-kin and followed up through review of the medical and pathology record by physicians. Overall, more than 97% of self-reported colorectal cancers were confirmed by medical-record review. Information was abstracted on histology and primary location. The rate of follow-up evaluation has been high: as a proportion of the total possible follow-up time, follow-up evaluation has been more than 92%. Colorectal cancer cases were ascertained through June 1, 2008. In 1989 -1990, 32,826 women in NHS I mailed blood samples by overnight courier, which were aliquoted into buffy coat and stored in liquid nitrogen. In 2001-2004, 29,684 women in NHS I who did not previously provide a blood sample mailed a swish-and-spit sample of buccal cells. Incident cases were defined as those occurring after the subject provided a blood or buccal sample. Prevalent cases were defined as those occurring after enrollment in the study in 1976 but before the subject provided either a blood or buccal sample. After excluding participants with histories of cancer (except nonmelanoma skin cancer), ulcerative colitis, or familial polyposis, case-control sets were previously constructed from which DNA was isolated from either buffy coat or buccal cells for genotyping. In addition to colorectal cancer cases and controls, a set of advanced adenoma cases and matched controls with available DNA from buffy coat were selected for genotyping. Over the follow-up period, data were collected on endoscopic screening practices and, if individuals had been diagnosed with a polyp, the polyps were confirmed to be adenomatous by medical record review. Adenoma cases were ascertained through June 1, 2011. A separate case-control set was constructed of participants diagnosed with advanced adenoma matched to control participants who underwent a lower endoscopy in the same time period and did not have an adenoma. Advanced adenoma was defined as an adenoma more than 1 cm in diameter and/or with tubulovillous, villous, or high-grade dysplasia/carcinoma-in-situ histology. Matching criteria included year of birth (within 1 year) and month/year of blood sampling (within 6 months), the reason for their lower endoscopy (screening, family history, or symptoms), and the time period of any prior endoscopy (within 2 years). Controls matched to cases with a distal adenoma either had a negative sigmoidoscopy or colonoscopy examination, and controls matched to cases with proximal adenoma all had a negative colonoscopy. A total of 272 cases and 236 matched controls were sent to CIDR for the advanced adenoma case-control set. Northern Swedish Health and Disease Study (NSHDS): Comprises over 110,000 participants, including approximately one third with repeated sampling occasions, from three population-based cohorts (Dahlin et al., 2010; Myte et al., 2016) [PMID: 20197478; PMID: 27367522]. The largest is the ongoing Vasterbotten Intervention Programme, in which all residents of Vasterbotten County are invited to a health examination upon turning 30 (some years), 40, 50 and 60 years of age. Extensive measured and self-reported health and lifestyle data, as well as blood samples for central biobanking in Umea, Sweden, are collected at the health exam. Leucocyte DNA samples for 1:1-matched CRC case-control sets from the NSHDS, of which 878 samples are included in this study, have been selected for genotyping. This is in addition to 354 samples from the NSHDS previously analyzed as part of the multicenter EPIC cohort. Cancer-specific and overall survival data are available for all patients. For at least 425 patients, archival tumor tissue has been analyzed for the BRAF V600E mutation and by sequencing codon 12 and 13 for KRAS mutations, as well as for MSI screening status by immunohistochemistry and for an eight-gene CIMP panel using quantitative real-time PCR (MethyLight). Ohio Colorectal Cancer Prevention Initiative (OCCPI, OSUMC): OCCPI (ClinicalTrials.gov identifier: NCT01850654) is a population-based study of colorectal cancer patients diagnosed in one of 51 hospitals throughout the state of Ohio from January 1, 2013 through December 31, 2016. The OCCPI was created to decrease CRC incidence in Ohio by identifying patients with hereditary predisposition (statewide universal tumor screening for newly diagnosed CRC patients), increase colonoscopy compliance for first-degree relatives of CRC patients, and encourage future research through the creation of a biorepository. The 51 Ohio hospitals participating in the OCCPI were selected to represent a cross-section of clinical centers in the state based on high reported volume of CRC patients, affiliation with a high volume hospital, or interest in participation. Institutional Review Board (IRB) approval was obtained by the individual hospitals, Community Oncology Programs, or by ceding review to the OSU IRB. Written informed consent was obtained. A total of 2139 colorectal cases were genotyped. Patients were considered eligible for this study if they were age 18 or older at the time of enrollment, if they had a surgical resection (or biopsy if unresectable) in the state of Ohio demonstrating an adenocarcinoma of the colorectum from 1/1/13 - 12/31/16. Matched control samples were selected from the Ohio State University Medical Center's (OSUMC) Human Genetics Sample Bank in an identical way to the selection for the Columbus-area HNPCC Study (please refer to the description for the Columbus-area HNPCC Study). Prostate, Lung, Colorectal and Ovarian Cancer Screening Trail (PLCO): PLCO enrolled 154,934 participants (men and women, aged between 55 and 74 years) at ten centers into a large, randomized, two-arm trial to determine the effectiveness of screening to reduce cancer mortality. Sequential blood samples were collected from participants assigned to the screening arm. Participation was 93% at the baseline blood draw. In the observational (control) arm, buccal cells were collected via mail using the "swish-and-spit" protocol and participation rate was 65%. Details of this study have been previously described (Huang et al., 2016) [PMID: 27673363] and are available online (http://dcp.cancer.gov/plco). For this study 1651 advanced adenoma cases and 1392 controls were selected for genotyping. Selenium and Vitamin E Prevention Trial (SELECT): The Selenium and Vitamin E Cancer Prevention Trial (SELECT) was a double-blind, placebo controlled clinical trial which explored using selenium and vitamin E alone and in combination to prevent prostate cancer in healthy men (Lippman et al., 2009) [PMID: 19066370]. Secondary endpoints included the prevention of colorectal and lung cancers. SELECT was conducted at 427 sites and centers in the United States, Canada and Puerto Rico; 35,533 men 55 years and older (50 or older if African American) were randomized beginning August 22, 2001. Supplementation was discontinued on October 23, 2008 due to futility. 308 colorectal cancer cases and 308 matched controls were selected from the SELECT population and sent for genotyping. Screening Markers For Colorectal Disease Study and Colonoscopy and Health Study (SMS-REACH): Details on this study population were previously reported (Burnett-Hartman et al., 2014) [PMID: 24875374]. Participants were enrollees in an integrated health-care delivery system in western Washington State (Group Health Cooperative, Seattle, Washington) aged 24-79 years who underwent an index colonoscopy for any indication between 1998 and 2007 and donated a buccal-cell or blood sample for genotyping analysis. Study recruitment took place in 2 phases, with phase 1 occurring in 1998-2003 and phase 2 occurring in 2004-2007. Persons who had undergone a colonoscopy less than 1 year prior to the index colonoscopy, persons with inadequate bowel preparation for the index colonoscopy, and persons with a prior or new diagnosis of colorectal cancer, a familial colorectal cancer syndrome (such as familial adenomatous polyposis), or another colorectal disease were ineligible. Patients diagnosed with adenomas or serrated polyps and persons who were polyp-free at the index colonoscopy (controls) were systematically recruited during both phases of recruitment. Approximately 75% agreed to participate and provided written informed consent. Based on medical records, persons who agreed to participate and those who refused study participation were similar with respect to age, sex, and colorectal polyp status. Study protocols were approved by the institutional review boards of the Group Health Cooperative and the Fred Hutchinson Cancer Research Center (Seattle, Washington). A total of 575 cases and 508 matched were selected for the study. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age_ref, race, and sex. Age_ref was matched on +-5 years. The Women's Health Initiative (WHI): WHI is a long-term national health study that has focused on strategies for preventing heart disease, breast and colorectal cancer, and osteoporotic fractures in postmenopausal women. The original WHI study included 161,808 postmenopausal women enrolled between 1993 and 1998. The Fred Hutchinson Cancer Research Center in Seattle, WA serves as the WHI Clinical Coordinating Center for data collection, management, and analysis of the WHI. The WHI has two major parts: a partial factorial randomized Clinical Trial (CT) and an Observational Study (OS); both were conducted at 40 Clinical Centers nationwide. The CT enrolled 68,132 postmenopausal women between the ages of 50-79 into trials testing three prevention strategies. If eligible, women could choose to enroll in one, two, or all three of the trial components. The components are: Hormone Therapy Trials (HT): This double-blind component examined the effects of combined hormones or estrogen alone on the prevention of coronary heart disease and osteoporotic fractures, and associated risk for breast cancer. Women participating in this component with an intact uterus were randomized to estrogen plus progestin (conjugated equine estrogens [CEE], 0.625 mg/d plus medroxyprogesterone acetate [MPA] 2.5 mg/d] or a matching placebo. Women with prior hysterectomy were randomized to CEE or placebo. Both trials were stopped early, in July 2002 and March 2004, respectively, based on adverse effects. All HT participants continued to be followed without intervention until close-out. Dietary Modification Trial (DM): The Dietary Modification component evaluated the effect of a low-fat and high fruit, vegetable and grain diet on the prevention of breast and colorectal cancers and coronary heart disease. Study participants were randomized to either their usual eating pattern or a low-fat dietary pattern. Calcium/Vitamin D Trial (CaD): This double-blind component began 1 to 2 years after a woman joined one or both of the other clinical trial components. It evaluated the effect of calcium and vitamin D supplementation on the prevention of osteoporotic fractures and colorectal cancer. Women in this component were randomized to calcium (1000 mg/d) and vitamin D (400 IU/d) supplements or a matching placebo. The Observational Study (OS)examines the relationship between lifestyle, environmental, medical and molecular risk factors and specific measures of health or disease outcomes. This component involves tracking the medical history and health habits of 93,676 women not participating in the CT. Recruitment for the observational study was completed in 1998 and participants were followed annually for 8 to 12 years. All centrally confirmed cases of invasive colorectal cancers, or deaths from colorectal cancer were selected as potential cases from September 30, 2015 database. Controls were participants free of colorectal cancer (invasive or in situ) as of September 30, 2015. Potential cases and controls were excluded if they (1) were non-White; (2) had history of colorectal cancers at baseline; (3) lost to follow-up after enrollment; (4) DbGAP ineligible; (5) had <1.25ug of DNA; (6) selected for WHI study M26 Phase I or II; (7) selected for WHI study AS224 and also included in the imputation project. A total of 578 cases and 104,429 controls met the eligibility criteria. Each case was matched with 1 control (1:1) that exactly met the following matching criteria: age (+-5 years), 40 randomization centers (exact), WHI date (+-3 years), CaD date (+-3 years), OS flag (exact), HRT assignments (exact), DM assignments (exact), and CaD assignments (exact). Control selection was done in a time-forward manner, selecting one control for each case from the risk set at the time of the case's event. The matching algorithm was allowed to select the closest match based on a criteria to minimize an overall distance measure (Bergstralh EJ, Kosanke JL. Computerized matching of cases to controls. Technical Report #56, Department of Health Sciences Research, Mayo Clinic, Rochester MN. April 1995). Each matching factor was given the same weight. When exact matches could not be found, the matching criteria were gradually relaxed among unmatched cases and controls until all cases had found matched controls. Using the matching criteria specified above, 559 of the 578 eligible cases found exact matches. The matching criteria was then relaxed to : Age+-5, randomization centers, WHI date +- 3 years, CaD date +- 3 years, OS flag, HRT flag, DM flag, CaD flag. 17 of the remaining 19 unmatched cases found matched controls. By matching on Age+-5, randomization centers, WHI date +- 3 years, CaD date +- 3 years, OS flag, HRT flag, the remaining 2 unmatched cases found their matches.
T-cell acute lymphoblastic leukemia (T-ALL) is an aggressive hematological malignancy, and novel therapeutics are much needed. Profiling patient leukemia’ drug sensitivities ex vivo, we discovered striking inter-patient variability in T-ALL response to a panel of targeted agents. Applying network-based systems pharmacology analyses, by integrating ex vivo pharmacotyping, bulk and single-cell RNA-seq, genomics, phospho-proteomics, and genome-wide CRISPR screening data, to examine signal circuitry, we identified the pharmacogenomic basis of T-ALL drug response and revealed a therapeutic vulnerability that is directly related to the developmental stage of leukemia cells. In conclusion, our results indicate that heterogeneity in the differentiation of T-ALL drives activation of key signaling pathways in this leukemia, providing unique opportunities for targeted therapy.
The study aims to characterize the gut metagenome of Norwegian screening participants and assess the ability of the gut metagenome to identify those at need for colonoscopy. Background: Colorectal cancer (CRC) screening reduces CRC incidence and mortality. However, current screening methods are either hampered by invasiveness or suboptimal performance, limiting their effectiveness as primary screening methods. To aid in the development of a non-invasive screening test with improved sensitivity and specificity, we have initiated a prospective biomarker study (CRCbiome), nested within a large randomized CRC screening trial in Norway. We aim to develop a microbiome-based classification algorithm to identify advanced colorectal lesions in screening participants testing positive for an immunochemical fecal occult blood test (FIT). We will also examine interactions with host factors, diet, lifestyle and prescription drugs. The prospective nature of the study also enables the analysis of changes in the gut microbiome following the removal of precancerous lesions. Methods: The CRCbiome study recruits participants enrolled in the Bowel Cancer Screening in Norway (BCSN) study, a randomized trial initiated in 2012 comparing once-only sigmoidoscopy to repeated biennial FIT, where women and men aged 50–74 years at study entry are invited to participate. Since 2017, participants randomized to FIT screening with a positive test result have been invited to join the CRCbiome study. Self-reported diet, lifestyle and demographic data are collected prior to colonoscopy after the positive FIT-test (baseline). Screening data, including colonoscopy findings are obtained from the BCSN database. Fecal samples for gut microbiome analyses are collected both before and 2 and 12 months after colonoscopy. Samples are analyzed using metagenome sequencing, with taxonomy profiles, and gene and pathway content as primary measures. CRCbiome data will also be linked to national registries to obtain information on prescription histories and cancer relevant outcomes occurring during the 10 year follow-up period. Discussion: The CRCbiome study will increase our understanding of how the gut microbiome, in combination with lifestyle and environmental factors, influences the early stages of colorectal carcinogenesis. This knowledge will be crucial to develop microbiome-based screening tools for CRC. By evaluating biomarker performance in a screening setting, using samples from the target population, the generalizability of the findings to future screening cohorts is likely to be high.
Recessive forms for many structural brain disorders (SBDs), including cerebellar hypoplasia (CBH), corpus callosum hypoplasia (CCH), cobblestone lissencephaly (COB), classical lissencephaly (LIS), microcephaly (MIC), and pontocerebellar hypoplasia (PCH), have been described. Patients usually present to the clinic with neurodevelopmental disorders (NDDs), including autism, epilepsy, intellectual disability, and cerebral palsy. Subsequent brain MRI usually leads to the diagnosis of an SBD. While a handful of genes that cause these diseases have been identified, most patients do not have mutations in these genes, suggesting that there is still much work to be done in identifying the genetic causes of these disorders. We have recruited a cohort of families with inherited SBDs, most from families with documented parental consanguinity with more than one affected member. Current approaches utilizing exome sequencing of SBD affected members still leave most cases unsolved, in part because there are too many possible variants to consider. Linkage analysis will allow us to extract all the useful genetic information from the family, taking advantage of each genetically informative meiosis in each family. We propose to analyze up to 100 SBD families for 500 samples per year for 4 years, through the CIDR SNP Genotyping Service, in parallel with already-funded exome sequencing of affects, leveraging off an already funded Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) effort to identify new causes of SBDs. Understanding the pathogenetic mechanisms underlying recessive pediatric SBDs will lead to better diagnostic and therapeutic opportunities for patients, and may shed light on complex NDDs. Ultimately, this work will identify new genetic disease loci to provide insight into pediatric SBD, with the hope to mitigate the impact of SBD worldwide.Public Health Significance: NDDs affect approximately 4-­6% of the general population, most notably children, and are evident in disorders, such as intellectual disability, epilepsy, and autism, with estimated yearly costs of $51.2B for intellectual disability, $11.5B for cerebral palsy, and $2.5B for visual impairment, making up >5% of total health care costs. Recessive NDDs impose enormous personal, social, and economic costs because of the early onset and the lifetime of medical dependence that often ensues. Genetic testing is critical to the diagnosis and treatment of these diseases. Many of the diseases under study have no treatment or cure. Structural brain diseases (SBDs) underlie a large percentage of these diseases, and are defined loosely as any condition in which defects in the structure or volume of a key brain component. SBDs are divided into major categories based upon the anatomical brain location. The major types involve the anatomical structures of the developing neural tube, forebrain, hindbrain, cerebral cortex, and those affecting overall brain size. For the purposes of this application, we have excluded SBDs resulting from environmental causes (i.e. prematurity-­induced), those below the brainstem, or those unlikely to be due to a germline or de novo mutation (i.e. focal SBDs or simplex disease).
We are aiming to investigate repair of a double strand break (DSB) within the genome in the presence and absence of the BLOOM protein. Zinc Finger Nucleases introduce DSBs at specified loci within the genome. Using sequencing we will assess the size of the deletion following repair. Protocol1. Transfect normal and BLOOM deficient human iPS cells with ZFNs, using AMXA2. Harvest cells after 5 days3. Perform column extraction of DNA4. PCR-amplify the ZFN region 5. Sequence and analyse repair of the DSBThis data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
We sought to identify MS susceptibility variants specific to the Major Histocompatibility Complex and to assess the effect of ancestral risk modification within 2652 Latinx and Hispanic individuals, as well as 2435 Black and African American individuals ascertained in collaboration with the Alliance for Research in Hispanic MS (ARHMS). All DNA samples were obtained through whole blood extraction and were genotyped on the MS Chip, an Illumina Infinium custom genotyping array that contains targeted and dense coverage of the extended MHC, specifically designed for imputation. Classical HLA alleles, SNPs, and amino acid residues were imputed simultaneously from genotyped SNPs using HLA-TAPAS and a multi-ethnic reference panel of 2504 samples from the 1000 Genomes Project. We have identified several novel susceptibility alleles which are rare in European populations including HLA-B*53:01, and we have identified an independent role for HLA-DRB1*15:01 and HLA-DQB1*06:02 on MS risk. We found a decrease in Native American ancestry in MS cases vs controls across the MHC, peaking near the previously identified MICB locus; and we have identified several susceptibility variants for which MS risk is modified by global or local ancestry. We are making data on all genotyped and imputed SNPs, classical HLA alleles, and amino acid residues available through dbGaP for the Hispanic and African American samples utilized in these analyses.
The aim of this project is to investiagte somatic mutations, including rates of mutations and signatures, and clonal dynamics in normal (non-cancerous) human tissues from the gastrointestinal tract. The project will involve the studyof normal tissues from healthy individuals and those with specific candidate diseases. We are utilising laser capture microscopy to idnetifiy and isolate specific cell populations which are then subjected to whole genome sequencing to estimate mutational burdens and begin to understand clonal structures across the normal tissues.
The aim of this project is to investiagte somatic mutations, including rates of mutations and signatures, and clonal dynamics in normal (non-cancerous) human tissues from the biliary tree. The project will involve the studyof normal tissues from healthy individuals and those with specific candidate diseases. We are utilising laser capture microscopy to idnetifiy and isolate specific cell populations which are then subjected to whole genome sequencing to estimate mutational burdens and begin to understand clonal structures across the normal tissues.
The aim of this project is to investigate somatic mutations, including rates of mutations and signatures, and clonal dynamics in normal (non-cancerous) human tissues from the gastrointestinal tract. The project will involve the study of normal tissues from healthy individuals and those with specific candidate diseases. We are utilising laser capture microscopy to identify and isolate specific cell populations which are then subjected to whole genome sequencing to estimate mutational burdens and begin to understand clonal structures across the normal tissues.
Liquid biopsy analysis refers to methods designed to detect tumour-specific material (e.g., circulating tumour cells or tumour DNA) in body fluids, such as urine or blood samples. A widely-used liquid biopsy approach consists of genotyping the circulating tumour DNA (ctDNA) through sequencing of plasma/serum DNA. Although useful in the context of metastatic disease (where the concentration of ctDNA is high), current liquid biopsy technologies show limited sensitivity of detection for the early detection of cancer, and low specificity, as cancer-related mutations accumulate in healthy tissues as part of the ageing process, thus making it difficult to distinguish these from tumour mutations, and because sequencing errors and true mutations show overlapping profiles. Therefore, developing liquid biopsy protocols with increased sensitivity and specificity represents an urgent clinical need. Here we harness extrachromosomal circular DNA (eccDNA) elements, which are circular DNA structures physically separated from the chromosomes of up to several Mbp long pervasive in human cancers, for liquid biopsy analysis. In this pilot study we will focus on the analysis of glioblastomas, because there is strong evidence for the presence of eccDNA in these tumour types, and because developing liquid biopsy approaches for brain tumours to reduce the invasiveness of brain tumour biopsies remains an unmet clinical need.
When comparing the differentiation capacities of pluripotent stem cell lines that have different genetic backgrounds, batch to batch experimental variablility poses a significant challenge, especially when trying to identify smaller effects. One way to address this issue is to differentiate several different lines in the same culture dish, thereby elimating experimental variation. In addition, it allows researchers to analyze many more lines with less experiments. Parallel single cell RNA-Seq exploits that individual cells are tagged and hence each cell can be reliably assigned to the donor of origin based on the genetic variants it contains. In addition, analyzing the genetic signature of single cells within a differentiating population can reveal differentation stages that are not easily detected in bulk RNAseq data. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Intestinal T-cell lymphomas (ITCL) are rare, aggressive T-cell lymphomas that are composed primarily of enteropathy-associated lymphomas Type 1 and type 2, and a small fraction of cases not classifiable as EATL that are classified as peripheral T-cell lymphoma, not otherwise classifiable. The molecular pathogenesis of these rare lymphomas is largely unknown. We studied a series of 34 ITCL cases using a combination of amplicon based targeted next generation sequencing and pyrosequencing for mutations in 34 genes previously reported to be mutated in other types of T-cell lymphomas and other genes known to have significant roles in normal T-cell proliferation. We identified a high percentage of cases with mutations in JAK/STAT signaling pathway members, and also in RAS family members. Notably, we did not detect GNAI-2 mutations in our North American cases of EATL type 2 by targeted pyrosequencing, which had been reported previously in Asian cases.
The NHLBI "Grand Opportunity" Exome Sequencing Project (GO-ESP), a signature project of the NHLBI Recovery Act investment, was designed to identify genetic variants in coding regions (exons) of the human genome (the "exome") that are associated with heart, lung and blood diseases. These and related diseases that are of high impact to public health and individuals from diverse racial and ethnic groups will be studied. These data may help researchers understand the causes of disease, contributing to better ways to prevent, diagnose, and treat diseases, as well as determine whether to tailor prevention and treatments to specific populations. This could lead to more effective treatments and reduce the likelihood of side effects. GO-ESP is comprised of five collaborative components: 3 cohort consortia - HeartGO, LungGO, and WHISP - and 2 sequencing centers - BroadGO and SeattleGO. Two major goals in the study of human malignancies are (i) to understand the genetic basis that underlies the predisposition of certain individuals to develop cancer and (ii) to identify the full spectrum of acquired mutations that occur in human cancers. There has been some progress in both of these areas with respect to hematologic malignancies but much remains to be discovered. As a result of recent advantages in sequencing technologies it is now feasible to sequence multiple individual whole exomes at reasonable cost. Such technologies provide a unique opportunity to transform our understanding of the biological basis of human cancer and to identify inherited genetic variants that predispose individuals to the development of hematologic malignancies. Myeloproliferative neoplasms (MPN), myelodysplastic syndrome (MDS) and acute leukemia are clonal disorders of hematopoieis. Our study involves performing whole exome sequencing of individuals with these disorders for the purpose of identifying (i) novel inherited genetic variants associated with the risk of developing of MPN/MDS/acute leukemia and (ii) novel acquired mutations associated with the development of MPN/MDS/acute leukemia.
ost adults with intellectual disabilities (ID) do not undergo genetic diagnostic investigation as part of their clinical care and have 'missed the boat' with regard to the WES and WGS genetic testing that is now being provided for children with ID. There is a dramatically increased risk of psychistric disorders in adults with ID, e.g. the risk of psychoses is 10X higher than in the general population. It remains an open question as to how much of adult ID is genetic in origin and how similar the genetic forms of adult ID are to those being diagnosed in children, in part due to survivor bias. There is also the opportunity to identify adults with treatable forms of ID, of which over 80 have been described, thus improving their clinical management. Furthermore, analysis of medical records of adults with genetic forms of ID can help to characterise the 'natural history' of individual disorders, resulting in more accurate prognoses for diagnosed children and identifying opportunities for improved management and possibly therapeutic intervention (e.g. optimal anti-epileptic therapy). Here we propose to exome sequence (to ~50X coverage) 200 adults with ID and co-morbid psychiatric disorders. This cohort has previously been assayed with chromosomal microarrays (Wolfe et al 2017 EJHG, 25, 66-72) identifying a diagnostic yield of ~11% which is comparable to the CNV diagnostic yield in various child ID cohorts (10-15%). The authors observed no substantive biases in diagnostic yield between different psychiatric diagnostic classes. The WES data will be analysed using the diagnostic workflows developed in the DDD study to ensure comparability between child and adult ID datasets. This study is intended as a pilot study to demonstrate the value of WES in adults with ID. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/ . This dataset contains all the data available for this study on 2019-08-07.
Germline blood DNA sequencing data generated in routine diagnostics of hereditary cancer using the I2HCP gene panel (~135 genes). There are 130 samples sequenced in a MiSeq machine and 108 sequenced in a HiSeq machine. There is a partial overlap between those two sets, meaning that some samples were sequenced in both machines. There is a strong enrichment in samples with copy-number variants (CNV), both single- and multi-exon, since this dataset was compiled for a benchmarking effort of CNV calling tools for genetic diagnostics.
Knowledge of the interrelationships between what we eat and the configurations of our gut microbial communities is providing important insights into how food components that are not directly metabolized by human enzymes are linked to our physiology and health status. Changing food preferences brought about by Westernization that have deleterious health effects1,2, plus rapid population expansion, ongoing challenges to sustainable agriculture, and other forces contributing to increased food insecurity, are catalyzing efforts to identify more nutritious and affordable foods3. The gut microbial community is complex, dynamic, and exhibits considerable intra- and interpersonal variation in its composition and functions. The massive number of potential interactions between its components makes it challenging to define the mechanisms by which food ingredients affect community properties. There is also a paucity of information about the ‘bioactive’ ingredients of foods that influence the fitness and expressed functions of community members. Here, plant fibres, from different sustainable sources and targeting distinct features of obese human gut microbiomes in gnotobiotic mice, were formulated into snack prototypes and used to supplement controlled diets consumed by overweight and obese adults; the results revealed fibre-specific changes in their microbiomes that were linked to changes in their plasma proteomes indicative of altered physiologic state.
Actinic keratoses (AK) are lesions of epidermal keratinocyte dysplasia and are precursors for invasive cutaneous squamous cell carcinoma (CSCC). Identifying the specific genomic alterations driving progression from normal skin-AK-invasive CSCC is challenging due to the massive ultraviolet radiation-induced mutational burden characteristic at all stages of this progression. Here, we report the largest AK whole exome sequencing study to date and perform mutational signature and candidate driver gene analysis on these lesions. We demonstrate in 37 AK, from both immunosuppressed and immunocompetent patients, that there are significant similarities to CSCC in terms of mutational burden, copy number alterations, mutational signatures and patterns of driver gene mutations. We identify 44 significantly mutated AK driver genes and confirm that these genes are similarly altered in CSCC. We identify the azathioprine mutational signature in all AK from patients exposed to the drug, providing further evidence for its role in keratinocyte carcinogenesis. CSCC differ from AK in having higher levels of intra-sample heterogeneity. Alterations in signaling pathways also differ, with immune-related signaling and TGF-β signaling significantly more mutated in CSCC. Integrating our findings with independent gene expression datasets confirms that dysregulated TGF-β signaling may represent an important event in AK-CSCC progression.