A key characteristic of osteosarcoma is extensive and complex genomic rearrangements. However, published models explaining how these genomic rearrangements occur disagree about if these rearrangements occur early in tumor progression and then are stable afterwards, or if there is ongoing genomic instability. Previous studies have employed bulk sequencing technologies to characterize genomic alterations. A limitation of this approach is that it averages all changes within the bulk sample, essentially masking intra-sample heterogeneity and making evaluation of ongoing genomic instability difficult. To overcome this limitation and compliment previous work, we utilized single-cell whole genome sequencing to quantify somatic copy number alterations (SCNA). By interrogating individual cells within a sample, we were able to examine intra-sample heterogeneity. We found that the SCNA patterns between cells within a single sample were remarkably consistent, which supports the concept that the genomes of osteosarcoma tumors are relatively stable after early genome fragmentation events.
HPV16 is the most oncogenic type of human papillomaviruses (HPV). Integration of HPV into the human genome is an important mechanism of carcinogenesis but is absent in at least 30% of HPV16+ tumors. We applied long-read whole-genome sequencing (WGS) to cervical cancer cell lines and tumors to characterize HPV16 carcinogenesis in the absence of integration. WGS of HPV16+ cervical tumor samples from Latin America revealed that 11/20 tumors with only episomal HPV (EP) had intact monomer episomes. The remaining 9 EP tumors had multimer and rearranged HPV genomes. The majority (80%) of the HPV rearrangements and deletions disrupted the E1 and E2 genes, and EP tumors overexpressed the E6 and E7 viral oncogenes, a similar profile to tumors with HPV integration. Tumors with putative multimer HPV integrations display HPV multimers and concatemers of human and viral sequences. Our data uncovered a novel mechanism for HPV16 to cause cancer without integration through aberrant episomal replication, and forming rearranged, mutated, and multimer episomes.
Tissue adaptation is required for regulatory T (Treg) cells to function within organs. Whether this program shares similarities between different tissue-localized immune populations is poorly understood. Here, we addressed this by analyzing single-cell chromatin accessibility data, including the transposable element (TE) landscape, of CD45+ immune cells from different tissues. We identified features of organ-specific tissue adaptation across different immune cells. Focusing on tissue-Treg cells, we found that the Treg tissue adaptation program was conserved in other tissue-localized immune cells, such as amphiregulin-producing Th17 cells. Accessible TEs can act as regulatory elements but their contribution to tissue adaptation is unclear. TE landscape analysis revealed an enrichment of specific transcription factor binding motifs in TE regions within accessible chromatin peaks. TEs, specifically from the LTR family, were located in enhancer regions and associated with tissue adaptation. These findings broaden our understanding of immune tissue-residency, an important step towards organ-specific immune interventions.
Transition of cytosine to thymine in CpG dinucleotides is the most frequent type of mutation in cancer. This increased mutability is commonly attributed to spontaneous deamination of 5-methylcytosine (5mC), which is normally repaired by the base-excision repair (BER) pathway. However, the contribution of 5mC deamination in the increasing diversity of cancer mutational signatures remains poorly explored. Here, we integrate mutational signatures analysis in a large series of tumor whole genomes with lineage-specific epigenomic data to draw a detailed view on 5mC deamination in cancer. We uncover tumor type-specific patterns of 5mC deamination signatures in CpG and non-CpG contexts. We demonstrate that the BER glycosylase MBD4 preferentially binds to open chromatin domains, which correlates with lower mutational burden in these domains. We validate our findings by modelling BER deficiencies in isogenic cell models. Overall, we establish MBD4 as the main actor responsible for 5mC deamination repair in humans.
IL-17 family cytokines are critical to host defense responses at cutaneous and mucosal surfaces. Whereas IL-17A, IL-17F, and IL-17C induce overlapping inflammatory cascades to promote neutrophil-mediated immunity, IL-17E/IL-25 drives type 2 immune pathways and eosinophil activity. Genetic and pharmacological studies reveal the significant contribution these cytokines play in antimicrobial and autoimmune mechanisms. However, little is known about the related family member, IL-17B, with contrasting reports of both pro- and anti-inflammatory function in rodents. We demonstrate that in the human immune system, IL-17B is functionally similar to IL-25 and elicits type 2 cytokine secretion from innate type 2 lymphocytes, NKT, and CD4+ CRTH2+ Th2 cells. Like IL-25, this activity is dependent on the IL-17RA and IL-17RB receptor subunits. Furthermore, IL-17B can augment IL-33–driven type 2 responses. These data position IL-17B as a novel component in the regulation of human type 2 immunity.
Acute myeloid leukemia with complex karyotype (ckAML) is characterized by high genomic complexity, including frequent TP53 mutations and chromothripsis. We hypothesized that the numerous genomic rearrangements could reposition active enhancers near proto-oncogenes, leading to their aberrant expression. We developed pyjacker, a computational tool for the detection of enhancer hijacking events, and applied it to a cohort of 39 ckAML samples. Pyjacker identified motor neuron and pancreas homeobox 1 (MNX1), a gene aberrantly expressed in 1.4% of AML patients, often as a result of del(7)(q22q36) associated with hijacking of a CDK6 enhancer. MNX1-activated cases show significant co-occurrence with BCOR mutations and a gene signature shared with t(7;12)(q36;p13) pediatric AML. We demonstrated that MNX1 is a dependency gene, as its knockdown in a xenograft model reduces leukemia cell fitness. In conclusion, enhancer hijacking is a frequent mechanism for oncogene activation in AML.
DNA methylation is globally reprogrammed after fertilization and the parental genomes have similar DNA methylation profiles after implantation except at the germline differentially methylated regions (gDMRs). We, and others, have previously shown that there are a large number of transient gDMRs in human blastocysts, whose differential methylation is lost in embryonic tissues after implantation. In this study we employed genome-wide allelic DNA methylation analyses of highly purified trophoblast cells from human placentas. ~40% of the transient-in-embryo gDMRs maintained their parent-of-origin-dependent biased allelic methylation. RNA sequencing-based allelic expression analyses revealed that some of the placental-specific gDMRs were associated with novel imprinted genes. This approach identified the first examples of X-linked gDMRs specific to the placenta. These findings highlight the unique regulation of allelic DNA methylation in the human placenta, which is important for understanding normal placental development and the pathogenesis of pregnancy complications.
Interstitial deletion of the long arm of chromosome 5 (del(5q)) is the commonest structural genomic variant in myelodysplastic syndromes (MDS). Lenalidomide (LEN) is the treatment of choice for patients with del(5q) MDS, but half of the responding patients become resistant within two years. TP53 mutations are detected in ~20% of patients who become resistant to LEN. Our data show that patients who become resistant to LEN harbor either TP53 or RUNX1 mutations or loss of RUNX1 expression. Here we show that LEN-induced degradation of IKZF1 permits a RUNX1/GATA2 complex to drive megakaryocytic differentiation and consequent del(5q) MDS progenitor cell death via CRBN-mediated CSNK1A1 degradation. Overexpression of GATA2 is able to restore LEN sensitivity in the context of RUNX1 or TP53 mutations by enhancing LEN-induced megakaryocytic differentiation. Screening for TP53 and RUNX1 mutations or downregulation should identify patients resistant to LEN, and strategies to activate GATA2 may resensitize del(5q) MDS cells to LEN
Diffuse intrinsic pontine glioma (DIPG) is an incurable malignant childhood brain tumour, with no active systemic therapies and a 5-year survival of less than 1%. Polyamines are small organic polycations that are essential for DNA replication, translation and cell proliferation. Ornithine decarboxylase 1 (ODC1), the rate limiting enzyme in polyamine synthesis, is irreversibly inhibited by difluoromethylornithine (DFMO). Herein we show that polyamine synthesis is upregulated in DIPG, leading to sensitivity to DFMO. DIPG cells compensate for ODC1 inhibition by upregulation of the polyamine transporter SLC3A2. Treatment with the polyamine transporter inhibitor AMXT 1501 reduced uptake of polyamines in DIPG cells, and co-administration of AMXT 1501 and DFMO led to potent in vitro activity, and significant extension of survival in three aggressive DIPG orthotopic animal models. Collectively, these results demonstrate the potential of dual targeting of polyamine synthesis and uptake as a therapeutic strategy for incurable DIPG.
We studied whether Vγ9Vδ2-T cells can exploited as autologous effector lymphocytes in chronic lymphocytic leukemia (CLL). Healthy controls (HC) Vγ9Vδ2-T cells were activated by and had potent cytolytic activity against CLL cells, however CLL-derived Vγ9Vδ2-T cells proved dysfunctional with respect to effector cytokine production and cytotxicity. Dysfunction of Vγ9Vδ2-T cells was reversible upon activation with autologous monocyte-derived dendritic cells (mODCs). RNA sequencing confirmed that Vγ9Vδ2-T cells from CLL patients have a transcriptional profile that is distinct from HC Vγ9Vδ2-T cells prior to ex vivo expansion, although the profile after ex vivo expansion is similar between CLL and HC Vγ9Vδ2-T cells. Gene expression profiling implicated alterations in both synapse formation and exhaustion as conceivable contributors to compromised Vγ9Vδ2-T cell function in CLL patients. Taken together, CLL-mediated dysfunction of autologous Vγ9Vδ2-T cells is fully reversible, resulting in potent cytotoxicity towards CLL cells.
Retinoblastoma (RB), the commonest eye cancer in children was the first cancer for which a genetic cause was identified: the Rb1 gene is a tumour suppressor gene that is mutated in RB. The Rb1 gene defect alone does not predict the clinical outcome. We propose to study other possible mechanisms: 1. Stepwise further mutations occur in RB, increasing its carcinogenesis. We will sequence the whole genome in RB tissue, and relate the different genes expressed to the treatments used. 2. Extracellular matric proteins contribute to a tumour permissive environment for RB to continue to grow. This includes Samll Leucine Rich Proteoglycans (SLRP), a family of 15 secreted extracellular matrix proteins involved in eye development. 3. Cancer stem cells (CSC), a subpopulation of treatment resistant cells, drive RB tumours, and whether these stem cells can be manipulated for new therapies. The aim of this study is to assist finding targeted diagnostic techniques and treatments for RB.
Whole Exome Sequencing of Gingivo-buccal Cancer: ICGC-India ProjectAs a part of the ICGC, India has undertaken genomic studies on gingivobuccal cancer of the oral cavity, which is the most prevalent form of cancer among men in India. There are various known environmental (life-style) correlates of this cancer, the most important of which are tobacco chewing and HPV infection. Exome of paired DNA samples – isolated from the tumor tissue and from the blood of 47 patients – have been performed to catalog germline and somatic mutations. Association between each observed genomic alteration and exposure to environmental risk factors is being explored. Detailed clinical characterization of the patients, collection of data on demographic and environmental exposures, and isolation of DNA samples from blood and tumor tissues collected from each patient are being done at the Advanced Centre for Research, Treatment and Education on Cancer, Mumbai. Whole exome sequencing is being performed at the National Institute of Biomedical Genomics, Kalyani.
Data from a study of a large Iberian family (n=65 subjects, 5 generations) affected by pulmonary arterial hypertension (PAH) and segregating with the BMPR2 missense mutation p.Arg491Gln (rs137852749, c.1472G>A). PAH is a rare disease characterized by an abnormal rise in mean pulmonary arterial pressure (> or equal to 25 mmHg at rest), which, in turn, leads to a progressive increase in pulmonary vascular resistance and ultimately to death, due to right ventricular failure. Heritable PAH has an overall prevalence below 1 case per million adults and is defined by either the presence of a known genetic defect linked to the disease or a positive family history. Heritable PAH is inherited as an autosomal dominant disease. However, not all BMPR2 mutation carriers develop the disease, highlighting the presence of reduced penetrance. In this family, there are 22 mutation carriers from which 8 were diagnosed with heritable PAH and the other 14 were healthy at the time of examination.
Prostate cancer is a highly heterogeneous disease that is thought to develop over many years. Identifying the earliest somatic changes can give important insights into the tumour evolution and aid in stratifying high- from low-risk diseases. Here we pursued integrative whole-genome, transcriptome and methylome-based analysis of early-onset prostate cancer patients. Characterisation of genomic alterations across 270 PCa tumours revealed age-related genomic alterations and mutation signatures including BRCAness and APOBEC. We used methylation- and expression-data to identify four molecular subgroups, which included a highly aggressive tumour subgroup that frequently involved recurrent duplications and increased expression of ESRP1. Analysis of 12,000 tissue-microarray tumour samples demonstrated that ESRP1 is a candidate biomarker associated with faster proliferation rate and shorter time to relapse. We combine the patterns of molecular co-occurrence, risk-stratification and subgroup information to deconvolute tumor heterogeneity, which reveal complex but recurrent clinical trajectories of prostate cancer.
The RAS/MAPK pathway is an emerging targeted pathway across a spectrum of both adult and pediatric cancers. Typically, this is associated with a single, well-characterized point mutation in an oncogene. Hypermutant tumors which harbor many somatic mutations may obscure the interpretation of such targetable genomic events. We find that replication repair deficient (RRD) cancers which are universally hypermutant and affect children born with RRD cancer predisposition, are enriched for RAS/MAPK mutations (p=10-8). These mutations are not random, exist in subclones, and increase in allelic frequency over time. The RAS/MAPK pathway is activated both transcriptionally and at the protein level in patient derived RRD tumors and these tumors responded to MEK inhibition in vitro and in vivo. Treatment of patients with RAS/MAPK hypermutant gliomas reveal durable responses to MEK inhibition. Our observations suggest that hypermutant tumors may be addicted to oncogenic pathways resulting in favorable response to targeted therapies.
Systematic interrogation of single nucleotide variations (SNVs) is one of the most promising approaches to delineate the cellular heterogeneity and phylogenetic relationships at the single cell level. While SNV detection from abundant single cell RNA sequencing (scRNA-seq) data is applicable and cost-effective in identifying expressed variants, inferring sub-clones, and deciphering genotype-phenotype linkages, there is a lack of computational methods specifically developed for SNV calling in scRNA-seq. Although variant callers for bulk RNA-seq have been sporadically used in scRNA-seq, the performances of different tools have not been assessed. Here, we performed a systematic comparison of seven tools including SAMtools, the GATK pipeline, CTAT, FreeBayes, MuTect2, Strelka2 and VarScan2, using both simulation and scRNA-seq datasets, and identified multiple elements influencing their performance. Our study provided the first benchmarking to evaluate the performances of different SNV detection tools for scRNA-seq data.
Aristolochic Acids (AAs) are a family of carcinogenic phytochemical compounds commonly found in plants of Aristolochia and Asarum genus. Comprehensive genomic profiling of genitourinary and hepatobiliary cancers has highlighted the widespread prevalence of Aristolochic Acid (AA) signatures in cancer patients across parts of Asia, particularly in Taiwan. The aim of our study was to determine in Oro-Gastrointestinal Tract (OGITC) cancers, the prevalence, role and significance that AA plays as a driver of tumorigenesis as AA containing products are commonly administered orally. This would suggest a possible etiological relationship between cancers of OGITC. However, in this study the rarity of AA mutational signatures in OGITC suggests that AA is unlikely to drive carcinogenesis in OGITC through direct exposure. Our study is valuable to show that AA exposure is not an equal driver of tumorigenesis in different organs and represents an important piece of information in the field.
Approximately 4% of colorectal cancer (CRC) patients have at least two simultaneous cancers in the colon. Due to the shared environment, these synchronous CRCs (SCRCs) provide a unique setting to study colorectal carcinogenesis. Understanding whether these tumors are genetically similar or distinct is essential when designing therapeutic approaches. We performed exome sequencing of 47 primary cancers and corresponding normal samples from 23 patients. Additionally, we carried out a comprehensive mutational signature analysis to assess whether tumors had undergone similar mutational processes and the first immune cell score analysis (IS) of SCRC to analyze the interplay between immune cell invasion and mutation profile in both lesions of an individual. The tumor pairs shared only few mutations, favoring different mutations in known CRC genes and signaling pathways, and displayed variation in their signature content. Two tumor pairs had discordant mismatch repair statuses. In majority of the pairs, IS varied between primaries. Differences were not explained by any clinicopathological variable or mutation burden.
Whether neurodevelopmental defects underlie the selective neuronal death that characterizes neurodegenerative diseases is becoming an intriguing question. To address it, we focused on the motor neuron (MN) disease Spinal Muscular Atrophy (SMA), caused by reduced levels of the ubiquitous protein SMN. Taking advantage of the first isogenic human induced pluripotent stem cell-derived SMA model that we have generated and a spinal cord organoid system, here we report that the relative and temporal expression of early neural progenitor and MN markers is altered in SMA. Furthermore, the corrected isogenic controls only partially reverse these abnormalities. These findings raise the relevant clinical implication that SMN-increasing treatments might not fully amend SMA pathological phenotypes. The approach we have taken demonstrates that the discovery of new disease mechanisms is greatly improved by using human isogenic models. Moreover, our study implies that SMA has a developmental component that might trigger the MN degeneration.
This study is a part of NHGRI's Centers for Common Disease Genomics (CCDG), which is a collaborative large-scale genome sequencing effort to comprehensively identify rare risk and protective variants contributing to multiple common disease phenotypes. Current estimates anticipate that the CCDG program will sequence approximately 140K whole genomes and 225K whole exomes during the life of the project. The Cardiovascular Disease working group of the CCDG considered five diseases: early-onset coronary artery disease (EOCAD), stroke, atrial fibrillation (AF), congestive heart failure and type 2 diabetes. AF will affect between 6-12 million individuals in the US by 2050. AF is also associated with increased risks of stroke, dementia, heart failure, death, and high health care costs. Many risk factors for AF have been identified, including advancing age, cardiovascular disease (CVD), and CVD risk factors. However, there is little knowledge how to prevent AF. Furthermore, therapies for AF are only partially effective, and are themselves associated with substantial morbidity. Previously, heritable forms of AF have been considered rare; yet in the last decade, it has been established that AF, and in particular early-onset forms of AF, are heritable. Genome-wide association studies (GWAS) provide a powerful tool to identify common variants underlying disease risk. The AFGen Consortium currently consists of investigators from more than 25 studies with >20,000 individuals with AF and >100,000 without AF. In the latest analyses, 14 loci have been identified for AF1. Broadly, the loci implicate genes related to cardiopulmonary development, cardiac-expressed ion channels, and cell signaling molecules. Source: https://ccdg.rutgers.edu/sites/default/files/CCDG_CVD_EOAF_FINAL_w_link.pdf Three collections were brought together to form Swiss atrial fibrillation cases: SWISS-AF, SWISS-AF-PVI, and BEAT-AF. SWISS-AF is a prospective observational, multicentric cohort study in Switzerland to increase our knowledge on structural brain damage and its changes over time in patients with atrial fibrillation. Overall, 2400 patients with documented atrial fibrillation aged >65 years will be included and followed on a yearly basis. SWISS-AF-PVI is a registry of patients with atrial fibrillation undergoing catheter-based ablation of atrial fibrillation (pulmonary vein isolation) to determine long-term success rates of catheter-based ablation of atrial fibrillation (pulmonary vein isolation) and to evaluate factors associated with long-term success of catheter-based ablation of atrial fibrillation (pulmonary vein isolation). BEAT-AF is a prospective observational, multicenter cohort study in Switzerland. Overall, 1550 patients with documented atrial fibrillation were enrolled in seven study centers. The main aim of this study is to investigate the therapy and the medical consequences of patients with atrial fibrillation. Samples from all three collections were sent to The Broad Institute of MIT and Harvard to generate genetic data using whole exome sequencing and whole genome genotyping (Infinium Global Screening Array); all cases with atrial fibrillation.
The Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) is a collaborative effort comprised of a coordinating center and scientific researchers from well-characterized cohort and case-control studies conducted in North America and Europe. This international consortium aims to accelerate the discovery of common and rare genetic risk variants for colorectal cancer by conducting large-scale meta-analyses of existing and newly generated genome-wide association study (GWAS) data, replicating and fine-mapping of GWAS discoveries, and investigating how genetic risk variants are modified by environmental risk factors. To expand these efforts, we assembled case-control sets or nested case-control sets from 20 different North American or European studies. Summary descriptions and study participant inclusions/exclusion criteria for each of these studies are detailed below. The Black Women's Health Study (BWHS): Is the largest follow-up study of the health of African-American women (Cozier et al., 2004; Rosenberg et al., 1995) [PMID: 15018884; PMID: 7722208]. The purpose is to identify and evaluate causes and preventives of cancers and other serious illnesses in African-American women. Among the diseases being studied are breast cancer, colorectal cancer, type 2 diabetes, uterine fibroids, systemic lupus erythematosus, and cardiovascular disease. The study began in 1995, when 59,000 black women from all parts of the United States enrolled through postal questionnaires. The women provided demographic and health data on the 1995 baseline questionnaire, including information on weight, height, smoking, drinking, contraceptive use, use of other selected medications, illnesses, reproductive history, physical activity, diet, use of health care, and other factors. The participants are followed through biennial questionnaires to determine the occurrence of cancers and other illnesses and to update information on risk factors. Self-reports of cancer are confirmed through medical records and state cancer registry records. Mouthwash-swish samples, as a source of DNA, were obtained from ~26,000 BWHS participants in 2002-2007. DNA was isolated from the mouthwash-swish samples at the Boston University Molecular Core Genetics Laboratory using the QIAAMP DNA Mini Kit (Qiagen). All incident colorectal cancer cases with a DNA sample were included in the present analysis. Two controls per case, selected from among BWHS participants free of colorectal cancer at end of follow-up, were matched to cases on year of birth (+/- 2 years) and geographical region of residence (Northeast, South, Midwest, and West). A total 209 colorectal cancer cases and 423 controls were sent for genotyping. Campaign Against Cancer and Heart Disease (CLUE II): The Campaign Against Cancer and Heart Disease, is a prospective cohort designed to identify biomarkers and other factors associated with risk of cancer, heart disease, and other conditions (Kakourou et al., 2015) [PMID: 26220152]. 32,894 participants were recruited from May through October 1989 from Washington County, Maryland and surrounding communities. Colorectal cancer cases (n = 297) and matched controls (n = 296) were identified between 1989 and 2000 among participants in the CLUE II cohort of Washington County, Maryland. Colorectal Cancer Study of Austria (CORSA): In the ongoing colorectal cancer study of Austria (CORSA), more than 13,000 Caucasian participants have been recruited within the province-wide screening project "Burgenland Prevention Trial of Colorectal Disease with Immunological Testing" (B-PREDICT) since 2003 (Hofer et al., 2011) [PMID: 21422235]. All inhabitants of the Austrian province Burgenland aged between 40 and 80 years are annually invited to participate in fecal immunochemical testing and haemoccult positive screening participants are invited for colonoscopy. CORSA includes genomic DNA and plasma of colorectal cancer cases, low-risk and high-risk adenomas, and colonoscopy-negative controls. Controls received a complete colonoscopy and were free of colorectal cancer or polyps. CORSA participants have been recruited in the four KRAGES hospitals in Burgenland, Austria, and additionally, at the Medical University of Vienna (Department of Surgery), the Viennese hospitals "Rudolfstiftung" and the "Sozialmedizinisches Zentrum Sud", and at the Medical University of Graz (Department of Internal Medicine). 1403 colorectal cancer and advanced colorectal adenoma cases, and 1404 matched controls were selected for the study. Distribution of factors sex and age (5 year strata) were evenly matched between cases and controls. Cancer Prevention Study II (CPS II): The CPS II Nutrition cohort is a prospective study of cancer incidence and mortality in the United States, established in 1992 and described in detail elsewhere (Calle et al., 2002; Campbell et al., 2014) [PMID: 12015775; PMID: 25472679]. At enrollment, participants completed a mailed self-administered questionnaire including information on demographic, medical, diet, and lifestyle factors. Follow-up questionnaires to update exposure information and to ascertain newly diagnosed cancers were sent biennially starting in 1997. Reported cancers were verified through medical records, state cancer registry linkage, or death certificates. The Emory University Institutional Review Board approves all aspects of the CPS II Nutrition Cohort. A total of 360 cases and 359 controls were selected for this study. Czech Republic Colorectal Cancer Study (Czech Republic CCS): Cases with positive colonoscopy results for malignancy, confirmed by histology as colon or rectal carcinomas, were recruited between September 2003 and May 2012 in several oncological departments in the Czech Republic (Prague, Pilsen, Benesov, Brno, Liberec, Ples, Pribram, Usti and Labem, and Zlin). Two control groups, sampled at the same time of cases recruitment, were included in the study. The first group consisted of hospital-based individuals with a negative colonoscopy result for malignancy or idiopathic bowel diseases. The reasons for the colonoscopy were: i) positive fecal occult blood test, ii) hemorrhoids, iii) abdominal pain of unknown origin, and iv) macroscopic bleeding. The second control group consisted of healthy blood donor volunteers from a blood donor center in Prague. All individuals were subjected to standard examinations to verify the health status for blood donation and were cancer-free at the time of the sampling. Details of CRC cases and controls have been reported previously (Vymetalkova et al., 2014; Naccarati et al., 2016; Vymetalkova et al., 2016) [PMID: 24755277; PMID: 26735576; PMID: 27803053]. All subjects were informed and provided written consent to participate in the study. They approved the use of their biological samples for genetic analyses, according to the Declaration of Helsinki. The design of the study was approved by the Ethics Committee of the Institute of Experimental Medicine, Prague, Czech Republic. All subjects included in the study were Caucasians and comprised 1792 cases and 1764 matched controls. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age and sex. Age was matched on +-5 years, whereas sex was matched exactly. For the cases without matched controls, matching was done only on sex. Early Detection Research Network (EDRN): The aim of the EDRN initiative is to develop and sustain a biorepository for support of translational research (Amin et al., 2010) [PMID: 21031013]. High-quality biospecimens were accrued and annotated with pertinent clinical, epidemiologic, molecular and genomic information. A user-friendly annotation tool and query tool was developed for this purpose. The various components of this annotation tool include: CDEs are developed from the College of American Pathologists (CAP) Cancer Checklists and North American Association of Central Cancer Registries (NAACR) standards. The CDEs provides semantic and syntactic interoperability of the data sets by describing them in the form of metadata or data descriptor. A total of 352 colorectal case samples and 399 controls were selected for this study. Controls were matched to CRC cases based on age and sex. The EPICOLON Consortium (EPICOLON): The EPICOLON Consortium comprises a prospective, multicentre and population-based epidemiology survey of the incidence and features of CRC in the Spanish population (Fernandez-Rozadilla et al., 2013) [PMID: 23350875]. Cases were selected as patients with de novo histologically confirmed diagnosis of colorectal adenocarcinoma. Patients with familial adenomatous polyposis, Lynch syndrome or inflammatory bowel disease-related CRC, and cases where patients or family refused to participate in the study were excluded. Hospital-based controls were recruited through the blood collection unit of each hospital, together with cases. All of the controls were confirmed to have no history of cancer or other neoplasm and no reported family history of CRC. Controls were randomly selected and matched with cases for hospital, sex and age (+- 5 years). A total of 370 cases and 370 controls were selected for genotyping. Hawaii Adenoma Study: For this adenoma study, two flexible-sigmoidoscopy screening clinics were first used to recruit participants on Oahu, Hawaii. Adenoma cases were identified either from the baseline examination at the Hawaii site of the Prostate Lung Colorectal and Ovarian cancer screening trial during 1996-2000 or at the Kaiser Permanente Hawaii's Gastroenterology Screening Clinic during 1995-2007. In addition, starting in 2002 and up to 2007, we also approached for recruitment all eligible patients who underwent a colonoscopy in the Kaiser Permanente Hawaii Gastroenterology Department. Cases were patients with histologically confirmed first-time adenoma(s) of the colorectum and were of Japanese, Caucasian or Hawaiian race/ethnicity. Controls were selected among patients with a normal colorectum and were individually matched to the cases on age at exam, sex, race/ethnicity, screening date (+-3 months) and clinic and type of examination (colonoscopy or flexible sigmoidoscopy). We recruited 1016 adenoma cases (67.8% of all eligible) and 1355 controls (69.2% of all eligible); 889 cases and 1169 controls agreed to give a blood and 29 cases and 34 controls, a mouthwash sample. A total of 989 cases and 1185 controls were genotyped for this study. Columbus-area HNPCC Study (HNPCC, OSUMC): Patients with colorectal adenocarcinoma diagnosed at six participating hospitals were eligible for this study, regardless of age at diagnosis or family history of cancer. Patients with a clinical diagnosis of familial adenomatous polyposis were not eligible for this study. These six hospitals perform the vast majority of all operations for CRC in the Columbus metropolitan area (population 1.7 million). The institutional review board at all participating hospitals approved the research protocol and consent form in accordance with assurances filed with and approved by the United States Department of Health and Human Services. Briefly, during the period of January 1999 through August 2004, 1,566 eligible patients with CRC were accrued to the study (Hampel et al., 2008) [PMID 18809606]. A total of 1472 colorectal cancer samples had enough blood DNA remaining to be sent for genotyping. Control samples were provided by the Ohio State University Medical Center%#39;s (OSUMC) Human Genetics Sample Bank. The Columbus Area Controls Sample Bank is a collection of control samples for use in human genetics research that includes both donors' anonymized biological specimens and linked phenotypic data. The data and samples are collected under the protocol "Collection and Storage of Controls for Genetics Research Studies", which is approved by the Biomedical Sciences Institutional Review Board at OSUMC. Recruitment takes place in OSUMC primary care and internal medicine clinics. If individuals agree to participate, they provide written informed consent, complete a questionnaire that includes demographic, medical and family history information, and donate a blood sample. 4-7 ml of blood is drawn into each of 3 ACD Solution A tubes and is used for genomic DNA extraction and the establishment of an EBV-transformed lymphoblastoid cell culture, cell pellet in Trizol, and plasma. Controls were matched to CRC cases as 1:1. Matching was done on age at reference time (age_ref), race, and sex. Age_ref was matched on +-5 years. Sex and race were matched exactly. For the cases without matched controls, matching was done only on sex and race with 1:1 ratio. Since controls are fewer than cases, one control is matched on 2 cases at most. Health Professionals Follow-up Study (HPFS): A parallel prospective study to the NHS (Nurses' Health Study). The HPFS cohort comprised 51,529 men aged 40-75 who, in 1986, responded to a mailed questionnaire (Rimm et al., 1990) [PMID: 2090285]. Participants provided information on health related exposures, including current and past smoking history, age, weight, height, diet, physical activity, aspirin use, and family history of colorectal cancer. Colorectal cancer and other outcomes were reported by participants or next-of-kin and were followed up through review of the medical and pathology record by physicians. Overall, more than 97% of self-reported colorectal cancers were confirmed by medical record review. Information was abstracted on histology and primary location. Incident cases were defined as those occurring after the subject provided the blood sample. Prevalent cases were defined as those occurring after enrollment in the study but before the subject provided the blood sample. Follow-up evaluation has been excellent, with 94% of the men responding to date. Colorectal cancer cases were ascertained through January 1, 2008. In 1993-1995, 18,825 men in the HPFS mailed blood samples by overnight courier, which were aliquoted into buffy coat and stored in liquid nitrogen. In 2001-2004, 13,956 men in the HPFS who had not provided a blood sample previously mailed in a swish-and-spit sample of buccal cells. Incident cases were defined as those occurring after the subject provided a blood or buccal sample. Prevalent cases were defined as those occurring after enrollment in the study in 1986, but before the subject provided either a blood or buccal sample. After excluding participants with histories of cancer (except nonmelanoma skin cancer), ulcerative colitis, or familial polyposis, case-control sets were previously constructed. In addition to colorectal cancer cases and controls, a set of adenoma cases and matched controls with available DNA from buffy coat were selected for genotyping. Over the follow-up period, data were collected on endoscopic screening practices and, if individuals had been diagnosed with a polyp, the polyps were confirmed to be adenomatous by medical record review. Adenoma cases were ascertained through January 1, 2008. A separate case-control set was constructed of participants diagnosed with advanced adenoma matched to control participants who underwent a lower endoscopy in the same time period and did not have an adenoma. Advanced adenoma was defined as an adenoma 1 cm or larger in diameter and/or with tubulovillous, villous, or highgrade dysplasia/carcinoma-in-situ histology. Matching criteria included year of birth (within 1 year) and month/ year of blood sampling (within 6 months), the reason for their lower endoscopy (screening, family history, or symptoms), and the time period of any prior endoscopy (within 2 years). Controls matched to cases with a distal adenoma either had a negative sigmoidoscopy or colonoscopy examination, and controls matched to cases with proximal adenoma all had a negative colonoscopy. In total, 159 advanced adenoma cases and 109 controls were selected for genotyping. Leeds Colorectal Cancer Study (LCCS): Following local ethical approval, colorectal cancer cases were recruited from 1997 until 2012 in Leeds, UK through surgical clinics. Initially, funding was provided by the UK Ministry of Agriculture, Farming and Fisheries (subsequently the Food Standards Agency) and Imperial Cancer Research Fund (subsequently Cancer Research UK). Recruitment also occurred similarly in Dundee, Perth and York between the periods of 1997 and 2001 using the same protocol and the data and samples were combined. Pathologically confirmed cases were consented at outpatient clinics, providing information on known and postulated risk factors for colorectal cancer (diet, lifestyle and family history) as well as providing a blood sample for DNA. Exclusion criteria included pre-existing diverticular disease and an inability to complete the questionnaire. The General Practitioners of cases (all UK residents have a nominated General Practitioner to whom to refer initial medical queries) and these GPs were asked to send letters to other persons on their patient list of the same gender and born within 5 years of the case. Subsequently to enhance the number of controls, we systematically invited patients from selected GP practices. Diet was assessed in cases and controls using an extensive dietary and lifestyle questionnaire modified by that produced by the European Prospective Investigation in Cancer (EPIC). The frequency that each specific food items were eaten was recorded and we also obtained average fruit and vegetable consumption as a cross-check. In total, 1591 cases and 739 controls provided a DNA sample. The North Carolina Colon Cancer Studies (NCCCS I/II): The North Carolina Colon Cancer Studies (NCCCS I- colon and NCCCS II-rectal) were population-based case-control studies conducted in 33 counties of North Carolina. Cases were identified using the rapid case ascertainment system of the North Carolina Central Cancer Registry. Patients with a first diagnosis of histologically confirmed invasive adenocarcinoma of the colon (cecum through sigmoid colon) between October 1996 and September 2000 were classified as potential cases in the NCCCS I. The NCCCS II included patients with a first diagnosis of histologically confirmed invasive adenocarcinoma of the sigmoid colon, rectosigmoid, or rectum (hereafter collectively referred to as rectal cancer) between May 2001 and September 2006. Additional eligibility requirements were: aged 40-80 years, residence in one of the 33 counties, ability to give informed consent and complete an interview, had a driver's license or identification card issued by the North Carolina Department of Motor Vehicles (if under the age of 65), and had no objections from the primary physician in regards to contacting the individual. Controls, identified and sampled during the respective study dates, were selected from two sources. Potential controls under the age of 65 were identified using the North Carolina Department of Motor Vehicles records. For those 65 years and older, records from the Center for Medicare and Medicaid Services were used. Controls were matched to cases using randomized recruitment strategies. Recruitment probabilities were done using strata of 5-year age, sex, and race groups. Dietary information was collected using a modified version of the semiquantitative food frequency questionnaire developed at the National Cancer Institute. In addition, participants were asked about vitamin and mineral supplementation, special diets, restaurant eating, sodium use, and fats used in cooking. In NCCCS I, 515 colorectal cases and 687 matched controls were sent for genotyping. In NCCCS II, 796 colorectal cases and 823 controls were sent from the NCCCS II for genotyping. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age, race, and sex. Age was matched on +-5 years. Race and sex was matched exactly. For the cases without matched controls, matching was done only on sex and race. Nurses Health Study (NHS): The NHS cohort began in 1976 when 121,700 married female registered nurses age 30-55 years returned the initial questionnaire that ascertained a variety of important health-related exposures (Belanger et al., 1978) [PMID: 248266]. Since 1976, follow-up questionnaires have been mailed every 2 years. Colorectal cancer and other outcomes were reported by participants or next-of-kin and followed up through review of the medical and pathology record by physicians. Overall, more than 97% of self-reported colorectal cancers were confirmed by medical-record review. Information was abstracted on histology and primary location. The rate of follow-up evaluation has been high: as a proportion of the total possible follow-up time, follow-up evaluation has been more than 92%. Colorectal cancer cases were ascertained through June 1, 2008. In 1989 -1990, 32,826 women in NHS I mailed blood samples by overnight courier, which were aliquoted into buffy coat and stored in liquid nitrogen. In 2001-2004, 29,684 women in NHS I who did not previously provide a blood sample mailed a swish-and-spit sample of buccal cells. Incident cases were defined as those occurring after the subject provided a blood or buccal sample. Prevalent cases were defined as those occurring after enrollment in the study in 1976 but before the subject provided either a blood or buccal sample. After excluding participants with histories of cancer (except nonmelanoma skin cancer), ulcerative colitis, or familial polyposis, case-control sets were previously constructed from which DNA was isolated from either buffy coat or buccal cells for genotyping. In addition to colorectal cancer cases and controls, a set of advanced adenoma cases and matched controls with available DNA from buffy coat were selected for genotyping. Over the follow-up period, data were collected on endoscopic screening practices and, if individuals had been diagnosed with a polyp, the polyps were confirmed to be adenomatous by medical record review. Adenoma cases were ascertained through June 1, 2011. A separate case-control set was constructed of participants diagnosed with advanced adenoma matched to control participants who underwent a lower endoscopy in the same time period and did not have an adenoma. Advanced adenoma was defined as an adenoma more than 1 cm in diameter and/or with tubulovillous, villous, or high-grade dysplasia/carcinoma-in-situ histology. Matching criteria included year of birth (within 1 year) and month/year of blood sampling (within 6 months), the reason for their lower endoscopy (screening, family history, or symptoms), and the time period of any prior endoscopy (within 2 years). Controls matched to cases with a distal adenoma either had a negative sigmoidoscopy or colonoscopy examination, and controls matched to cases with proximal adenoma all had a negative colonoscopy. A total of 272 cases and 236 matched controls were sent to CIDR for the advanced adenoma case-control set. Northern Swedish Health and Disease Study (NSHDS): Comprises over 110,000 participants, including approximately one third with repeated sampling occasions, from three population-based cohorts (Dahlin et al., 2010; Myte et al., 2016) [PMID: 20197478; PMID: 27367522]. The largest is the ongoing Vasterbotten Intervention Programme, in which all residents of Vasterbotten County are invited to a health examination upon turning 30 (some years), 40, 50 and 60 years of age. Extensive measured and self-reported health and lifestyle data, as well as blood samples for central biobanking in Umea, Sweden, are collected at the health exam. Leucocyte DNA samples for 1:1-matched CRC case-control sets from the NSHDS, of which 878 samples are included in this study, have been selected for genotyping. This is in addition to 354 samples from the NSHDS previously analyzed as part of the multicenter EPIC cohort. Cancer-specific and overall survival data are available for all patients. For at least 425 patients, archival tumor tissue has been analyzed for the BRAF V600E mutation and by sequencing codon 12 and 13 for KRAS mutations, as well as for MSI screening status by immunohistochemistry and for an eight-gene CIMP panel using quantitative real-time PCR (MethyLight). Ohio Colorectal Cancer Prevention Initiative (OCCPI, OSUMC): OCCPI (ClinicalTrials.gov identifier: NCT01850654) is a population-based study of colorectal cancer patients diagnosed in one of 51 hospitals throughout the state of Ohio from January 1, 2013 through December 31, 2016. The OCCPI was created to decrease CRC incidence in Ohio by identifying patients with hereditary predisposition (statewide universal tumor screening for newly diagnosed CRC patients), increase colonoscopy compliance for first-degree relatives of CRC patients, and encourage future research through the creation of a biorepository. The 51 Ohio hospitals participating in the OCCPI were selected to represent a cross-section of clinical centers in the state based on high reported volume of CRC patients, affiliation with a high volume hospital, or interest in participation. Institutional Review Board (IRB) approval was obtained by the individual hospitals, Community Oncology Programs, or by ceding review to the OSU IRB. Written informed consent was obtained. A total of 2139 colorectal cases were genotyped. Patients were considered eligible for this study if they were age 18 or older at the time of enrollment, if they had a surgical resection (or biopsy if unresectable) in the state of Ohio demonstrating an adenocarcinoma of the colorectum from 1/1/13 - 12/31/16. Matched control samples were selected from the Ohio State University Medical Center's (OSUMC) Human Genetics Sample Bank in an identical way to the selection for the Columbus-area HNPCC Study (please refer to the description for the Columbus-area HNPCC Study). Prostate, Lung, Colorectal and Ovarian Cancer Screening Trail (PLCO): PLCO enrolled 154,934 participants (men and women, aged between 55 and 74 years) at ten centers into a large, randomized, two-arm trial to determine the effectiveness of screening to reduce cancer mortality. Sequential blood samples were collected from participants assigned to the screening arm. Participation was 93% at the baseline blood draw. In the observational (control) arm, buccal cells were collected via mail using the "swish-and-spit" protocol and participation rate was 65%. Details of this study have been previously described (Huang et al., 2016) [PMID: 27673363] and are available online (http://dcp.cancer.gov/plco). For this study 1651 advanced adenoma cases and 1392 controls were selected for genotyping. Selenium and Vitamin E Prevention Trial (SELECT): The Selenium and Vitamin E Cancer Prevention Trial (SELECT) was a double-blind, placebo controlled clinical trial which explored using selenium and vitamin E alone and in combination to prevent prostate cancer in healthy men (Lippman et al., 2009) [PMID: 19066370]. Secondary endpoints included the prevention of colorectal and lung cancers. SELECT was conducted at 427 sites and centers in the United States, Canada and Puerto Rico; 35,533 men 55 years and older (50 or older if African American) were randomized beginning August 22, 2001. Supplementation was discontinued on October 23, 2008 due to futility. 308 colorectal cancer cases and 308 matched controls were selected from the SELECT population and sent for genotyping. Screening Markers For Colorectal Disease Study and Colonoscopy and Health Study (SMS-REACH): Details on this study population were previously reported (Burnett-Hartman et al., 2014) [PMID: 24875374]. Participants were enrollees in an integrated health-care delivery system in western Washington State (Group Health Cooperative, Seattle, Washington) aged 24-79 years who underwent an index colonoscopy for any indication between 1998 and 2007 and donated a buccal-cell or blood sample for genotyping analysis. Study recruitment took place in 2 phases, with phase 1 occurring in 1998-2003 and phase 2 occurring in 2004-2007. Persons who had undergone a colonoscopy less than 1 year prior to the index colonoscopy, persons with inadequate bowel preparation for the index colonoscopy, and persons with a prior or new diagnosis of colorectal cancer, a familial colorectal cancer syndrome (such as familial adenomatous polyposis), or another colorectal disease were ineligible. Patients diagnosed with adenomas or serrated polyps and persons who were polyp-free at the index colonoscopy (controls) were systematically recruited during both phases of recruitment. Approximately 75% agreed to participate and provided written informed consent. Based on medical records, persons who agreed to participate and those who refused study participation were similar with respect to age, sex, and colorectal polyp status. Study protocols were approved by the institutional review boards of the Group Health Cooperative and the Fred Hutchinson Cancer Research Center (Seattle, Washington). A total of 575 cases and 508 matched were selected for the study. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age_ref, race, and sex. Age_ref was matched on +-5 years. The Women's Health Initiative (WHI): WHI is a long-term national health study that has focused on strategies for preventing heart disease, breast and colorectal cancer, and osteoporotic fractures in postmenopausal women. The original WHI study included 161,808 postmenopausal women enrolled between 1993 and 1998. The Fred Hutchinson Cancer Research Center in Seattle, WA serves as the WHI Clinical Coordinating Center for data collection, management, and analysis of the WHI. The WHI has two major parts: a partial factorial randomized Clinical Trial (CT) and an Observational Study (OS); both were conducted at 40 Clinical Centers nationwide. The CT enrolled 68,132 postmenopausal women between the ages of 50-79 into trials testing three prevention strategies. If eligible, women could choose to enroll in one, two, or all three of the trial components. The components are: Hormone Therapy Trials (HT): This double-blind component examined the effects of combined hormones or estrogen alone on the prevention of coronary heart disease and osteoporotic fractures, and associated risk for breast cancer. Women participating in this component with an intact uterus were randomized to estrogen plus progestin (conjugated equine estrogens [CEE], 0.625 mg/d plus medroxyprogesterone acetate [MPA] 2.5 mg/d] or a matching placebo. Women with prior hysterectomy were randomized to CEE or placebo. Both trials were stopped early, in July 2002 and March 2004, respectively, based on adverse effects. All HT participants continued to be followed without intervention until close-out. Dietary Modification Trial (DM): The Dietary Modification component evaluated the effect of a low-fat and high fruit, vegetable and grain diet on the prevention of breast and colorectal cancers and coronary heart disease. Study participants were randomized to either their usual eating pattern or a low-fat dietary pattern. Calcium/Vitamin D Trial (CaD): This double-blind component began 1 to 2 years after a woman joined one or both of the other clinical trial components. It evaluated the effect of calcium and vitamin D supplementation on the prevention of osteoporotic fractures and colorectal cancer. Women in this component were randomized to calcium (1000 mg/d) and vitamin D (400 IU/d) supplements or a matching placebo. The Observational Study (OS)examines the relationship between lifestyle, environmental, medical and molecular risk factors and specific measures of health or disease outcomes. This component involves tracking the medical history and health habits of 93,676 women not participating in the CT. Recruitment for the observational study was completed in 1998 and participants were followed annually for 8 to 12 years. All centrally confirmed cases of invasive colorectal cancers, or deaths from colorectal cancer were selected as potential cases from September 30, 2015 database. Controls were participants free of colorectal cancer (invasive or in situ) as of September 30, 2015. Potential cases and controls were excluded if they (1) were non-White; (2) had history of colorectal cancers at baseline; (3) lost to follow-up after enrollment; (4) DbGAP ineligible; (5) had <1.25ug of DNA; (6) selected for WHI study M26 Phase I or II; (7) selected for WHI study AS224 and also included in the imputation project. A total of 578 cases and 104,429 controls met the eligibility criteria. Each case was matched with 1 control (1:1) that exactly met the following matching criteria: age (+-5 years), 40 randomization centers (exact), WHI date (+-3 years), CaD date (+-3 years), OS flag (exact), HRT assignments (exact), DM assignments (exact), and CaD assignments (exact). Control selection was done in a time-forward manner, selecting one control for each case from the risk set at the time of the case's event. The matching algorithm was allowed to select the closest match based on a criteria to minimize an overall distance measure (Bergstralh EJ, Kosanke JL. Computerized matching of cases to controls. Technical Report #56, Department of Health Sciences Research, Mayo Clinic, Rochester MN. April 1995). Each matching factor was given the same weight. When exact matches could not be found, the matching criteria were gradually relaxed among unmatched cases and controls until all cases had found matched controls. Using the matching criteria specified above, 559 of the 578 eligible cases found exact matches. The matching criteria was then relaxed to : Age+-5, randomization centers, WHI date +- 3 years, CaD date +- 3 years, OS flag, HRT flag, DM flag, CaD flag. 17 of the remaining 19 unmatched cases found matched controls. By matching on Age+-5, randomization centers, WHI date +- 3 years, CaD date +- 3 years, OS flag, HRT flag, the remaining 2 unmatched cases found their matches.
NCI-H209 is an immortal cell line derived from a bone marrow metastasis of a patient with small cell lung cancer, taken before chemotherapy. The specimen showed histologically typical small cells with classic neuroendocrine features. NCI-BL209 is an EBV-transformed B-cell line derived from the same patient as the small cell lung cancer cell line, NCI-H209
This is a comprehensive genomic analysis of a rare set of secondary histiocytic/dendritic sarcomas from the consult service of the Hematopathology section of the Laboratory of Pathology, NCI, NIH.
We performed whole exome sequencing of bone marrow monoclear cells derived from a cold agglutinin disease patient. Aim of our study is to elucidate the pathogenesis of hemolytic anemia.