ATAC-seq dataset for the paper: Title Multi-omics analysis of human population variation in immune function and in vivo response to BCG vaccination Abstract Immune responses are tightly regulated, yet highly variable between individuals. To investigate human population variation of trained immunity, we immunized healthy individuals with Bacillus Calmette-Guérin (BCG). This live attenuated vaccine induces not only an adaptive immune response against tu-berculosis, but also triggers innate immune activation and memory. We established personal immune profiles and chromatin accessibility maps over a time course of BCG vaccination in 323 individuals. This large resource uncovered genetic and epigenetic predictors of baseline immunity and BCG vaccine response. We found that BCG vaccination enhances the innate immune response only in individuals with dormant immune states at baseline, suggesting that exogeneous induction of trained immunity is not a universal booster of innate immunity, but specifically elevates weak innate immune responses. This study advances our understanding of BCG’s heterologous immune-stimulatory effects and trained immunity in humans. Moreover, our results highlight the value of epigenetic cell states as an “endo-phenotype” that connects immune function with genotype and the environment.
This dataset represents two combined study populations. Serrated Colorectal Cancer: An Emerging Disease Subtype (called the Advanced Colorectal Cancer of Serrated Subtype Study or ACCESS Study) was a grant awarded to investigate a newly-recognized, biologically-distinct subtype of colorectal cancer (CRC) called “serrated CRC.” The objective of this project was to characterize factors related to the genetic predisposition, clinical presentation, and prognosis of serrated CRC. The study recruited incident invasive CRC cases diagnosed between April 2016 and December 2018, aged 20-74 years at diagnosis. Cases were identified through the Surveillance, Epidemiology and End Results (SEER) cancer registry serving 13 counties in western Washington State. Eligibility for all individuals was limited to those who were English-speaking and could consent. Participation included completing a baseline epidemiologic questionnaire shortly after diagnosis, optional donation of a saliva sample for genetic analysis, and optional consent to release of medical records and tissue specimens related to their diagnosis. Tumor specimens were tested for serrated CRC-defining molecular characteristics. Further, we have vital status on all participants and cause of death on those that have died since enrollment. Hormones and Colon Cancer: Epigenetic Subtypes, Risks, and Survival (called the Post-Menopausal Hormones Study or PMH Study) was a grant awarded to investigate the impact of post-menopausal hormone use on colon cancer risk, tumor molecular characteristics, and outcomes. Eligible cases were females, newly diagnosed with invasive colorectal adenocarcinoma between October 1998 and February 2002, aged 50 to 74 years. Cases were residents of 10 out of the 13 counties in western Washington State served by the Surveillance, Epidemiology and End Results (SEER) cancer registry. Eligibility for all individuals was limited to those who were English-speaking with available telephone numbers, in which they could be contacted. Unrelated population-based controls were randomly selected according to age distribution (in 5-year age intervals) of the eligible cases by using lists of licensed drivers from the Washington State Department of Licensing (for individuals aged 50 to 64 years) and rosters from the Health Care Financing Administration (now the Centers for Medicare and Medicaid, for individuals older than 64 years). Participation included completing a baseline epidemiologic questionnaire, optional donation of a saliva sample for genetic analysis, and (for cases only) optional consent to release of medical records and tissue specimens related to their diagnosis. Tumor specimens were tested for epigenetic and other molecular characteristics. The ACCESS study was supported by funding from the National Cancer Institute of the National Institutes of Health (NCI/NIH) (R01CA196337, PI: Newcomb, PA), as was the PMH Study (R01CA076366, PI: Newcomb, PA). Additional support for the PMH Study came from the Seattle site of the Colon Cancer Family Registry (SCCFR) (U01CA167551, PI: Jenkins, M, and U01/U24CA074794, PI: Newcomb, PA). Additional support for case ascertainment was provided by the Cancer Surveillance System of the Fred Hutchinson Cancer Center, which is funded by Contract Number HHSN261201300012I; NCI Control Number: N01 PC-2013-00012; Contract Number HHSN261201800004I; and NCI Control Number: N01 PC-2018-00004 from the Surveillance, Epidemiology and End Results (SEER) Program of the National Cancer Institute with additional support from the Fred Hutchinson Cancer Center and the State of Washington. This research was also supported by the Genomics and Bioinformatics, Comparative Medicine, Specialized Pathology, Collaborative Data Services, and Experimental Histopathology Shared Resources of the Fred Hutch/University of Washington Cancer Consortium (P30 CA015704).Tumor marker testing was performed using formalin-fixed paraffin-embedded diagnostic tumor tissue specimens, and DNA extracted from those specimens. Testing for microsatellite instability (MSI) was based on either a 10-gene panel (BAT25, BAT26, BAT40, MYCL, D5S346, D17S250, ACTC, D18S55, D10S197, BAT34C4) or a 4-marker immunohistochemistry panel of DNA mismatch repair proteins (MLH1, MSH2, MSH6, PMS2). CpG island methylator phenotype (CIMP) testing was based on a validated quantitative DNA methylation assay using a five-gene panel (CACNA1G, IGF2, NEUROG1, RUNX3, SOCS1) or eight-gene panel (CACNA1G, IGF2, NEUROG1, RUNX3, SOCS1, MLH1, CRABP1, CDKN2A). Somatic p.V600E BRAF mutation status was tested for using a fluorescent allele-specific PCR assay. KRAS mutations in codons 12 and 13 were also assessed through forward and reverse sequencing of amplified tumor DNA. DNA was extracted from blood/saliva samples using conventional methods. The genotyping panel completed was the Build37 OncoArray500K-C, including 1%-6% blinded duplicates to monitor the quality of the genotyping. Quality control procedures were performed to 1) make sure that there were no patterns of missing data by batch, study, or plate, 2) check for gender discrepancies and kinship, 3) complete Principal Component Analysis, and 4) test for Hardy-Weinberg equilibrium (HWE). Samples were excluded based on call rate, heterozygosity, unexpected duplicates, gender discrepancy, and unexpectedly high identity-by-descent or unexpected genotypic concordance (>65%) with another individual. In addition, variants were excluded based on call rate (98%), lack of HWE in controls (P
This is a set of cases diagnosed with idiopathic pulmonary fibrosis, a fatal interstitial lung disease. These cases were included in the TOPMed phase three studies. The planned study will compare these cases to within-TOPMed controls for genome-wide association studies.
Study to evaluate the molecular genetics of ccRCC through integrative analyses of tumors and tumorgrafts using multiple platforms in clinically annotated samples. Genome and exome data for patients giving consent to have the information available on a public, secure, database is being submitted.
For this project about non-muscle invasive bladder cancer (NMIBC), we analysed long read sequencing data from 4 patients using the Nanopore platform. The dataset is composed of 8 BAM files: 4 tumor samples and 4 paired blood samples.
HumanOmni2.5-8 data from Chad and Yemen.This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Sequencing of B-cell receptor repertoires in healthy individuals and patients with chronic lymphocytic leukemia. 1) This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute please see http://www.sanger.ac.uk/datasharing/
Patients with metastatic urothelial carcinoma treated at our institution, who elected to undergo rapid autopsy at the time of death, are included in this study. These patients agreed to submit their tissue (normal samples for germline analysis [kidney or liver] as well as primary and metastatic tumor sites [urothelial carcinoma]) and the time of autopsy. Each patient signed consent to allow study of these tissues including histologic, sequencing (DNA/RNA), and implantation in to animals as able for future study.
Recent sequencing studies have extensively explored the somatic alterations present in the nuclear genomes of cancers. Although mitochondria control energy metabolism and apoptosis, the origins and impact of cancer-associated mutations in mitochondrial DNA (mtDNA) are unclear. Here, we analysed somatic alterations in mtDNA from 1,675 tumors across 31 histologies. We identified 1,907 somatic substitutions, which exhibited dramatic replicative strand bias, predominantly C>T and A>G on the mitochondrial heavy strand. This strand-asymmetric signature differs from those found in nuclear cancer genomes but matches the inferred germline process shaping primate mtDNA sequence content. Numbers of mtDNA mutations showed considerable heterogeneity across tumor types. Missense mutations were selectively neutral and often gradually drifted towards homoplasmy over time. In contrast, mutations resulting in protein truncation undergo negative selection and were almost exclusively heteroplasmic. Our findings indicate that the endogenous mutational mechanism has far greater impact than any other external mutagens in mitochondria, and is fundamentally linked to mtDNA replication.
Single-cell DNA sequencing with antibody-oligonucleotide staining was performed using the Mission Bio Tapestri single-cell DNA sequencing platform, per the manufacturer’s instructions. All libraries were sized and quantified using an Agilent Bioanalyzer and pooled for sequencing on an Illumina NovaSeq6000 with 150 base-paired ending multiplexed runs. Fastq files generated by the sequencers were processed using the Tapestri Pipeline V2 and included adapter trimming, sequence alignment (BWA), barcode correction, cell finding, and variant calling (GATK v4/Haplotypecaller). FastQ files are provided for the 5 samples that were processed using the aforementioned pipeline.