Somatic mutations of 256 whole-genome sequenced colorectal tumors. 234 MSS, 19 MSI and 3 POLE mutants. See Katainen R. et al. CTCF/cohesin-binding sites are frequently mutated in cancer, Nature Genetics 2015. doi:10.1038/ng.3335
Sequencing data for ICGC Oesophageal Adenocarcinoma tissue samples - 129_rnaseq EAC expression data - Publication Secrier & Li et al., 2016, Nature Genetics
Data supporting: "SMAD4 and KCNQ3 Alterations are Associated with Lymph Node Metastases in Oesophageal Adenocarcinoma" RNAseq (FASTQ files) 6 samples
Data supporting: "The transcriptional landscape of endogenous retroelements delineates esophageal adenocarcinoma subtypes" RNAseq for 279 samples
Whole genome bisulfite sequencing data for 6 ependymomas plus 3 fetal controls (f1, f2, f4) and 3 adult controls (a2, a3, a4). See Mack, Witt et al. Nature 506(7489):445-50, 2014 (PMID: 24553142).
This dataset contains the RPKM and raw read counts of expression for all the individuals. This dataset was generated as part of the following study: Panousis et al (2019). Combined genetic and transcriptome analysis of patients with SLE: Distinct, targetable signatures for susceptibility and severity.
This dataset contains the clinical phenotypes/covariates information for all the individuals. This dataset was generated as part of the following study: Panousis et al (2019). Combined genetic and transcriptome analysis of patients with SLE: Distinct, targetable signatures for susceptibility and severity.
VCF for 87 Argentinean samples. Only SNPs (no indels) that passed the Affymetrix QC. Data from Luisi et al. 2020. Plos One. Fine-Scale Genomic Analyses Of Admixed Individuals Reveal Unrecognized Genetic Ancestry Components In Argentina. Reference Allele column does NOT contain reference allele from genome assembly.
The electronic Medical Records and Genomics (eMERGE) Network is a consortium of ten participating sites funded by the NHGRI to investigate the use of electronic medical record systems for genomic research. The goal of eMERGE is to conduct genome-wide association studies in approximately 19,000 individuals using EMR-derived phenotypes and DNA from linked Biorepositories. The eMERGE Network brings together researchers with a wide range of expertise in genomics, statistics, ethics, informatics, and clinical medicine from leading medical research institutions across the country. Each center participating in the consortium is uniquely situated to provide critical resources to this highly collaborative and productive network. Each site combines a biobank or study cohort with extensive genomic data and access to clinical data derived from electronic medical records. Sites are geographically dispersed and have diverse patient populations, including two sites focusing specifically on pediatrics. The eMERGE Network is comprised of 9 sites and one coordinating center. Each site maintains its own biorepository where DNA specimens are linked to phenotypic data contained within EMRs. Children's Hospital of Pennsylvania, PI: Hakon Hakonarson, MD, PhD Cincinnati Children's Medical Center with Boston Children's Hospital, PI: John Harley, MD, PhD - CCHMC and Ingrid Holm, MD, MPH - BCH Geisinger Health System, PI: David Carey, PhD and Marc Williams, MD Group Health Cooperative with University of Washington, PI: Eric Larson, MD, MPH - GHC and Gail Jarvik, MD, PhD - UW Marshfield Clinic with Essentia Institute of Rural Health, PI: Catherine McCarty, PhD, MPH - Essentia and Murray Brilliant, PhD - Marshfield Mayo Clinic, PI: Christopher Chute, MD, DrPH and Iftikhar Kullo, MD Icahn School of Medicine at Mount Sinai, PI: Erwin Bottinger, MD Northwestern University, PI: Rex Chisholm, PhD and Maureen Smith, MS Vanderbilt University, PI: Dan Roden, MD eMERGE Coordinating Center - Vanderbilt University, PI: Paul Harris, PhD Using electronic phenotyping methods, the consortium used DNA samples from all participating sites to explore the genetic determinants of red cell indices, white blood count (WBC) differential, diabetic retinopathy, height, serum lipid levels, specifically total cholesterol, HDL (high density lipoprotein), LDL (low density lipoprotein), and triglycerides, and autoimmune hypothyroidism as well as using the phenome-wide association study (PheWAS) paradigm to replicate and discover relationships between targeted genotypes with multiple phenotypes.
Introduction: Tumour Mutation Burden (TMB) is a potential biomarker for immune therapies. Identifying factors that may affect TMB prediction is key to its utility as biomarker for treatment options. Here we investigated parameters that might affect TMB using cytology smears obtained from endobronchial ultrasound transbronchial needle aspiration (EBUS TBNA)-sampled malignant lymph nodes, which are often the only diagnostic samples collected in patients with advanced lung cancer. Methods: Individual Diff-Quik cytology smears were prepared for each needle pass from the same enlarged malignant mediastinal lymph nodes. DNA extracted from smears of each pass underwent sequencing using a large gene panel sequencing (TSO500, Illumina). TMB was estimated for each individual needle pass using the TSO500 Local App v. 2.0 (Illumina). Results: Twenty patients had two or more Diff-Quik smears (total 45 smears) which passed sequencing quality control. Average TMB for all smears was 8.7 ± 5.0 mutations per megabase (Mb). Sixteen of the 20 patients had paired samples with minimal differences in TMB score (average difference 1.3 ± 0.85). Paired samples from 13 patients had concordant TMB in terms of scores being below or above a threshold of 10 mutations/Mb. Samples from seven patients had discrepant TMB, with four cases in particular exhibiting an average difference of 11.3 ± 2.7 mutations/Mb. Factors affecting TMB calling involved tumour content of the samples, the amount of DNA used in sequencing as well as bone fide intra tumour heterogeneity between paired samples. Conclusions: TMB assessment is feasible from EBUS-TBNA derived Diff-Quik slides from a single needle pass. Repeated samples of one or more lymph nodes have minimal variation in TMB in most cases (80%), suggesting accuracy of the test overall. To enhance accuracy of the result, care should be taken in relation to the tumour content of the cytology smear. Further, clinicians should be aware that even slight differences in lymph node site selection could give rise to variable genomics and discrepant TMB due to tumour heterogeneity. Therefore, it would be reasonable for multi-lymph node site analysis of TMB to be performed using EBUS TBNA aspirates from different easily accessible lymph nodes. Further investigations need to determine whether these samples should be combined as one sample or tested separately.