Genome-wide association studies (GWAS) of colorectal cancer (CRC) have been instrumental in identifying a number of common susceptibility loci in Non Hispanic (NH)-White populations, and a NCI priority is to extend GWAS findings to other populations to address racial/ethnic disparities in cancer susceptibility. Currently, GWA studies of CRC in NH-Whites, Japanese and African-Americans are ongoing. We propose a complementary study to address this critical research area in Hispanics. Hispanics represent the fastest growing ethnic population in the U.S. and have been largely understudied in terms of genetic susceptibility to cancer. There are noted differences in incidence, survival and mortality in CRC by ethnic/racial groups. Hispanics often present with CRC at a younger age and have a significantly greater incidence of stage IV tumors or metastatic disease compared to NH-Whites. We propose to conduct a large, cost-efficient, population-based GWAS in Hispanics by building upon existing NIH-funded resources, the Colon Cancer Family Registry (Colon CFR) and the Multiethnic Cohort Study (MEC). We plan to recruit 2,500 Hispanic men and women diagnosed with CRC between 01/2008 to present using cancer registries in California, physican referrals and familial referrals. Risk factor/diet questionnaires, pathology reports, Oragene saliva samples (for genotyping), optional blood samples (for genotyping and biometric analysis) and tumor blocks (for MSI testing) will be collected using methodologies developed in the Colon CFR/MEC. Cases of CRC in the MEC (currently 473; anticipated 600 at end) will also be included. Population-based Hispanic individuals without a diagnosis of CRC participating in other GWA studies in the MEC (n=3,900, U01HG004726, Haiman) will be used as controls. We will genotype all 3,100 cases using the Illumina 1M array and use available genotype and epidemiologic data collected on 3,900 controls. Our statistical analyses will include: single-SNP and haplotype effects, gene-environment interactions and heterogeneity by MSI, tumor subtype and family history of CRC. We will replicate findings in a second-stage using CRC cases and controls from Mexico (1,000 cases and 1,000 controls, EU FP7 funding, CHIBCHA, Carvajal-Carmona/Tomlinson). We will also examine heterogeneity of the risk estimates by ethnicity/race by leveraging GWA data on NH-Whites (2,142 cases, 1,909 controls, U01 CA122839, Casey), (4,000 cases, 6,000 NH-White controls, UK-CHIBCHA, Tomlinson), Colombians (2,000 cases and 2,000 controls, CHIBCHA), Japanese (1,000 cases and 1,000 controls) and African-Americans (1,500 cases and 1,500 controls, R01CA126895, Le Marchand). We will genotype replicated significant SNPs in our main and combined analysis in several Hispanic populations (note: studies funded by EU or NIH for data collection but not GWAS), including 800 Puerto Ricans, 2,000 Brazilians, 2,000 Argentineans and 3,000 Spanish/Portuguese, to assess generalizability of findings. We will examine the differences in inflammatory gene transcription dynamics in leukocytes (from blood sample collection) by fatigue level (as assessed from study questionnaire data). This study will have a high impact by addressing the key question of racial/ethnic disparities related to genetic susceptibility to CRC, will provide translational guidelines on biological mechanisms during the cancer survivorship period to increase quality of life among cancer survivors, and will enable further growth and investment into research among Hispanics by providing a resource of genetic data and biospecimens, which is lacking.
Total RNA (100ng) from 21 ETMRs with C19MC structural alterations and 28 other PBTs was prepared with nCounter miRNA Sample Prep Kit according to standard protocol. miRNA expression profiling was conducted with human v1, v2, or v3 miRNA panel on nCounter miRNA expression platform (NanoString Technologies, Seattle, WA) according to manufacturer’s protocol. Signal normalization was done using nSolver Analysis and batch corrected using ComBat (Johnson et al. Biostatistics. 2007). 565 miRNAs overlapped between all three versions and was used for further analyses. Fold change and supervised t-test with FDR correction was calculated between the ETMRs and other PBTs.
Further investigation and characterisation of 12q-amplified low- and high-grade osteosarcomas with MDM2 and/or CDK4 amplification focusing on SV, copy number and gene fusion analyses. In total, 25 cases (33 samples total due to multi-sampling) were included, with some form of sequencing data available for 27 samples. Mate-pair whole genome sequencing (Illumina) is available for 19 samples, longread whole genome sequencing (PacBio HiFi) on 10 samples and RNA-sequencing (Illumina Truseq) on 21 samples. Data is available as BAM files.
This dataset contains transcriptome data generated from naive and memory B cells isolated from patients with inflammatory bowel disease and healthy controls. Data was generated from both patients with Crohn's disease and ulcerative colitis. In addition, it contains BCR repertoire data from patients with IBD and healthy controls. The repertoire data was generated using amplicon sequencing of RNA isolated from peripheral blood, lymph nodes from inflamed gut and gut associated lymphatic tissue.
Squamous cell carcinoma arising from mature teratoma is one of the rare ovarian cancers. Therefore, the detailed molecular background of this disease has not been elucidated. The purpose of this study is to investigate the single-cell transcriptional landscape of this disease.
Biopsies from the terminal ileum and rectum of healthy individuals are digested on ice to single cells and processed for single-cell RNA-sequencing (10X Genomics and Illumina) This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/ . This dataset contains all the data available for this study on 2020-05-12.
Five commercially available parallel sequencing assays were evaluated for their ability to detect gene fusions in eight cell lines and 18 FFPE tissue samples carrying a variety of known gene fusions. Four RNA-based assays and one DNA-based assay were compared; two were hybrid capture-based, TruSight Tumor 170 Assay (Illumina) and SureSelect XT HS Custom Panel (Agilent), and three were amplicon-based, Archer FusionPlex Lung Panel (ArcherDX), QIAseq RNAscan Custom Panel (Qiagen) and Oncomine Focus Assay (Thermo Fisher Scientific).
The intersection of genome-wide association analyses with physiological and functional data indicates that variants regulating islet gene transcription influence type 2 diabetes (T2D) predisposition and glucose homeostasis. However, the specific genes through which these regulatory variants act remain poorly characterized. To identify such effector transcripts for T2D and glycemic traits, we generated expression quantitative trait locus (eQTL) data in 118 human islet samples using RNA-sequencing and high-density genotyping.
This dataset contains RNA-sequencing of Bone marrow-derived CD34+ cells from Healthy Controls (n=2) and SLE patients (n=10). SLE patients are divided into two categories based on severity: patients with moderate/mild disease (n=4) and patients with severe disease (n=6). Libraries were generated using the Illumina TruSeq Sample Preparation kit v2. Single-end 75-bp mRNA sequencing was performed on Illumina NextSeq 500. The raw fastq files are uploaded.
Data supporting: "Evidence that polyploidy in esophageal adenocarcinoma originates from mitotic slippage caused by defective chromosome attachments" Scott et al. WGS and RNAseq sequencing data Organoid, tumour and normal samples BAM files