Enrichment of oral-derived bacteria in inflamed colorectal tumors and distinct associations of Fusobacterium in the mesenchymal subtype

Study ID: EGAS00001006757
Type: Other

Study Description

The association between colorectal cancer (CRC) clinical variables and Fusobacterium, but not other intra-tumoral bacteria, has been extensively studied. Here we leveraged whole-transcriptome sequencing from 807 CRC tumor samples from the AVANT phase III trial to characterize both tumor gene expression and intra-tumoral bacteria. After stringent filtering, 74 high-confidence taxa were identified. Seventeen of these species, including 4 Fusobacterium spp., were classified as orally derived and had a robust signal within right-sided, MSI-H, and BRAF-mutant tumors. Across consensus molecular subtypes (CMS), integration of Fusobacterium animalis presence and tumor gene expression revealed that F. animalis had the greatest number of associations in mesenchymal CMS4 tumors, despite an overall lower prevalence than in immune CMS1 tumors. Pathway analysis within CMS4 revealed that F. animalis, but not other highly prevalent species, was uniquely associated with pathways for collagen degradation and formation as well as IL-6 and IL-1 cytokine signaling. These associations could explain in part ...

Study Datasets 1 dataset.

Description: RNA-seq FASTQ files from 797 tumors from AVANT. Sequencing libraries were generated with the TruSeq Stranded Total RNA kit (Illumina) following ribosomal RNA (rRNA) depletion with the Ribo-Zero Gold kit (Illumina). The libraries were sequenced on the HiSeq 4000 (Illumina) using 75 bp paired-end sequencing. Note: 10 samples used in the original publication were excluded from this upload due to regulations from the Human Genetics Resources Administration of China (HGRAC).
Technology: Illumina HiSeq 4000
Samples: 797

Who archives the data?