For two cases of breast cancers, semibulk RNA-seq and single cell RNA-seq were performed.
Multimodal characterization of cell-free DNA (cfDNA) in the blood can enable the sensitive and non-invasive detection of human cancers but remains technically challenging and costly. Here, we developed Multimodal Epigenetic Sequencing Analysis (MESA), a flexible and sensitive method of capturing and integrating multimodal epigenetic information of cfDNA using a single experimental assay, i.e., non-disruptive bisulfite-free methylation sequencing, such as Enzymatic Methyl-seq (EM-seq) and TET-assisted pyridine borane sequencing (TAPS). MESA can simultaneously infer cfDNA methylation, nucleosome occupancy, nucleosome fuzziness, and fragmentation profile for regions surrounding the promoters and polyadenylation sites (PASs). MESA’s integrated analysis of multimodal epigenetic features significantly improved the performance of cancer detection models compared to the usage of any single modality alone. MESA captures additional and highly complementary epigenetic information from cfDNA without additional experimental assays, highlighting the importance and clinical prospect of using multimodal epigenetic features for non-invasive cancer detection
Genome and transcriptome sequencing of cancer of unknown primary tumours was used to determine yield of clinical biomarkers for a molecular guided trial or for resolving cancer type of origin. This study includes profiling of germline DNA and tumour DNA by whole genome sequencing of tissue and cfDNA biopsies, as well as targeted genome sequencing using two panels, including Comprehensive cancer panel (CCP) and TruSight Oncology 500 (TSO500). This study also includes profiling of tumour RNA by whole transcriptome sequencing.
U1 small nuclear RNA (snRNA) mutations are recurrent non-coding alterations found in various malignancies, including medulloblastoma, yet their identification has proven challenging due to their repetitive nature. To establish a robust DNA-based, and RNA-based method for detecting U1 snRNA mutations, we performed targeted DNA sequencing using our original gene panel for medulloblastoma and RNA-sequencing. As an RNA-based method, we developed a random forest classifier employing splicing patterns which were massively altered as a result of U1 snRNA mutations. By using both methods, we successfully detected U1 snRNA mutation in five DNA samples and four RNA samples.
Rare cancer sequencing data of 23 runs in tumor/control pairs, which were uploaded to umbrella studies. The sequencing was always paired
The study was conducted under the auspices of the Transdisciplinary Research In Cancer of the Lung (TRICL) Research Team, which is a part of the Genetic Associations and MEchanisms in ONcology (GAME-ON) consortium, and associated with the International Lung Cancer Consortium (ILCCO). Ethics All participants provided written informed consent. All studies were reviewed and approved by institutional ethics review committees at the involved institutions. Sequencing data are derived from four sub-studies. The sub-studies that contributed include Harvard, Liverpool, Toronto, and IARC. The IARC and Toronto studies are described above. A description of the Harvard and Liverpool studies is provided below. Liverpool Lung Project: The Liverpool Lung Project (LLP)1 is a case control and cohort study, which has over 11,500 individuals, with detailed epidemiological, clinical and outcome data with associated specimens (i.e. tumour tissue, blood, plasma, sputum, bronchial lavage, EBUS and oral brushings). The participants have completed a detailed lifestyle questionnaire and updated data on clinical outcome and hospital events are collected through the Office of National Statistics, Cancer Registry and from Health Episode Statistics. The project is registered on the UK National Institute for Health Research (NIHR) lung cancer portfolio and has all the required ethical approvals and sponsorship arrangements in place. The LLP has detailed standard operating procedures (SOP) for all aspects of the recruitment, data, specimen collection as well as the data storage. The LLP Cohort study has 8,224 participants with blood and 7,761 with plasma samples. The LLP case-control samples have been incorporated into in a large number of international GWAS and molecular studies 2,3, methylation 4-7, microRNA 8and next generation studies 9-11, resulting in high ranking publications, as well as forming the basis for the LLP risk prediction model 12-14 which has been utilised in the UK lung cancer screening trial (UKLS) 15-17 Patient and control DNAs were derived from EDTA-venous blood samples. Harvard Samples. David Christiani at the Harvard University School of Public Health has been directing research studies to investigate etiological factors influencing lung cancer development since 1983 and has amassed a collection of 2000 controls and 5055 lung cancer cases. He has been actively collecting and storing snap frozen tumor samples since 1992. Around 1500 tumor samples have been collected and the average wet tumor yield is about 30 grams of tumor, of which 631 cases have completely annotated clinical and survival information. Pathology confirmation is provided by two pathologists. At the time of surgery, a minimum of 30 grams of wet lung tumor tissue and 30 grams of non-involved tissue from the same lobe is sectioned, flash frozen and sent to Dr. Christiani's lab for logging and storage. A blood sample for DNA and serum is collected. A structured interview by trained research staff is conducted on each case, and clinical outcomes and treatments is extracted and entered into the molecular epidemiology data base at Harvard. Fresh frozen samples have been collected from 1451 lung cancer and are available for study. Samples from this collaborative study have played key roles in major studies, including the initial finding describing EGFR mutations in lung cancer 22. Participants in this study are patients, > 18 years of age, with newly diagnosed histologically confirmed lung cancer. Samples that are included in the analysis have the following histologies: Adenocarcinoma: 8140/3, 8250/3, 8260/3, 8310/3, 8480/3 8560/3; LCC: 8012/3, 8031/3; squamous carcinoma: 8070/3, 8071/3, 8072/3, 8074/3; and other NSCLC: 8010/3, 8020/3, 8021/3, 8032/3, 8230/3. The Toronto Study: The Toronto study was conducted in the Great Toronto Area between 1997 and 2014. Cases were recruited at the hospitals in the network of University of Toronto and Lunenfeld- Tanenbaum Research Institute. At the time of recruitment in the clinical setting, provisional diagnoses of lung carcinoma were first assigned based on clinical criteria. Diagnoses for all cases included were histologically confirmed by the reference pathologist who is a specialist in pulmonary pathology, based on review of pathology reports from surgery, biopsy or cytology samples in 100% of cases. Diagnostic classification was done initially according to ICD-9, ICD-10, and ICD for oncology-2, and subsequently converted to ICD-O-3. Tumors were grouped into the major categories included in this analysis according to primary cancer type based on the ICD-3 definitions. Controls were randomly selected from individual visiting family medicine clinics and Ministry of Finance Municipal Tax Tapes. All subjects were interviewed using a standard questionnaire and information on lifestyle risk factors, occupational history, medical and family history was collected. Blood samples were collected from more than 85% of the subjects. IARC: The IARC data are derived from case-control studies conducted in Russia and include samples that have available tissue samples. Patient and control DNAs were derived from EDTA-venous blood samples. The lung cancer patients were classified according to ICD-O-3; SQ: 8070/3, 8071/3, 8072/3, 8074/3; AD: 8140/3, 8250/3, 8260/3, 8310/3, 8480/3, 8560/3, 8251/3, 8490/3, 8570/3, 8574/3; with tumous with overlapping histologies classified as mixed. The Lung Cancer Transdisciplinary Research Cohort is utilized in the following dbGaP sub-studies. To view genotypes, other molecular data, and derived variables collected in these sub-studies, please click on the following sub-studies below or in the "Sub-studies" section of this top-level study page phs000876 Lung Cancer Transdisciplinary Research Cohort. phs000877 Meta Analysis phs000878 CIDR Lung Cancer phs001681 Affy Axiom Array
Peritoneal metastases (PM) in colorectal cancer (CRC) portend a poor prognosis. We sought to elucidate molecular features differentiating primary tumors (PTs) from PMs and actionable targets facilitating transcoelomic dissemination and progression. We performed multi-omic profiling of 217 samples from 136 patients, including 56 primary tumor (PT) and 120 PMs comprising 25 matched PT-PM pairs. Whole exome, and bulk RNA-seq analysis was conducted to identify underlying genomic aberrations and transcriptomic differences between primary and peritoneal lesions.
Epigenetic regulation of transcription plays a crucial role in lineage commitment of embryonic stem cells. Promoters of key lineage-specific differentiation genes are found in a repressed bivalent state, having both activating H3K4me3 and repressive H3K27me3 histone marks, making them poised for transcription upon loss of H3K27me3 in response to environmental cues. Whether the tumour-initiating, self-renewing, cancer-initiating cells (C-ICs) have similar epigenetic regulatory mechanism that prevent lineage commitment is unknown. In order to investigate bivalently marked and repressed promoters, we used a patient-derived CC-IC enriched model to identify the changes in transcriptome following inhibition of EZH2, the H3K27 methyltransferase. We also performed ChIP-seq for H3K27me3 and H3K4me3 at baseline in order to identify repressed and bivalently marked promoters.
For this dataset we performed single cell RNAseq paired with single cell TCR-seq on tumor and blood samples from 4 patients. This dataset contains 4 tumor samples as well as 4 blood samples. Each sample is made up of 2 sets of paired fastq files. The first pair contains reads corresponding to RNA transcripts (_Transcripts in file name), while the second pair contain reads corresponding to TCRs (_VDJ in file name). Sequenced on the Illumina NovaSeq6000 platform in a paired-end run using an SP flow cell (v1.5, 300 cycles).
The Mood Methylation Study (MMS) is an observational study designed to understand epigenetic mechanisms underlying major depression symptoms (MDD) using a co-twin control design. Twins enrolled in the MMS were monozygotic (MZ) twin pairs discordant on lifetime history of MDD. All twins were members of the Washington State Twin Registry (WSTR), a community-based twin registry consisting of over 10,000 twin pairs. Lifetime and current MDD diagnoses were determined using the Structured Clinical Interview for DSM-IV Research Version (SCID-4-RV). DNA methylation in blood monocytes was measured using the Infinium HumanMethylationEPIC BeadChip. Monocyte gene expression was quantified by paired end RNA-seq (50 PE). Zygosity of the twin pairs included in the MMS was self-reported and confirmed using the polymorphic SNPs included in the DNA methylation array.