This dataset contains RNA-seq data (Fastq files of paired-end data) of 18 patient tumors used for identification of neotranscripts in 18 different types of fusion-driven sarcomas and other cancers as described in Vibert et al., Mol Cell 2022 (PMID: 35550257)
Single nuclei RNA sequencing (snRNA-seq) on tissue samples from 11 patients (SDHB and RET). Files are fastq files of 10x-5'scRNAseq libraries.
RNA-seq dataset (BAM files) of 28 HCCs and 19 non-tumor livers derived from 8 patients undergoing sorafenib treatment.
Human data for transcriptome (bulk RNA-Seq) in eight B-cell precursors: HSC, CLP, pro-B, pre-B, Immature B, Transitional B, Naive B CD5-, Naive B CD5+ cells
We performed RNA-Seq in DIPG and hemispheric HGG.
RNA-seq of primary and metastatic sites from highly clinically annotated HGSC samples. Samples were obtained pre-treatment based on a laparoscopic triage algorithm from patients who underwent R0 tumor debulking, or received neoadjuvant chemotherapy (NACT) with excellent (ER) or poor response(PR).
The data set consists of fastq raw files from RNA-seq of seven mucosal biopsies of the colon from seven patients, among them three patients with irritable bowel syndrome with mixed type symptoms. Paired end sequencing on Illumina NovaSeq 6000 was used.
To investigate ALS pathology at the cellular level, we conducted a multiome (single-nucleus RNA sequencing and assay for transposase-accessible chromatin using sequencing) analysis using the motor cortex and spinal cord of patients with ALS and control individuals.
ATAC-seq data from multiple colorectal cancer cell lines
Single-cell measurements of cellular characteristics have been instrumental in understanding the heterogeneous pathways that drive differentiation, cellular responses to signals, and human disease. Recent advances have allowed paired capture of protein abundance and transcriptomic state, but a lack of epigenetic information in these assays has left a missing link to gene regulation. Using the heterogeneous mixture of cells in human peripheral blood as a test case, we developed a novel scATAC-seq workflow that increases signal-to-noise and allows paired measurement of cell surface markers and chromatin accessibility: Integrated Cellular Indexing of Chromatin Landscape and Epitopes (ICICLE-seq). We extended this approach using a droplet-based multiomics platform to develop a trimodal assay that simultaneously measures Transcriptomics (scRNA-seq), Epitopes, and chromatin Accessibility (scATAC-seq) from thousands of single cells, which we term TEA-seq. Together, these multimodal single-cell assays provide a novel toolkit to identify type-specific gene regulation and expression grounded in phenotypically defined cell types.
Background and Rationale for the Childhood Cancer Survivor Study (CCSS) Over the last several decades, advances in treatments for childhood and adolescent cancer have substantially improved survival following diagnosis. These improvements gave rise to the responsibility for investigating long-term treatment-associated morbidity and mortality. Early efforts to describe late effects were largely conducted through single-institution and limited consortia studies. However, by the mid-1980s, it became increasingly clear that these approaches had inherent limitations, including small sample size, convenience sampling, incompletely characterized populations, and limited length of follow-up. To overcome these limitations, the CCSS was proposed and funded by the National Cancer Institute (NCI) as a U01 grant in 1994. Subsequently, the strengths of the CCSS, including an efficient and extensive infrastructure, plus expanding database and biorepository, were recognized and appreciated. Thus, in consultation with the NCI, the CCSS was converted to a U24 (resource grant) funding mechanism to serve the scientific community in 2000. The overarching goal of the CCSS resource is to increase the conduct of innovative and high impact research related to pediatric cancer survivorship. CCSS has been used extensively by researchers from a wide range of disciplines to address a broad spectrum of topics. Strengths of the resource include its large size, comprehensive annotation of treatment exposures, ongoing longitudinal follow-up with characterization of a wide array of participant characteristics and outcomes, and an established biorepository. Design of the Childhood Cancer Survivor Study The Childhood Cancer Survivor Study (CCSS) is a multi-institutional, multi-disciplinary collaborative research resource comprised of a retrospective hospital-based cohort of survivors of childhood cancer and a comparison sibling cohort. Eligible survivors from 31 participating institutions were diagnosed between 1970 and 1999, prior to age 21 years, with selected common pediatric cancers (leukemia, central nervous system tumors, Hodgkin lymphoma, non-Hodgkin lymphoma, kidney tumors, neuroblastoma, soft tissue sarcoma, or bone tumors). All patients who survived five years from the date of diagnosis were eligible, regardless of disease or treatment status. The baseline questionnaire was completed by 24,368 survivors and 5,039 siblings recruited to serve as a comparison group. To date, participants have completed three general follow-up surveys, as well as a number of specialized surveys on specific topics (e.g. health care, insurance, screening practices, men's and women's health issues, adolescent health, sleep and fatigue). In addition, biological samples (buccal cells, saliva and/or blood) have been collected for over 11,000 participants. Full descriptions of the design and characteristics of the CCSS have been previously published (Robison et al; Leisenring et al.), and available data and samples are described at https://ccss.stjude.org/develop-a-study/gwas-data-resource.html. Treatment Data in the Childhood Cancer Survivor Study A key feature of CCSS is the availability of detailed treatment data, which were collected by abstraction of medical records for each individual member of the cohort. Detailed abstraction included dates of therapy, protocol information, and specific details regarding surgery, chemotherapy and radiation. Quantitative dose details were collected for 22 specific chemotherapeutic agents, including alkylating agents, anthracyclines, platinum compounds and epipodophyllotoxins. In addition to individual agent doses, algorithms have been created to calculate cumulative doses of all drugs in a specific class, such as anthracyclines (doxorubicin, daunomycin and idarubicin) or platinum agents (cisplatinum and carboplatinum). Data abstracted for surgeries included dates and both the names and corresponding International Classification of Diseases (9th revision) code. For radiation treatment data, all relevant records were sent to the Radiation Physics Center at M.D. Anderson Cancer Center for detailed abstraction and dosimetry. Initial body region dosimetry was performed for all participants, followed by more detailed dosimetry as needed for specific studies. Genomics Data in the Childhood Cancer Survivor StudyThe NCI's Division of Cancer Epidemiology and Genetics and CCSS investigators collaborated to conduct genomics studies (SNP array genotyping and whole exome sequencing) using samples from the CCSS Biorepository. Studies included all cohort participants with available DNA regardless of sex or ancestry when the genomics studies were initiated. Phenotype Data in the Childhood Cancer Survivor Study Vital status and cause of death for both participants and non-participants is determined via linkage with the National Death Index (NDI). Identification of subsequent neoplasms is based on self-report, followed by validation using medical records, or via NDI. A wide array of additional health outcomes have been ascertained via a comprehensive set of questions on the CCSS questionnaires, covering potential adverse events across a range of organ systems (hearing/vision/speech, urinary, hormonal, heart and circulatory, respiratory, digestive, brain and nervous systems). In addition to health outcomes, longitudinal data have been collected on demographics, health behaviors, family history, screening practices, insurance status, and a range of psychosocial and neurocognitive factors. A full listing of available variables and copies of the CCSS questionnaires are available at http://ccss.stjude.org. Research Areas in the Childhood Cancer Survivor Study Extensive use by the research community has resulted in over 265 published manuscripts on a wide range of topics, including associations between treatment factors and mortality, subsequent neoplasms, chronic health conditions, cardiac events, neurocognitive sequelae, psychosocial factors, fertility, and health status. Additional topics have included health behaviors, screening practices, health care access and utilization, statistical and exposure assessment methodology, and development of risk prediction models. A full listing of published manuscripts using CCSS data is available on the CCSS website at https://ccss.stjude.org/published-research/publications.html. The Childhood Cancer Survivor Study as a Resource for Investigators The CCSS is an NCI-funded resource (U24 CA55727) to promote and facilitate research among long-term survivors of cancer diagnosed during childhood and adolescence. Interested investigators are encouraged to develop research ideas and propose projects within CCSS, whether or not they are from a participating CCSS institution. The CCSS is now accepting proposals to collaborate with CCSS and NCI investigators in the use of genomics data and corresponding outcomes-related data to address innovative research questions relating to potential genetic contributions to risk for treatment-related outcomes. Any researcher, or group of researchers, qualified to conduct genetic research can submit a proposal. There are no restrictions relative to country, institution, or prior involvement in CCSS. A full description of the process for developing a proposal for genetic research in CCSS can be found at https://ccss.stjude.org/develop-a-study/gwas-data-resource.html, along with listings of approved proposals.
Head and neck squamous cell carcinoma (HNSCC) tumors were sampled from 88 patients. Samples were implanted subcutaenously into the flank of NOD/SCID/IL2R-/- mice. Mice were then monitored weekly for tumor growth and the time of initial palpation was recorded. Mice were euthanized when tumors reached 15 mm in diameter or after 6 months. If no tumor formed by 6 months, the patient tumor was defined as Non-engrafter (N). RNA-seq was performed on all primary tumor samples and successfully engrafted patient-derived xenograft (PDX) samples.
This data set contains the RNA-seq BAM files for der(1;7)(q10;p10) MDS cases, -7/del(7q) MDS cases, and OTHER MDS cases. The CD34 selected RNA were collected from patient BM/PB. The dataset was used to identify the unique expression profiles for der(1;7)(q10;p10) MDS cases compared to non-der(1;7)(q10;p10) MDS cases.
We performed single cell RNA sequencing (scRNA-seq) for 208,506 cells derived from 58 lung adenocarcinomas from 44 patients, which covers primary tumour, lymph node and brain metastases, and pleural effusion in addition to normal lung tissues and lymph nodes.
This dataset contains fastq-files from bulk RNA sequencing of 120 AML samples. RNA was extracted from either bone marow or peripheral blood from primary AML samples. The libraries were prepared using Illumina Truseq RNA library preparation kit v2 and sequenced on an Illumina NextSeq 500 using 2x151bp paired end chemistry.
In this study, we utilize macrophage-tropic and T cell-tropic HIV-1 Env proteins to establish accurate infection profiles for multiple CNS cells under basal and interferon alpha (IFN-α) or lipopolysaccharide (LPS)-induced inflammatory states. We found that macrophage-tropic viruses confer entry advantages in primary myeloid cells, including monocyte-derived macrophages, microglia, and induced pluripotent stem cell (iPSC)-derived microglia. Following 24-h IFN-α or LPS treatments, total RNA was harvested from 100,000 to 200,000 cells using an RNeasy RNA isolation kit (Qiagen, 74004). RNA was quantified using a NanoDrop spectrophotometer and TapeStation electrophoresis prior to library preparation. cDNA libraries were prepared using the KAPA RNA HyperPrep kit (Roche, 08098115702). Paired-end RNA sequencing was performed on a NovaSeq S4 platform at a read length of 100 bp, resulting in approximately 60 million reads per sample.