Login
Register
Need Help?
ABOUT
ABOUT THE EGA
EGA
Privacy Notice
Security
Team
STATISTICS
Bibliography
Growth
Community
Archive
Distribution
Catalog
PROJECTS AND FUNDERS
Projects
Funders
GA4GH
Federated EGA
Beacon
DISCOVERY
CATALOGUE
Studies
Datasets
DACs
Synthetic Data
METADATA
Search Box
Public Metadata API
SUBMISSION
DATA
File preparation
Uploading files
METADATA
EGA Schema
Sequencing & Phenotype
Submitter Portal
Submitter Portal API
Array
Programmatic Submission XML
ACCESS
DATA ACCESS COMMITTEE
What is a DAC?
Best Practices
DAC Portal
Data Use Conditions
REQUEST DATA
How to request data?
Quality Control Reports
DOWNLOAD
Metadata
Files
PyEGA3
Live Outbox
Visualisation
FUSE Client
EGA QuickView
Tips on how to search
DACs
EGAC00001002683
BostonGene Data Access Committee
Contact Information
aleksander.bagaev@bostongene.com
Request Access
This DAC controls 3 datasets
Dataset ID
Description
Technology
Samples
EGAD00001008776
Cellular deconvolution algorithms virtually reconstruct tissue composition by analyzing the gene expression of complex tissues. Here, we present the decision tree machine learning algorithm, Kassandra, trained on a broad collection of > 9,400 tissue and blood sorted cell RNA profiles to accurately reconstruct the tumor microenvironment (TME). Bioinformatic correction for technical and biological variability, aberrant cancer cell expression inclusion, and the accurate quantification and normalization of transcript expression increased the stability and robustness of Kassandra. Performance was validated on over 4,000 H&E tissue slides and more than 1,000 normal and tumor tissues by comparison with cytometric, immunohistochemical or single-cell RNA-seq measurements. Kassandra accurately deconvolved stromal and immune elements of blood and tumors, revealing the role of the TME in tumor pathogenesis. Digital TME reconstruction revealed that the presence of PD1-positive CD8+ T cells strongly correlated with immunotherapy response and increased the predictive potential of established biomarkers, indicating that Kassandra could potentially be utilized in future clinical applications.
Illumina NovaSeq 6000
NextSeq 550
348
EGAD50000000414
RNA sequencing (fastq files) of white blood cells (WBCs) from healthy donors (n=376) and cancer patients (n=421) with different diagnoses, stages of disease and previously administered treatments, was performed. Samples from cancer patients were collected from the BostonGene clinical program; all patients provided written consent per IRB-approved protocols. Blood samples from healthy donors were purchased from multiple collection centers throughout the United States. Whole blood samples (3 ml) in K2-EDTA tubes received within 24 hours of collection at RT underwent red blood cell (RBC) lysis to isolate WBCs. Isolated WBCs for RNA sequencing were centrifuged at 300 x g for 5 minutes with a maximum of 10^6 cells per vial. The supernatant was removed, and the cells were resuspended in cold Homogenization Buffer (2% 1-Thioglycerol, Promega). Samples were then frozen at -80°C until extraction. RNA extraction was performed from frozen samples with Maxwell RSC simplyRNA Cells Kit (Promega) using the benchtop automated Maxwell RSC Instrument (Promega). Libraries were prepared with Illumina TruSeq® Stranded mRNA Library Prep (Poly-A mRNA; stranded). Libraries were sequenced on NovaSeq 6000 as Paired-End Reads (2x150) with targeted coverage of 50 mln reads.
Illumina NovaSeq 6000
797
EGAD50000001822
The dataset, in .bam file format, consists of whole exome sequencing (WES) data of tumors from patients (n=24) with Renal Medullary Carcinoma (RMC), generated using Illumina NovaSeq 6000 sequencing technology. DNA was extracted from FFPE solid tumor samples using the AllPrep DNA FFPE Kit (Qiagen, CA). Libraries from FFPE tissue were prepared with the SureSelect XT HS2 DNA Kit (Agilent, CA) for exome capture. The All Exon V7 exome probe set (Agilent, CA) was used for hybridization and capture of DNA.
Illumina NovaSeq 6000
24