Click on a Dataset ID in the table below to learn more, and to find
out who to contact about access to these data
Dataset ID
Description
Technology
Samples
EGAD00001008776
Cellular deconvolution algorithms virtually reconstruct tissue composition by analyzing the gene expression of complex tissues. Here, we present the decision tree machine learning algorithm, Kassandra, trained on a broad collection of > 9,400 tissue and blood sorted cell RNA profiles to accurately reconstruct the tumor microenvironment (TME). Bioinformatic correction for technical and biological variability, aberrant cancer cell expression inclusion, and the accurate quantification and normalization of transcript expression increased the stability and robustness of Kassandra. Performance was validated on over 4,000 H&E tissue slides and more than 1,000 normal and tumor tissues by comparison with cytometric, immunohistochemical or single-cell RNA-seq measurements. Kassandra accurately deconvolved stromal and immune elements of blood and tumors, revealing the role of the TME in tumor pathogenesis. Digital TME reconstruction revealed that the presence of PD1-positive CD8+ T cells strongly correlated with immunotherapy response and increased the predictive potential of established biomarkers, indicating that Kassandra could potentially be utilized in future clinical applications.
Illumina NovaSeq 6000
NextSeq 550
348