Copied to clipboard!

RNA sequencing and Illumina 2.5M SNP array data collected from 675 commonly used human cancer cell lines.

Tumor-derived cell lines have served as vital models to advance our understanding of oncogene function and therapeutic response1. Although substantial effort has been directed to defining the genomic constitution of cancer cell line panels2–4, the transcriptome – which represents the active program of a cell – remains understudied. Here, we describe RNA sequencing and SNP array analysis of 675 commonly used human cancer cell lines. We explore numerous transcriptome features including coding and non-coding gene expression, transcribed mutations, gene fusion and expression of non-human sequences. Aside from many known aberrations we find new surprising characteristics, including more than 2200 unique fusion gene pairs representing a vast, testable repertoire of oncogenic fusions, many of which have analogs found in primary human tumors. We show that a combination of multiple genome and transcriptome features in a novel pathway-based approach enhances prediction of response to various targeted therapeutics. Our results provide valuable new insights into these critical pre-clinical models and provide added context for interpreting the numerous studies that employ these widely used cell lines.

Type: Other
Archiver: European Genome-phenome Archive (EGA)

3 Datasets 21 Publications

Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data

Dataset ID	Description	Technology	Samples
EGAD00001000725	This dataset contains RNA sequencing data for 675 cancer cell lines. RNA libraries were made with the TruSeq RNA Sample Preparation kit (Illumina) according to the manufacturer protocol. The libraries were sequenced on an Illumnia HiSeq 2000	Illumina HiSeq 2000	675
EGAD00001001013	RNAseq and exome sequencing data of gastric cancer cell lines.	Illumina HiSeq 2000	30
EGAD00010000951	SNP array data for 668 cancer cell lines	Illumina 2.5M	668

Publications	Citations
A comprehensive transcriptional portrait of human cancer cell lines. Klijn C, Durinck S, Stawiski EW, Haverty PM, Jiang Z, Liu H, Degenhardt J, Mayba O, Gnad F, Liu J, Pau G, Reeder J, Cao Y, Mukhyala K, Selvaraj SK, Yu M, Zynda GJ, Brauer MJ, Wu TD, Gentleman RC, Manning G, Yauch RL, Bourgon R, Stokoe D, Modrusan Z, Neve RM, de Sauvage FJ, Settleman J, Seshagiri S, Zhang Z. Nat Biotechnol 33: 2015 306-312	509
Human biosample authentication using the high-throughput, cost-effective SNPtrace(TM) system. Liang-Chu MM, Yu M, Haverty PM, Koeman J, Ziegle J, Lee M, Bourgon R, Neve RM. PLoS One 10: 2015 e0116218	38
Metabolite profiling stratifies pancreatic ductal adenocarcinomas into subtypes with distinct sensitivities to metabolic inhibitors. Daemen A, Peterson D, Sahu N, McCord R, Du X, Liu B, Kowanetz K, Hong R, Moffat J, Gao M, Boudreau A, Mroue R, Corson L, O'Brien T, Qing J, Sampath D, Merchant M, Yauch R, Manning G, Settleman J, Hatzivassiliou G, Evangelista M. Proc Natl Acad Sci U S A 112: 2015 E4410-7	300
TCLP: an online cancer cell line catalogue integrating HLA type, predicted neo-epitopes, virus and gene expression. Scholtalbers J, Boegel S, Bukur T, Byl M, Goerges S, Sorn P, Loewer M, Sahin U, Castle JC. Genome Med 7: 2015 118	75
Modeling the integration of bacterial rRNA fragments into the human cancer genome. Sieber KB, Gajer P, Dunning Hotopp JC. BMC Bioinformatics 17: 2016 134	4
A gene expression signature of retinoblastoma loss-of-function is a predictive biomarker of resistance to palbociclib in breast cancer cell lines and is prognostic in patients with ER positive early breast cancer. Malorni L, Piazza S, Ciani Y, Guarducci C, Bonechi M, Biagioni C, Hart CD, Verardo R, Di Leo A, Migliaccio I. Oncotarget 7: 2016 68012-68022	110
Gene isoforms as expression-based biomarkers predictive of drug response in vitro. Safikhani Z, Smirnov P, Thu KL, Silvester J, El-Hachem N, Quevedo R, Lupien M, Mak TW, Cescon D, Haibe-Kains B. Nat Commun 8: 2017 1126	43
Transcription Factor Activities Enhance Markers of Drug Sensitivity in Cancer. Garcia-Alonso L, Iorio F, Matchan A, Fonseca N, Jaaks P, Peat G, Pignatelli M, Falcone F, Benes CH, Dunham I, Bignell G, McDade SS, Garnett MJ, Saez-Rodriguez J. Cancer Res 78: 2018 769-780	156
The cancer cell proteome and transcriptome predicts sensitivity to targeted and cytotoxic drugs. Rydenfelt M, Wongchenko M, Klinger B, Yan Y, Blüthgen N. Life Sci Alliance 2: 2019 e201900445	9
Network-based method for drug target discovery at the isoform level. Ma J, Wang J, Ghoraie LS, Men X, Liu L, Dai P. Sci Rep 9: 2019 13868	6
B cells extract antigens at Arp2/3-generated actin foci interspersed with linear filaments. Roper SI, Wasim L, Malinova D, Way M, Cox S, Tolar P. Elife 8: 2019 e48093	36
A bioinformatics approach to identify novel long, non-coding RNAs in breast cancer cell lines from an existing RNA-sequencing dataset. Zaheed O, Samson J, Dean K. Noncoding RNA Res 5: 2020 48-59	11
The cell line A-to-I RNA editing catalogue. Schaffer AA, Kopel E, Hendel A, Picardi E, Levanon EY, Eisenberg E. Nucleic Acids Res 48: 2020 5849-5858	53
LncRBase V.2: an updated resource for multispecies lncRNAs and ClinicLSNP hosting genetic variants in lncRNAs for cancer patients. Das T, Deb A, Parida S, Mondal S, Khatua S, Ghosh Z. RNA Biol 18: 2021 1136-1151	17
Molecular and cellular characterization of two patient-derived ductal carcinoma in situ (DCIS) cell lines, ETCC-006 and ETCC-010. Samson J, Derlipanska M, Zaheed O, Dean K. BMC Cancer 21: 2021 790	10
BK Channel in the Physiology and in the Cancer of Pancreatic Duct: Impact and Reliability of BK Openers. Zuccolini P, Gavazzo P, Pusch M. Front Pharmacol 13: 2022 906608	9
Transposon-activated POU5F1B promotes colorectal cancer growth and metastasis. Simó-Riudalbas L, Offner S, Planet E, Duc J, Abrami L, Dind S, Coudray A, Coto-Llerena M, Ercan C, Piscuoglio S, Andersen CL, Bramsen JB, Trono D. Nat Commun 13: 2022 4913	20
An intrinsic purine metabolite AICAR blocks lung tumour growth by targeting oncoprotein mucin 1. Aftab F, Rodriguez-Fuguet A, Silva L, Kobayashi IS, Sun J, Politi K, Levantini E, Zhang W, Kobayashi SS, Zhang WC. Br J Cancer 128: 2023 1647-1664	7
Investigating the suitability of <i>in vitro</i> cell lines as models for the major subtypes of epithelial ovarian cancer. McCabe A, Zaheed O, McDade SS, Dean K. Front Cell Dev Biol 11: 2023 1104514	26
Non-canonical integrin signaling activates EGFR and RAS-MAPK-ERK signaling in small cell lung cancer. Rubio K, Romero-Olmedo AJ, Sarvari P, Swaminathan G, Ranvir VP, Rogel-Ayala DG, Cordero J, Günther S, Mehta A, Bassaly B, Braubach P, Wygrecka M, Gattenlöhner S, Tresch A, Braun T, Dobreva G, Rivera MN, Singh I, Graumann J, Barreto G. Theranostics 13: 2023 2384-2407	36
Systematic transcriptional analysis of human cell lines for gene expression landscape and tumor representation. Jin H, Zhang C, Zwahlen M, von Feilitzen K, Karlsson M, Shi M, Yuan M, Song X, Li X, Yang H, Turkez H, Fagerberg L, Uhlén M, Mardinoglu A. Nat Commun 14: 2023 5417	93