Systematic comparative analysis of single-nucleotide variants detection methods from single-cell RNA sequencing data

Study ID Alternative Stable ID Type
EGAS00001003883 Other

Study Description

Systematic interrogation of single nucleotide variations (SNVs) is one of the most promising approaches to delineate the cellular heterogeneity and phylogenetic relationships at the single cell level. While SNV detection from abundant single cell RNA sequencing (scRNA-seq) data is applicable and cost-effective in identifying expressed variants, inferring sub-clones, and deciphering genotype-phenotype linkages, there is a lack of computational methods specifically developed for SNV calling in scRNA-seq. Although variant callers for bulk RNA-seq have been sporadically used in scRNA-seq, the performances of different tools have not been assessed. Here, we performed a systematic comparison of seven tools including SAMtools, the GATK pipeline, CTAT, FreeBayes, MuTect2, Strelka2 and VarScan2, using both simulation and scRNA-seq datasets, and identified multiple elements influencing their performance. Our study provided the first benchmarking to evaluate the performances of different SNV detection tools for scRNA-seq data.

Study Datasets 1 dataset.

Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data

Dataset ID Description Technology Samples
In this study, we performed systematic comparative analysis of seven widely-used SNV-calling methods, including SAMtools, the GATK Best Practices pipeline, CTAT, FreeBayes, MuTect2, Strelka2 and VarScan2, on both simulated and real single-cell RNA-seq datasets. We generated SMART-seq2 data for 70 CD45- single cells, which were derived from two colorectal cancer patients (P0411 and P0413). The average sequencing depths of these cells were 1.4 million reads per cell. We also generated tumor and ... (Show More)
Illumina HiSeq 4000 75

Who archives the data?