Error-corrected flow-based sequencing at whole genome scale and its application to circulating cell-free DNA profiling
Differentiating sequencing errors from true variants is a central genomics challenge calling for error suppression strategies that balance costs when sparse material or ultra-rare variants require high-accuracy methods. For example, circulating cell-free DNA (ccfDNA) sequencing for cancer monitoring is limited by sparsity of circulating tumor DNA (ctDNA), abundance of genomic material in samples, library preparation error rates and sequencing errors. Whole-genome sequencing (WGS) can overcome the low abundance of ccfDNA by integrating signal across the mutation landscape, but higher cost limits adoption compared with targeted panels, where ccfDNA abundance limits maximal coverage depths. Here we applied deep (~120x) lower-cost WGS (Ultima Genomics) with analytic error correction for tumor-informed ctDNA detection within the part-per-million range. We further developed high-coverage duplex error-corrected WGS of ccfDNA, achieving 7.7x10-8 error rates, to assess disease burden in melanoma and urothelial cancer patients without matched tumor sequencing. Together, we show that deeply sequenced error-corrected WGS accurately calls somatic variants, demonstrating the feasibility of WGS for tumor-informed and tumor-agnostic ctDNA detection.
- Type: Whole Genome Sequencing
- Archiver: European Genome-Phenome Archive (EGA)
Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data
Dataset ID | Description | Technology | Samples |
---|---|---|---|
EGAD50000001234 | HiSeq X Ten unspecified | 93 |