Genome-wide cell-free DNA termini in patients with cancer
|Study ID||Alternative Stable ID||Type|
The structure, fragmentation pattern, length and terminal sequence of cell-free DNA (cfDNA) is under the influence of nucleases present in the blood. We hypothesized that differences in the diversity of bases at the end of cfDNA fragments can be leveraged on a genome-wide scale to enhance the sensitivity for detecting the presence of tumor signals in plasma. We surveyed the cfDNA termini in 72 plasma samples from 319 patients with 18 different cancer types using low-coverage whole genome sequencing. The fragment-end sequence and diversity were altered in all cancer types in comparison to 76 healthy controls. We converted the fragment end sequences into a quantitative metric and observed that this correlates with circulating tumor DNA tumor fraction (R = 0.58, p < 0.001, Spearman). Using these metrics, we were able to classify cancer samples from control at a low tumor content (AUROC of 91% at 1% tumor fraction) and shallow sequencing coverage (mean AUROC = 0.99 at >1M fragments). Combining fragment-end sequences and diversity using machine learning, we classified cancer from ... (Show More)
Study Datasets 1 dataset.
Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data
The dataset contains 295 plasma cfDNA samples from various stages of resectable esophageal adenocarcinoma from the PERFECT cohort and the nCRT cohort. Shallow WGS was performed on an Illumina Novaseq S4 PE150bp. Samples are provided as raw reads without any prior processing.
|Illumina NovaSeq 6000||295|