A standardised framework for robust fragmentomic feature extraction from cell-free DNA sequencing data
Fragmentomic features of cell-free DNA represent promising non-invasive biomarkers for cancer diagnosis. However, a lack of systematic evaluation of biases in feature quantification has hindered the adoption of such applications. We compared features derived from whole-genome sequencing of ten healthy donors using nine library kits and ten data-processing routes, and validated them in 1,182 plasma samples from published studies. Our results clarify the variations resulting from library preparation and feature quantification methods. We designed the Trim Align Pipeline and the cfDNAPro R package as unified interfaces for data pre-processing, feature extraction, and visualisation, aiming to standardise multimodal feature engineering and integration for machine learning.
- Type: Other
- Archiver: European Genome-Phenome Archive (EGA)
Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data
| Dataset ID | Description | Technology | Samples |
|---|---|---|---|
| EGAD00001015535 | Illumina NovaSeq 6000 | 82 |
| Publications | Citations |
|---|---|
|
A standardized framework for robust fragmentomic feature extraction from cell-free DNA sequencing data.
Genome Biol 26: 2025 141 |
4 |
