Long-read trio sequencing of unsolved patients with intellectual disability
Long-read sequencing (LRS) has the potential to comprehensively identify all medically-relevant genome variation, including variation commonly missed by short read sequencing (SRS) approaches. To determine this potential, we performed LRS up to 40x coverage using the Pacific Biosciences Sequel instrument for 5 patient-parent trios with intellectual disability, whose aetiology remained unresolved after SRS exomes and genomes. A first systematic assessment of LRS coverage showed that ~35Mb of the genome was only accessible by LRS and not SRS. Structural variant (SV) calling yielded on average 28,292 SV calls per individual, totalling 12.9 MB of sequence. Trio-based analyses showed concordance for up to 95% of these SV calls across the genome, and 80% of the LRS SV calls were not identified by SRS. De novo mutation analysis did not identify any de novo SVs, confirming these are rare events. Because of high sequence coverage, we were also able to call single nucleotide substitutions. On average, we identified 3 million substitutions per genome, with a Mendelian inheritance concordance of up to 97%. Of these, approximately 100,000 were located in the ~35Mb of the genome that was only captured by LRS. Moreover, these variants affected the coding sequence of 64 genes, including 32 known to cause Mendelian disorders. Although no clinical diagnoses were made in these 5 patients, our data show the potential added value of LRS compared to SRS for identifying medically relevant genome variation.
- Type: Other
- Archiver: European Genome-Phenome Archive (EGA)
Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data
Dataset ID | Description | Technology | Samples |
---|---|---|---|
EGAD00001006050 | Sequel | 15 |
Publications | Citations |
---|---|
Long-read trio sequencing of individuals with unsolved intellectual disability.
Eur J Hum Genet 29: 2021 637-648 |
23 |