Long-read trio sequencing of unsolved patients with intellectual disability

Study ID Alternative Stable ID Type
EGAS00001004319 Other

Study Description

Long-read sequencing (LRS) has the potential to comprehensively identify all medically-relevant genome variation, including variation commonly missed by short read sequencing (SRS) approaches. To determine this potential, we performed LRS up to 40x coverage using the Pacific Biosciences Sequel instrument for 5 patient-parent trios with intellectual disability, whose aetiology remained unresolved after SRS exomes and genomes. A first systematic assessment of LRS coverage showed that ~35Mb of the genome was only accessible by LRS and not SRS. Structural variant (SV) calling yielded on average 28,292 SV calls per individual, totalling 12.9 MB of sequence. Trio-based analyses showed concordance for up to 95% of these SV calls across the genome, and 80% of the LRS SV calls were not identified by SRS. De novo mutation analysis did not identify any de novo SVs, confirming these are rare events. Because of high sequence coverage, we were also able to call single nucleotide substitutions. On average, we identified 3 million substitutions per genome, with a Mendelian inheritance concordance ... (Show More)

Study Datasets 1 dataset.

Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data

Dataset ID Description Technology Samples
5 trios were whole genome sequenced with PacBio Sequel to a depth of 15X (Trios 1-4) or 40X (Trio 5). For each trio the child was affected with severe ID, and the parents were unaffected. Dataset consists of Trio 2 samples: T2P, T2F and T2M
Sequel 15

Who archives the data?