Need Help?

PCR-free HiSeqX whole genome sequence data on 120 samples with triplet repeat expansions (premutation and full expansions)

Whole genome sequence (WGS) data was generated on 120 Coriell samples with the following validated repeat expansions: Fragile X Syndrome, Huntington disease, Friedreich’s ataxia, Amyotrophic Lateral Sclerosis, Myotonic Dystrophy, Spinocerebellar Ataxia 1/3 and Dentatorubral-Pallidoluysian Atrophy. These samples were sequenced using 2x150bp reads on an Illumina HiSeqX sequencer and the repeat expansions were called using ExpansionHunter to demonstrate the ability to call large repeats from high throughput, PCR-free WGS data.

Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data

Dataset ID Description Technology Samples
EGAD00001003562 HiSeq X Ten 118
Publications Citations
Detection of long repeat expansions from PCR-free whole-genome sequence data.
Genome Res 27: 2017 1895-1903
203
Profiling the genome-wide landscape of tandem repeat expansions.
Nucleic Acids Res 47: 2019 e90
112
Genome-wide sequencing as a first-tier screening test for short tandem repeat expansions.
Genome Med 13: 2021 126
25
Detection of repeat expansions in large next generation DNA and RNA sequencing data without alignment.
Sci Rep 12: 2022 13124
7
STRling: a k-mer counting approach that detects short tandem repeat expansions at known and novel loci.
Genome Biol 23: 2022 257
21
Large-Scale Whole Genome Sequence Analysis of >22,000 Subjects Provides no Evidence of <i>FMR1</i> Premutation Allele Involvement in Autism Spectrum Disorder.
Genes (Basel) 14: 2023 1518
0
RExPRT: a machine learning tool to predict pathogenicity of tandem repeat loci.
Genome Biol 25: 2024 39
4
A comparison of software for analysis of rare and common short tandem repeat (STR) variation using human genome sequences from clinical and population-based samples.
PLoS One 19: 2024 e0300545
3