The genomic complexity of early T-cell progenitor acute lymphoblastic leukemia

Study ID    Alternative Stable ID    Type
            phs000340                Whole Genome Sequencing

Study Description


The accurate identification of structural variations using whole-genome DNA sequencing data generated by next-generation sequencing technology is extremely difficult. To address this challenge, we have developed CREST, an algorithm that uses sequencing reads with partial alignments to the reference human genome (so-called soft-clipped reads) to directly map the breakpoints of somatic structural variations. We applied CREST to paired tumor/normal whole-genome sequencing data from five cases of T-lineage acute lymphoblastic leukemia (T-ALL). A total of 110 somatic structural variants were identified, more than 80% of which were validated by genomic PCR and Sanger sequencing. The validated structural variants included 31 inter-chromosomal translocations, 19 intra-chromosomal translocations, one inversion, 22 deletions and 16 insertions. A comparison of the results generated with CREST to those obtained using traditional paired-end discordant mapping methods shows that CREST has much higher sensitivity and specificity. In addition, application of CREST to ...
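The idea of using soft-clipped reads to locate breakpoints can be illustrated with a small sketch. This is not the CREST implementation: it only shows how a soft clip in a SAM-style CIGAR string points at a candidate breakpoint coordinate, and how several reads clipped at the same position can be counted as support. The function names, the `min_support` threshold, and the example reads are all hypothetical, chosen purely for illustration.

```python
import re
from collections import Counter

# CIGAR operations as defined in the SAM format.
CIGAR_RE = re.compile(r"(\d+)([MIDNSHP=X])")
# Operations that advance the position on the reference.
REF_CONSUMING = set("MDN=X")


def soft_clip_breakpoints(pos, cigar):
    """Return (coordinate, side) pairs where a soft clip meets aligned bases.

    pos is the 1-based leftmost aligned reference position (SAM POS).
    """
    ops = [(int(n), op) for n, op in CIGAR_RE.findall(cigar)]
    # Hard clips carry no sequence, so ignore them when looking at read ends.
    ops = [(n, op) for n, op in ops if op != "H"]
    found = []
    if ops and ops[0][1] == "S":
        # Clip on the left: the breakpoint is at the first aligned base.
        found.append((pos, "left"))
    if len(ops) > 1 and ops[-1][1] == "S":
        # Clip on the right: the breakpoint is at the last aligned base.
        ref_len = sum(n for n, op in ops if op in REF_CONSUMING)
        found.append((pos + ref_len - 1, "right"))
    return found


def candidate_breakpoints(reads, min_support=2):
    """Count soft-clip positions across reads and keep well-supported ones.

    reads is an iterable of (chrom, pos, cigar) tuples.
    min_support is an illustrative threshold, not a value from the study.
    """
    support = Counter()
    for chrom, pos, cigar in reads:
        for coord, side in soft_clip_breakpoints(pos, cigar):
            support[(chrom, coord, side)] += 1
    return sorted(
        (chrom, coord, side, n)
        for (chrom, coord, side), n in support.items()
        if n >= min_support
    )


if __name__ == "__main__":
    # Hypothetical reads: two are clipped on the left at chr1:100.
    example_reads = [
        ("chr1", 100, "30S70M"),
        ("chr1", 100, "45S55M"),
        ("chr1", 31, "70M30S"),
        ("chr1", 500, "100M"),
        ("chr2", 200, "10S90M"),
    ]
    for hit in candidate_breakpoints(example_reads):
        print(hit)
```

A real caller would go further than this sketch, for example by assembling the clipped sequences and realigning them to find the partner breakpoint, and by comparing tumor against matched normal to keep only somatic events.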

Archive Link    Archive Accession
dbGaP           phs000340

Who archives the data?

There are no publications available