Computational approach to discriminate human and mouse sequences in patient-derived tumour xenografts

Dataset ID       Technology                                Samples
EGAD00001003800  Illumina HiSeq 2500, Illumina HiSeq 4000  12

Whole-exome sequencing was performed on a dilution series containing known proportions of human and mouse DNA (human/mouse): 3x 100/0, 2x 90/10, 3x 50/50, 2x 25/75 and 3x 0/100. A set of breast cancer clinical samples with matched normal tissue and matched patient-derived tumour xenografts (PDTXs; total number = 14) was also analysed. Paired-end 75 bp reads for the dilution series and paired-end 125 bp reads for the clinical samples were obtained on an Illumina HiSeq 2500; FASTQ files are provided. The transcriptome was also analysed in triplicate by RNA-seq for the Universal Human RNA Reference and the Universal Mouse RNA Reference samples; paired-end 150 bp FASTQ files obtained on an Illumina HiSeq 4000 are provided.
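
For readers who want to script against the dilution series, the design described above could be encoded roughly as follows. This is a minimal sketch, not part of the dataset: it assumes each ratio is read as human/mouse (following the first entry, "100% human 0% mouse"), and the names `DILUTION_SERIES`, `expand_samples` and the sample labels are hypothetical and do not correspond to file names in the archive.

```python
# Dilution series as described: (percent human DNA, percent mouse DNA, count).
# Assumption: every "a/b" ratio is human/mouse, as in the first entry.
DILUTION_SERIES = [
    (100, 0, 3),
    (90, 10, 2),
    (50, 50, 3),
    (25, 75, 2),
    (0, 100, 3),
]


def expand_samples(series):
    """Return one record per sample with its expected species fractions."""
    samples = []
    for human, mouse, count in series:
        for index in range(1, count + 1):
            samples.append({
                # Illustrative label only; not the dataset's real sample ID.
                "label": f"H{human}_M{mouse}_{index}",
                "human_fraction": human / 100,
                "mouse_fraction": mouse / 100,
            })
    return samples


samples = expand_samples(DILUTION_SERIES)
for sample in samples:
    print(sample["label"], sample["human_fraction"], sample["mouse_fraction"])
```

The expected fractions can then serve as ground truth when checking how well a read-classification step separates human from mouse sequences.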