Tumor Genomics Committee - EGAC00001000987

DAC ID: EGAC00001000987
Contact Person: Janne Ravantti
Email: dac-tumorgenomics [at] helsinki [dot] fi
Access Information: No additional information is available

This DAC controls 4 datasets:

Dataset ID: EGAD00001004281
Description: The dataset contains somatic point mutation data from the exome-targeted region of 36 exome- or whole-genome-sequenced microsatellite-unstable colorectal cancers, and somatic point mutation data from 93 additional MiSeq-sequenced microsatellite-unstable colorectal cancers.
Technology: (none listed)
Samples: 129

Dataset ID: EGAD00001004329
Description: Somatic mutations from 256 whole-genome-sequenced colorectal tumors: 234 MSS, 19 MSI, and 3 POLE mutants. See Katainen R. et al. CTCF/cohesin-binding sites are frequently mutated in cancer, Nature Genetics 2015. doi:10.1038/ng.3335
Technology: (none listed)
Samples: 256

Dataset ID: EGAD00001004884
Description: The data consist of 47 exome-sequenced synchronous colorectal cancers from 23 patients. The exomes of the corresponding normal samples were used to remove germline variants. All patients are Finnish (white Caucasian). All patients except one (sync_11, who belongs to a Lynch syndrome (LS) family) were assumed to be sporadic cases. The sequence data were produced with an Illumina HiSeq 4000.
Technology: (none listed)
Samples: 47

Dataset ID: EGAD00001006572
Description: The dataset contains somatic variants from 344 colorectal cancer samples. Variants were called with Mutect2 (GRCh38). Important: the VCF files also include variants annotated as "str_contraction" and "panel_of_normals". Please use only "PASS" variants in studies that are not related to microsatellite repeats. Samples were sequenced on NovaSeq 6000, HiSeq 2000, and HiSeq X Ten instruments (average coverage depth ~30+). The dataset consists of 257 MSS, 58 MSI, 25 MSS IBD, and 4 POLE mutant CRCs.
Technology: (none listed)
Samples: 344
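
The "PASS"-only recommendation for EGAD00001006572 could be applied with a small filter like the one below. This is a minimal sketch, not part of the dataset's own tooling. It assumes the "str_contraction" and "panel_of_normals" labels appear in the standard VCF FILTER column (column 7), which the description does not state explicitly. The function names `pass_only` and `filter_vcf` and any file paths are hypothetical.

```python
import gzip
from typing import Iterable, Iterator


def pass_only(lines: Iterable[str]) -> Iterator[str]:
    """Yield VCF header lines and records whose FILTER column is exactly PASS."""
    for line in lines:
        if line.startswith("#"):
            # Keep meta-information lines and the column header line.
            yield line
            continue
        fields = line.rstrip("\n").split("\t")
        # Assumption: the annotation labels live in FILTER (index 6).
        if len(fields) > 6 and fields[6] == "PASS":
            yield line


def filter_vcf(src_path: str, dst_path: str) -> None:
    """Write a PASS-only copy of a plain or gzip-compressed VCF file."""
    opener = gzip.open if src_path.endswith(".gz") else open
    with opener(src_path, "rt") as src, open(dst_path, "w") as dst:
        dst.writelines(pass_only(src))
```

For studies of microsatellite repeats, the non-PASS records may be the ones of interest, so a filter like this would be skipped or adapted there.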