Recently, significant progress has been made in characterizing and sequencing the genomic alterations in statistically robust numbers of samples from several types of cancer. For example, The Cancer Genome Atlas (TCGA), International Cancer Genome Consortium (ICGC) and other similar efforts are identifying genomic alterations associated with specific cancers (e.g., copy number aberrations, rearrangements, point mutations, epigenomic changes, etc.). The availability of these multi-dimensional data to the scientific community sets the stage for the development of new molecularly targeted cancer interventions. Understanding the comprehensive functional changes in cancer proteomes arising from genomic alterations and other factors is the next logical step in the development of high-value candidate protein biomarkers. Hence, proteomics can greatly advance the understanding of molecular mechanisms of disease pathology via the analysis of changes in protein expression, their modifications and variations, as well as protein-protein interaction, signaling pathways and networks responsible for cellular functions such as apoptosis and oncogenesis. Realizing this great potential, the NCI launched the second phase of the CPTC initiative in September 2011. Renamed the Clinical Proteomic Tumor Analysis Consortium, CPTAC is beginning to leverage its analytical outputs from Phase I to define cancer proteomes on genomically-characterized biospecimens. The purpose of this integrative approach is to provide the broad scientific community with knowledge that links genotype to proteotype and ultimately phenotype. The data contained in this dataset are derived from samples designed to confirm CPTAC findings from the TCGA samples. These confirmatory samples contain breast, ovarian, colon, and lung tumors collected via a protocol optimized for proteomics. Specifically, ischemic time of the sample was controlled and restricted to less than 30 minutes. ACGT, Inc. produced whole exome, mRNAseq, and miRNAseq for these samples. Corresponding proteomic data are available at: https://cptac-data-portal.georgetown.edu/cptacPublic/ The study design was to profile colon, breast, ovarian, and lung tumors both genomically and proteomically. Germline DNA was obtained from blood. Normal control samples for proteomics varied by organ site: adjacent colon tissue for colon cases, contralateral breast tissue for some breast cases, and Fallopian tube fimbria for some ovarian cases. Lung cases had no normal control for proteomic analysis. All cancer samples were derived from primary and untreated tumors.
This DAC will be created for sequencing data for the manuscript "Resolving spatial subclonal genomic heterogeneity of loss of heterozygosity and extrachromosomal DNA in gliomas." The dataset includes Illumina-based exome sequencing of tumor specimens, paired blood specimens, bulk RNA-seq, and Visium FFPE.
WTCCC genome-wide case-control association study for Hypertension (HT) using the 1958 British Birth Cohort and the UK National Blood Service collections as controls.
This dataset is generated for EpiHK. It contains Histone PTM ChIP-seq, WGBS, RNA-seq of Human Hepatocellular Carcinoma and tumor adjacent normal tissue sample.
This data set contains scRNA-seq data from MPN patients and healthy individuals. It is accompanied by amplicon data for the JAK2V617F mutation. For each individual an INFa treated and untreated sample is included.
ATAC-seq data for 2 sample(s) for osteoclast from venous blood, on Genome GRCh38. 2 run(s), 2 experiment(s), 2 alignment(s). Part of BLUEPRINT (September 2016).
ATAC-seq data for 2 sample(s) for plasma cell from tonsil, on Genome GRCh38. 2 run(s), 2 experiment(s), 2 alignment(s). Part of BLUEPRINT (September 2016).
CGMH data access committee for the CGMH-OCCC-WES data