The aim of CAGEKID is to carry out comprehensive detection of DNA markers for conventional (clear cell) renal carcinoma. The project includes complete analysis of somatic and constitutional DNA variation, methylation patterns and expression in a large number of constitutional/tumor pairs. CAGEKID is a part of the International Cancer Genome Consortium, ICGC.
Short read whole genome sequencing (WGS) CRAM files for the NIHR BioResource Rare Diseases WGS project – Participants from the Pulmonary Arterial Hypertension (PAH) Rare Disease domain
This dataset contains RNA sequencing data for 675 cancer cell lines. RNA libraries were made with the TruSeq RNA Sample Preparation kit (Illumina) according to the manufacturer protocol. The libraries were sequenced on an Illumnia HiSeq 2000
The dataset entails 48 RRBS libraries of 24 siblings. 24 individuals are conceived during the Dutch Famine, a severe 6 month famine at the end of World War 2. A same sex sibling was added as a control, allowing partial matching for (early) familial environment and genetics.
Data represent genome-wide DNA methylation profiles obtained by MethylCap-seq (Diagenode’s MethylCap-kit based purification followed by Illumina GAIIx sequencing), for 70 brain tissue samples, including 65 glioblastoma samples and 5 non-tumoral tissues (obtained from epilepsy surgery).
All samples from the "100" project
Genomic data from FFPE tumor samples of 258 patients with non-small cell lung cancer (NSCLC), profiled by shallow whole-genome sequencing (sWGS) at 1x. The study aimed to characterize somatic copy number alterations (SCNAs) and SCNA burden to assess their clinical relevance for patient stratification across different first-line treatments.
These data have been used to produce cancer incidence results for https://pubmed.ncbi.nlm.nih.gov/37181409/ , https://pubmed.ncbi.nlm.nih.gov/40231433/ and https://plsd.eu as available in March 2026. The tables contain demographic information at individual level, individual cancer events with given age, as well as prophylactic and therapeutic surgeries to remove organs. The data have been de-identified to ensure anonymity.
RNA-seq and scRNA/TCR-seq data from patient-derived fragments (PDFs) analysed in publication by R. Vendramin, H. Fu, S. Fernandez Patel, Y. Zhao et al.
Five US studies using the Mg tolerance test, the "gold standard" test of Mg status, indicated that >50% participants had Mg deficiency. In our ongoing US trial, we have found a similar result. In growing recognition of the importance of Mg in human health, very recently, Mg was selected by the US Federal Dietary Reference Intake (DRI) Committee to update the DRI. In US and other populations at high risk of Mg deficiency, high Mg intake has been linked to reduced risk of colorectal neoplasia, insulin resistance, metabolic syndrome, type 2 diabetes, and cardiovascular disease (CVD). Conversely, in populations not at high risk of Mg deficiency, high Mg intake has been related to increased risk of total mortality (e.g. total cancer, colorectal cancer, and CVD), particularly when calcium intake is low. These observations suggest that the associations between high Mg intake and disease risks may completely differ by the underlying Mg status. Due to major limitations, the Mg tolerance test is not used in conventional clinical practice and rarely used in research. Instead, serum Mg is used for clinical diagnosis. However, serum Mg performs very poorly at identifying those with Mg deficiency. There is a great need to develop implementable, sensitive, and specific biomarkers which can be easily used for identifying people with Mg deficiency. Without such a marker, it is impossible to develop effective strategies that minimize adverse effects by targeting those who, as a result of Mg deficiency, are at risk of common diseases. It is known that DNA methylation changes are inducible by environmental exposures, including nutrients, and reversible when the exposure disappears. There are two major types of cytosine modifications in DNA, 5-hydroxymethylcytosine (5-hmC) and 5-methylcytosine (5-mC). 5-mC is often associated with suppressed gene expression. 5-hmC, generated by the oxidation of 5-mC, is specifically enriched in expressed genes and plays a critical role in activating and/or maintaining gene expression. However, current epigenomic profiling platforms cannot distinguish 5-hmC from 5-mC. Just recently, our collaborator, Dr. Chuan He (HHMI Investigator), established a novel and state of the art technique, TAB-Seq and TAB-Array protocol, which can separate 5-hmC from 5-mC in the genome. Our pilot studies indicate that Mg intake or treatment significantly affects methylation capacity, 5-mC and 5-hmC biomarkers. Our findings suggest blood leukocyte 5-hmC/5-mC biomarkers may serve as more sensitive biomarkers to identify Mg deficiency than serum Mg. Thus, a comprehensive epigenome-wide association study (EWAS) may help to identify the most sensitive 5-hmC/ 5- mC biomarkers for Mg deficiency. We propose to identify 5-hmC/5-mC biomarkers for Mg deficiency by a 4- phase EWAS study in the "Personalized Prevention of Colorectal Cancer Trial [PPCCT, R01CA149633; PI, Dai and Yu]" with a total of 240 participants. Mg tolerance test will be used as the gold standard. Finally, using newly identified biomarkers, we will evaluate if 12-week Mg treatment reduces TRPM7 expression, essential in Mg homeostasis and colorectal carcinogenesis, in rectal tissues only among those with Mg deficiency.