We aim to discover gene regulatory networks that govern heart morphogenesis in the early fetal heart. To this end, we have performed single-nucleus ATAC-seq on cells derived from 2 human hearts at 15 weeks post conception to define the chromatin profiles and genomic regulatory networks for these in vivo specific cardiomyocyte lineages during development, from which the FASTQ files are available through dbGAP. This information will help us understand the process of heart morphogenesis early in development and may further our understanding of how congenital heart disease impacts development.
Clonal CRISPRi/a hiPS cell lines were plated to induce spontaneous differentiation in KSR medium (KSR medium: knockout DMEM/F12 (Gibco, 12660012), 2 mM GlutaMAX, 1× NEAAs, 20% knockout serum replacement (Gibco, 10828010) and 0.1 mM β-mercaptoethanol (Merck-Millipore)). non-TC plates and dishes were used. At the day of plating, KSR medium was supplemented with Y-27632 (10 µM). The next day, medium was replaced with fresh KSR medium. Medium changes were performed every other day. EBs were transferred to 15 ml conical tubes and allowed to sediment at room temperature for 5 min. The supernatant was replaced with fresh KSR medium and cells were transferred onto non-TC treated plates and maintained at 37 °C, 5% CO2. On day 12 of differentiation, the KSR medium was replaced with Essential 6 medium (Gibco, A1516401). EBs were collected and subjected to SUM-seq library preparation as described in Lobato-Moreno et al. 2025 Nature Methods.
We compared 12 pediatric T cell acute lymphoblastic leukemias collected at initial diagnosis and relapse with their corresponding PDX models. The analysis was performed on genomic level (whole genome sequencing (WGS), whole exome sequencing (WES), multiplex ligation probe amplification (MLPA), targeted sequencing) and the epigenetic level (DNA methylation, Assay for Transposase-Accessible Chromatin sequencing (ATAC-seq)). In sum, this study underlines the remarkable genomic stability, and for the first time documents the preservation of the epigenomic landscape in T-ALL-derived PDX models.
In this study, linked read sequencing was performed on two ovarian metastases and matched normal tissue, from a patient with primary diffuse gastric cancer. Linked read sequencing is a DNA preparation technology whereby high molecular weight molecules of DNA are uniquely barcoded prior to fragmentation and sequencing, thus retaining information about genomic contiguity. This study performed an extended analysis of linked read sequencing data to resolve the complex structures of structural variants in the cancer genomes. Complex structural rearrangements were identified in the genomic region surrounding the known oncogene FGFR2, and the association between FGFR2 and gastric cancer metastasis was demonstrated in an organoid model.
This dataset contains unpaired FASTQ files of 316 endometrial cancer cases and 316 matched controls representing circulating RNAs. RNA count files are provided as the output of sncRNA pipeline and represent 12 RNA types (isomiR, lncRNA, precursor miRNA, miRNA, miscRNA, mRNA, piRNA, scaRNA, snoRNA, snRNA, tRF, and tRNA). We also provide metadata of the samples.
Dataset contains mRNA capture sequencing data from plasma of 180 different human donors: 132 prostate cancer patients (PRAD) and 48 cancer-free controls. Samples were sequenced on a NovaSeq X and sequencing data is provided in FASTQ format.
The dataset contains 30 primary bladder cancer patients, 28 recurrent bladder cancer patients and 31 hematuria controls. Shallow WGS of cell-free DNA enzymatic methyl sequencing libraries was performed on an Illumina Novaseq S4 PE150bp. Samples are provided as raw reads without any prior processing.
low-pass Whole Genome Sequencing (lpWGS) to assess the representativeness of seven body liquids from female patients with metastatic breast cancer.
Whole Exome Sequencing (WES) to explore the mutational status of seven body liquids from female patients with metastatic breast cancer.
Noninvasive detection of cancer-associated genome-wide hypomethylation and copy number aberrations by plasma DNA bisulfite sequencing
We propose to definitively characterise the somatic genetics of Osteosarcoma cancer through generation of comprehensive catalogues of somatic mutations by high coverage genome sequencing.
MESA The Multi-Ethnic Study of Atherosclerosis (MESA) is a study of the characteristics of subclinical cardiovascular disease (disease detected non-invasively before it has produced clinical signs and symptoms) and the risk factors that predict progression to clinically overt cardiovascular disease or progression of the subclinical disease. MESA researchers study a diverse, population-based sample of 6,814 asymptomatic men and women aged 45-84. Thirty-eight percent of the recruited participants are white, 28 percent African-American, 22 percent Hispanic, and 12 percent Asian, predominantly of Chinese descent. Participants were recruited from six field centers across the United States: Wake Forest University, Columbia University, Johns Hopkins University, University of Minnesota, Northwestern University and University of California - Los Angeles. Each participant received an extensive physical exam to determine coronary calcification, ventricular mass and function, flow-mediated endothelial vasodilation, carotid intimal-medial wall thickness and presence of echogenic lucencies in the carotid artery, lower extremity vascular insufficiency, arterial wave forms, electrocardiographic (ECG) measures, standard coronary risk factors, sociodemographic factors, lifestyle factors, and psychosocial factors. Selected repetition of subclinical disease measures and risk factors at follow-up visits allows study of the progression of disease. Blood samples have been assayed for putative biochemical risk factors and stored for case-control studies. DNA has been extracted and lymphocytes cryopreserved (for possible immortalization) for study of candidate genes and possibly, genome-wide scanning, expression, and other genetic techniques. Participants are being followed for identification and characterization of cardiovascular disease events, including acute myocardial infarction and other forms of coronary heart disease (CHD), stroke, and congestive heart failure; for cardiovascular disease interventions; and for mortality. In addition to the six Field Centers, MESA involves a Coordinating Center, a Central Laboratory, and Central Reading Centers for Computed Tomography (CT), Magnetic Resonance Imaging (MRI), Ultrasound, and Electrocardiography (ECG). Protocol development, staff training, and pilot testing were performed in the first 18 months of the study. The first examination took place over two years, from July 2000 - July 2002. It was followed by four examination periods that were 17-20 months in length. Participants have been contacted every 9 to 12 months throughout the study to assess clinical morbidity and mortality. MESA Family The general goal of the MESA Family Study, an ancillary study to MESA funded by a grant from NHLBI, is to apply modern genetic analysis and genotyping methodologies to delineate the genetic determinants of early atherosclerosis. This is being accomplished by utilizing all the current organizational structures of the Multi-Ethnic Study of Atherosclerosis (MESA) and Genetic Centers at Cedars-Sinai Medical Center and University of Virginia. In the MESA Family Study, the goal is to locate and identify genes contributing to the genetic risk for cardiovascular disease (CVD), by looking at the early changes of atherosclerosis within families (mainly siblings). 2128 individuals from 594 families, yielding 3,026 sibpairs divided between African Americans and Hispanic-Americans, were recruited by utilizing the existing framework of MESA. MESA Family studied siblings of index subjects from the MESA study and from new sibpair families (with the same demographic characteristics) and is determining the extent of genetic contribution to the variation in coronary calcium (obtained via CT Scan) and carotid artery wall thickness (B-mode ultrasound) in the two largest non-majority U.S. populations. In a small proportion of subjects, parents of MESA index subjects participating in MESA Family were studied but only to have blood drawn for genotyping. The MESA Family cohort was recruited from the six MESA Field Centers. MESA Family participants underwent the same examination as MESA participants during May 2004 - May 2007. DNA was extracted and lymphocytes immortalized for study of candidate genes, genome-wide linkage scanning, and analyzed for linkage with these subclinical cardiovascular traits. While linkage analysis is the primary approach being used, an additional aspect of the MESA Family Study takes advantage of the existing MESA study population for testing a variety of candidate genes for association with the same subclinical traits. Genotyping and data analysis will occur throughout the study. MESA Air The general goal of the Multi-Ethnic Study of Atherosclerosis and Air Pollution ('MESA Air') is to prospectively examine the relation between an individual level assessment of long-term ambient air pollution exposures (including PM2.5 and the progression of subclinical cardiovascular disease in a multi-city, multi-ethnic cohort. MESA Air will also prospectively examine the relationship between an individual level assessment of long-term ambient air pollution exposures and the incidence of cardiovascular disease, including myocardial infarction and cardiovascular death. MESA AIR is funded by a grant from the United States Environmental Protection Agency to the University of Washington and subcontracts from the UW to other participating institutions. MESA Air will assess if ambient air pollution is associated with changes over time in subclinical measures of atherosclerosis and plasma markers of inflammation, oxidative damage, and endothelial activation in a longitudinal data model, adjusting for age, race/ethnicity, socioeconomic status, and specific cardiovascular risk factors (such as diabetes, hypertension, smoking, and diet). The study will similarly assess if the incidence of cardiovascular events is associated with long-term exposure to ambient air pollution, using a proportional hazards model. The study includes refinement of statistical tools, and explores joint/independent effects of acute and long-term pollutant exposure in the occurrence of cardiovascular disease. The MESA Air study is built on the foundation of the ongoing MESA study. The parent MESA Study cohort is located in six geographic areas ('Field Centers') that capture tremendous exposure heterogeneity, comparable to or greater than the variability in locations of prior U.S. cohort studies. In addition to the six Field Centers, the study involves a Coordinating Center, a Central Laboratory, and Reading Centers for Computed Tomography (CT), ultrasound and air pollution data. The cohort for the MESA Air study currently includes 6226 subjects: 5479 enrolled in the parent MESA study; 257 recruited specifically for this study, and 490 recruited from the MESA Family study. The entire MESA Air cohort will be followed over a 10-year project period for the occurrence of cardiovascular disease events. On two occasions over the ten-year study period, 3600 subjects from the MESA Air cohort, residing in nine locales, will undergo computed tomography scanning to assess presence and extent of coronary artery calcification (CAC), and ultrasound of the carotid artery to determine intima-media thickness (IMT). We will also repeatedly assess plasma markers of inflammation, oxidative damage, and endothelial function in 720 subjects. MESA Air adds state-of-the-art air pollution exposure assessment information to the MESA cohort study, and introduces new subjects and outcome measures to achieve our aims. The study will assess long-term individual-level exposure to ambient air pollutants for each subject using community-scale monitoring, outdoor spatial variation, subject proximity to pollution sources, pollutants' infiltration efficiency, and personal time-activity information. The exposure models will be validated using detailed monitoring in a subset of subjects. The MESA Cohort is utilized in the following dbGaP substudies. To view genotypes, analysis, expression data, other molecular data, and derived variables collected in these substudies, please click on the following substudies below or in the "Substudies" box located on the right hand side of this top-level study page phs000209 MESA Cohort. phs000420 MESA SHARe phs000283 MESA CARe phs000403 MESA ESP Heart-GO
Matched Ovarian Cancer Sequencing
ER-, HER2-, PR- breast Cancer genome sequencing