We aim to discover gene regulatory networks that govern heart morphogenesis in the early fetal heart. To this end, we have performed single-nucleus ATAC-seq on cells derived from 2 human hearts at 15 weeks post conception to define the chromatin profiles and genomic regulatory networks for these in vivo specific cardiomyocyte lineages during development, from which the FASTQ files are available through dbGAP. This information will help us understand the process of heart morphogenesis early in development and may further our understanding of how congenital heart disease impacts development.
Clonal CRISPRi/a hiPS cell lines were plated to induce spontaneous differentiation in KSR medium (KSR medium: knockout DMEM/F12 (Gibco, 12660012), 2 mM GlutaMAX, 1× NEAAs, 20% knockout serum replacement (Gibco, 10828010) and 0.1 mM β-mercaptoethanol (Merck-Millipore)). non-TC plates and dishes were used. At the day of plating, KSR medium was supplemented with Y-27632 (10 µM). The next day, medium was replaced with fresh KSR medium. Medium changes were performed every other day. EBs were transferred to 15 ml conical tubes and allowed to sediment at room temperature for 5 min. The supernatant was replaced with fresh KSR medium and cells were transferred onto non-TC treated plates and maintained at 37 °C, 5% CO2. On day 12 of differentiation, the KSR medium was replaced with Essential 6 medium (Gibco, A1516401). EBs were collected and subjected to SUM-seq library preparation as described in Lobato-Moreno et al. 2025 Nature Methods.
We compared 12 pediatric T cell acute lymphoblastic leukemias collected at initial diagnosis and relapse with their corresponding PDX models. The analysis was performed on genomic level (whole genome sequencing (WGS), whole exome sequencing (WES), multiplex ligation probe amplification (MLPA), targeted sequencing) and the epigenetic level (DNA methylation, Assay for Transposase-Accessible Chromatin sequencing (ATAC-seq)). In sum, this study underlines the remarkable genomic stability, and for the first time documents the preservation of the epigenomic landscape in T-ALL-derived PDX models.
This dataset contains unpaired FASTQ files of 316 endometrial cancer cases and 316 matched controls representing circulating RNAs. RNA count files are provided as the output of sncRNA pipeline and represent 12 RNA types (isomiR, lncRNA, precursor miRNA, miRNA, miscRNA, mRNA, piRNA, scaRNA, snoRNA, snRNA, tRF, and tRNA). We also provide metadata of the samples.
DNA methylation data using Illumina 450K
Gene expression analysis from primary human JMML samples using Illumina Human HT-12 v4
TrypanoGEN+ data containing fastq files of 183 samples from DRC, Malawi and Uganda using NextSeq500.
MESA The Multi-Ethnic Study of Atherosclerosis (MESA) is a study of the characteristics of subclinical cardiovascular disease (disease detected non-invasively before it has produced clinical signs and symptoms) and the risk factors that predict progression to clinically overt cardiovascular disease or progression of the subclinical disease. MESA researchers study a diverse, population-based sample of 6,814 asymptomatic men and women aged 45-84. Thirty-eight percent of the recruited participants are white, 28 percent African-American, 22 percent Hispanic, and 12 percent Asian, predominantly of Chinese descent. Participants were recruited from six field centers across the United States: Wake Forest University, Columbia University, Johns Hopkins University, University of Minnesota, Northwestern University and University of California - Los Angeles. Each participant received an extensive physical exam to determine coronary calcification, ventricular mass and function, flow-mediated endothelial vasodilation, carotid intimal-medial wall thickness and presence of echogenic lucencies in the carotid artery, lower extremity vascular insufficiency, arterial wave forms, electrocardiographic (ECG) measures, standard coronary risk factors, sociodemographic factors, lifestyle factors, and psychosocial factors. Selected repetition of subclinical disease measures and risk factors at follow-up visits allows study of the progression of disease. Blood samples have been assayed for putative biochemical risk factors and stored for case-control studies. DNA has been extracted and lymphocytes cryopreserved (for possible immortalization) for study of candidate genes and possibly, genome-wide scanning, expression, and other genetic techniques. Participants are being followed for identification and characterization of cardiovascular disease events, including acute myocardial infarction and other forms of coronary heart disease (CHD), stroke, and congestive heart failure; for cardiovascular disease interventions; and for mortality. In addition to the six Field Centers, MESA involves a Coordinating Center, a Central Laboratory, and Central Reading Centers for Computed Tomography (CT), Magnetic Resonance Imaging (MRI), Ultrasound, and Electrocardiography (ECG). Protocol development, staff training, and pilot testing were performed in the first 18 months of the study. The first examination took place over two years, from July 2000 - July 2002. It was followed by four examination periods that were 17-20 months in length. Participants have been contacted every 9 to 12 months throughout the study to assess clinical morbidity and mortality. MESA Family The general goal of the MESA Family Study, an ancillary study to MESA funded by a grant from NHLBI, is to apply modern genetic analysis and genotyping methodologies to delineate the genetic determinants of early atherosclerosis. This is being accomplished by utilizing all the current organizational structures of the Multi-Ethnic Study of Atherosclerosis (MESA) and Genetic Centers at Cedars-Sinai Medical Center and University of Virginia. In the MESA Family Study, the goal is to locate and identify genes contributing to the genetic risk for cardiovascular disease (CVD), by looking at the early changes of atherosclerosis within families (mainly siblings). 2128 individuals from 594 families, yielding 3,026 sibpairs divided between African Americans and Hispanic-Americans, were recruited by utilizing the existing framework of MESA. MESA Family studied siblings of index subjects from the MESA study and from new sibpair families (with the same demographic characteristics) and is determining the extent of genetic contribution to the variation in coronary calcium (obtained via CT Scan) and carotid artery wall thickness (B-mode ultrasound) in the two largest non-majority U.S. populations. In a small proportion of subjects, parents of MESA index subjects participating in MESA Family were studied but only to have blood drawn for genotyping. The MESA Family cohort was recruited from the six MESA Field Centers. MESA Family participants underwent the same examination as MESA participants during May 2004 - May 2007. DNA was extracted and lymphocytes immortalized for study of candidate genes, genome-wide linkage scanning, and analyzed for linkage with these subclinical cardiovascular traits. While linkage analysis is the primary approach being used, an additional aspect of the MESA Family Study takes advantage of the existing MESA study population for testing a variety of candidate genes for association with the same subclinical traits. Genotyping and data analysis will occur throughout the study. MESA Air The general goal of the Multi-Ethnic Study of Atherosclerosis and Air Pollution ('MESA Air') is to prospectively examine the relation between an individual level assessment of long-term ambient air pollution exposures (including PM2.5 and the progression of subclinical cardiovascular disease in a multi-city, multi-ethnic cohort. MESA Air will also prospectively examine the relationship between an individual level assessment of long-term ambient air pollution exposures and the incidence of cardiovascular disease, including myocardial infarction and cardiovascular death. MESA AIR is funded by a grant from the United States Environmental Protection Agency to the University of Washington and subcontracts from the UW to other participating institutions. MESA Air will assess if ambient air pollution is associated with changes over time in subclinical measures of atherosclerosis and plasma markers of inflammation, oxidative damage, and endothelial activation in a longitudinal data model, adjusting for age, race/ethnicity, socioeconomic status, and specific cardiovascular risk factors (such as diabetes, hypertension, smoking, and diet). The study will similarly assess if the incidence of cardiovascular events is associated with long-term exposure to ambient air pollution, using a proportional hazards model. The study includes refinement of statistical tools, and explores joint/independent effects of acute and long-term pollutant exposure in the occurrence of cardiovascular disease. The MESA Air study is built on the foundation of the ongoing MESA study. The parent MESA Study cohort is located in six geographic areas ('Field Centers') that capture tremendous exposure heterogeneity, comparable to or greater than the variability in locations of prior U.S. cohort studies. In addition to the six Field Centers, the study involves a Coordinating Center, a Central Laboratory, and Reading Centers for Computed Tomography (CT), ultrasound and air pollution data. The cohort for the MESA Air study currently includes 6226 subjects: 5479 enrolled in the parent MESA study; 257 recruited specifically for this study, and 490 recruited from the MESA Family study. The entire MESA Air cohort will be followed over a 10-year project period for the occurrence of cardiovascular disease events. On two occasions over the ten-year study period, 3600 subjects from the MESA Air cohort, residing in nine locales, will undergo computed tomography scanning to assess presence and extent of coronary artery calcification (CAC), and ultrasound of the carotid artery to determine intima-media thickness (IMT). We will also repeatedly assess plasma markers of inflammation, oxidative damage, and endothelial function in 720 subjects. MESA Air adds state-of-the-art air pollution exposure assessment information to the MESA cohort study, and introduces new subjects and outcome measures to achieve our aims. The study will assess long-term individual-level exposure to ambient air pollutants for each subject using community-scale monitoring, outdoor spatial variation, subject proximity to pollution sources, pollutants' infiltration efficiency, and personal time-activity information. The exposure models will be validated using detailed monitoring in a subset of subjects. The MESA Cohort is utilized in the following dbGaP substudies. To view genotypes, analysis, expression data, other molecular data, and derived variables collected in these substudies, please click on the following substudies below or in the "Substudies" box located on the right hand side of this top-level study page phs000209 MESA Cohort. phs000420 MESA SHARe phs000283 MESA CARe phs000403 MESA ESP Heart-GO
This is a test study for EGA using data from 1000 Genomes Project, specifically created to add more diversity to the existing dataset EGAD00001003338. This Test Study should NOT be confused with the real study of 1000 Genomes Project - Phase 3. Specifically, this study encompasses 4 CRAM, 4 BAM, 2 VCF and 2 BCF files, of different sizes (all above 1 GB), used for testing downloads and monitoring services. It also contains their respective indexes: crai for crams, bai for bams, tbi for vcfs and csi for bcfs.
Gut microbiome 16S rRNA raw data for N=7174 FINRISK 2002 participants. FINRISK fecal samples were mailed to the Knight laboratory at the University of California (San Diego, CA), for microbiota sequencing using the standard Earth Microbiome Project protocols (https://earthmicrobiome.org/protocols-and-standards/). DNA was extracted using a magnetic bead-based DNA extraction protocol. Amplicon sequence data for the V4 region of the 16S rRNA gene was generated using 515F (Parada) and 806R (Apprill) primers. For a total of 25μl reaction volume, 13 µl PCR-grade water was combined with 10 µl PCR master mix (Platinum Hot Start PCR Master Mix, 2x, ThermoFisher), 0.5 µl forward primer (10 µM), 0.5 µl reverse primer (10 µM), and 1 µl template DNA. Amplification was performed in triplicate reactions, and triplicate PCR reactions were pooled afterwards. Expected products were visualized on agarose gels (300–350 bp) and quantified with Quant-iT PicoGreen dsDNA kit (Invitrogen). Equal amounts (240 ng) of amplicon were combined for each sample and cleaned (MoBio UltraClean PCR Clean-Up Kit). Cleaned amplicon pools were sequenced with 515F and 806R primers.