H9 human embryonic stem cells (hESCs) were cultured in feeder-free chemically-defined conditions in medium containing 10ng/ml Activin A and 12ng/ml FGF2 (Vallier L. 2011, Methods in Molecular Biology, 690: 57-66). Chromatin immunoprecipitation was performed as described in Brown S. et al. 2011. Stem Cells 29: 1176-85 by using 5ug of anti-DPY30 antibody (Sigma, cat. number HPA043761). This protocol was performed in control hESCs (expressing a scrambled shRNA) and in hESCs stably expressing an shRNA against DPY30 (Sigma, clone n. TRCN0000131112).This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Hepatocellular carcinoma (HCC) is a heterogeneous aggressive malignancy with low efficacy of current therapies at advanced stages. We integrated molecular and pharmacological profiling of a large panel of liver cancer cell lines (LCCL) to assess their clinical relevance as HCC preclinical models and identify new effective therapies and biomarkers of response. Here, we performed multi-omic analysis including whole-exome, RNA and microRNA sequencing in a series 34 LCCL. Molecular profiles of LCCL and primary HCC were compared and we searched for molecular features associated with drug response. Our panel of LCCL faithfully recapitulated the most aggressive molecular “proliferation class” of HCC.
Recurrent breast cancer is almost universally fatal. We characterize 170 patients locally relapsed or distant metastatic cancers using massively parallel sequencing. We identify that the relapse-seeding clone disseminates late from the primary tumor. TP53 and AKT1 appear to be enriched in ER-positive cancers predisposed to relapse. Mutation acquisition continues at relapse as the same mutation signatures continue to operate and new signatures, such as that caused by radiotherapy appear de novo. In 49% of cases we identify drivers mutations private to the relapse and these are sampled from a wider range of cancer genes, including SWI-SNF complex and JAK-STAT signaling.
The Cleveland Family Study is the largest family-based study of sleep apnea world-wide, consisting of 2284 individuals (46% African American) from 361 families studied on up to 4 occasions over a period of 16 years. The study was begun in 1990 with the initial aims of quantifying the familial aggregation of sleep apnea. NIH renewals provided expansion of the original cohort (including increased minority recruitment) and longitudinal follow-up, with the last exam occurring in February 2006. Index probands (n=275) were recruited from 3 area hospital sleep labs if they had a confirmed diagnosis of sleep apnea and at least 2 first-degree relatives available to be studied. In the first 5 years of the study, neighborhood control probands (n=87) with at least 2 living relatives available for study were selected at random from a list provided by the index family and also studied. All available first degree relatives and spouses of the case and control probands also were recruited. Second-degree relatives, including half-sibs, aunts, uncles and grandparents, were also included if they lived near the first degree relatives (cases or controls), or if the family had been found to have two or more relatives with sleep apnea. Blood was sampled and DNA isolated for participants seen in the last two exam cycles (n=1447). The sample, which is enriched with individuals with sleep apnea, also contains a high prevalence of individuals with sleep apnea-related traits, including: obesity, impaired glucose tolerance, and HTN. Phenotyping data have been collected over 4 exam cycles, each occurring ~every 4 years. The last three exams targeted all subjects who had been studied at earlier exams, as well as new minority families and family members of previously studied probands who had been unavailable at prior exams. Data from one, two, three and four visits are available for 412, 630, 329 and 67, participants, respectively. In the first 3 exams, participants underwent overnight in-home sleep studies, allowing determination of the number and duration of hypopneas and apneas, sleep period, heart rate, and oxygen saturation levels; anthropometry (weight, height, and waist, hip, and neck circumferences); resting blood pressure; spirometry; standardized questionnaire evaluation of symptoms, medications, sleep patterns, quality of life, daytime sleepiness measures and health history; venipuncture and measurement of total and HDL cholesterol. The 4th exam (2001-2006) was designed to collect more detailed measurements of sleep, metabolic and CVD phenotypes and included measurement of state-of-the-art polysomnography, with both collection of blood and measurement of blood pressure before and after sleep, and anthropometry, upper airway assessments, spirometry, exhaled nitric oxide, and ECG performed the morning after the sleep study. Data have been collected by trained research assistants or GCRC nurses following written Manuals of Procedures who were certified following standard approaches for each study procedure. Ongoing data quality, with assessment of within or between individual drift, has been monitored on an ongoing basis, using statistical techniques as well as regular re-certification procedures. Between and within scorer reliabilities for key sleep apnea indices have been excellent, with intra-class correlation coefficients (ICCs) exceeding 0.92 for the apnea-hypopnea index (AHI). Sleep staging, assessed with epoch specific comparisons, also demonstrate excellent reliability for stage identification (kappas>0.82). There has been no evidence of significant time trends-between or within scorers- for the AHI variables. We also have evaluated the night-to-night variability of the AHI and other sleep variables in 91 subjects, with each measurement made 1-3 months apart. There is high night to night consistency for the AHI (ICC: 0.80), the arousal index (0.76), and the % sleep time in slow-wave sleep (0.73). We have demonstrated the comparability of the apnea estimates (AHI) determined from limited channel studies obtained at in-home settings with in full in-laboratory polysomnography. In addition to our published validation study, we more recently compared the AHI in 169 Cleveland Family Study participants undergoing both assessments (in-home and in-laboratory) within one week apart. These showed excellent levels of agreement (ICC=0.83), demonstrating the feasibility of examining data from either in-home or in-laboratory studies for apnea phenotyping. Data collected in the GCRC were obtained, when possible, with comparable, if not identical techniques, as were the same measures collected at prior exams performed in the participants' homes. To address the comparability of data collected over different exams, we calculated the crude age-adjusted correlations ~3 year within individual correlations between measures made in the most recent GCRC exam with measures made in a prior exam and demonstrated excellent levels of agreement for BMI (r=.91); waist circumference (0.91); FVC (0.88); and FEV1 (0.86). As expected due to higher biological and measurement variability, 149 somewhat lower 3-year correlations were demonstrated for SBP (0.56); Diastolic BP (0.48); AHI (0.62); and nocturnal oxygen desaturation (0.60). NHLBI Candidate-gene Association Resource. The NHLBI initiated the Candidate gene Association Resource (CARe) to create a shared genotype/phenotype resource for analyses of the association of genotypes with phenotypes relevant to the mission of the NHLBI. The resource comprises nine cohort studies funded by the NHLBI: Atherosclerosis Risk in Communities (ARIC), Cardiovascular Health Study (CHS), Cleveland Family Study (CFS), Coronary Artery Risk Development in Young Adults (CARDIA), Cooperative Study of Sickle Cell Disease (CSSCD), Framingham Heart Study (FHS), Jackson Heart Study (JHS), Multi-Ethnic Study of Atherosclerosis (MESA), and the Sleep Heart Health Study (SHHS). A database of genotype and phenotype data will be created that includes records for approximately 50,000 study participants with approximately 50,000 SNPs from more than 1,200 selected candidate genes. In addition, a genome wide association study using a 1,000K SNP Chip will be conducted on approximately 9,500 African American participants drawn from the 50,000 participants in the nine cohorts. Some relevant CARe publications CARe Study: PMID 20400780 CVD Chip Design: PMID 18974833
This dataset includes raw data (.idat) for the Illumina Human450k beadchip and methylation levels (.txt files). Methylation level were treated for normalization and background substraction. We removed probes with at least one of the following characteristics: (1) weak signal (p > 0.01) (2128 CpG sites), (2) SNP-enriched sites (4100 sites), (3) out of a CpG context (not on a CG) (3149 sites), or (4) located on sex chromosomes (11,129 sites). A total of 465,071 CpG sites were analyzed initially. Signal was then normalized, first by scaling to the internal controls using the methylumi R package, then by applying the method of subset-quantile within array normalization (SWAN) implemented in the minfi R package.
Dataset includes scRNA-sequencing data of cells isolated by dissociation of ~20 BJ WT day 20 hiPSC-derived MN embryoid bodies generated in ULA 96w plates following the protocol described in Rodriguez-Muela et al., JCI 2018). Droplet-based scRNA-seq was performed using 10X Genomics scRNA-sequencing was performed using 10X Genomics Chromium Single Cell Kit v3. The resulting sequencing reads were processed as described in Buchner et al 2024. Analysis of the count matrices was performed using the R toolkit for single cell genomics, Seurat v5.1.0, with R v3.3.3 “Angel Food Cake.”
scRNA-seq was performed on the BD Rhapsody platform. ILCs (ILC1, ILC2, ILC3) and NK cells were bulk sorted from fresh cord bloods of 3 donors. The 4 samples containing ILCs and NK cells from each donor were stained with Sampletag antibodies (12 samples in total) before Abseq staining with the BD Immune Discovery panel and 6 additional antibodies was performed to better classify immune cell subsets. The sorted population were pooled before loading onto the BD Rhapsody cartridge. The manufacturer's protocol was followed for library construction and samples were sequenced on a Illumina NovaSeq 6000.
This dataset includes all FASTQ files for 11 samples where different capture-based methods for transcriptome profiling have been tested. Specifically, we have the 'traditional' RNA-seq experiment with fresh frozen (FF) material, and 3 different capture methods for the matching formalin-fixed paraffin-embedded samples: Agilent (AGI), IDT, and Twist Biosciences (TBS). In total, there are 43 samples with paired-end FASTQ files (1 sample did not have sufficient material to test all methods). Samples are identified by the R01-R11 IDs with a suffix that indicates the capture method used (or FF for fresh frozen).
Whole-exome sequencing data (Agilent SureSelectXT Human All Exon V7). Retrospective study of matched pairs of initial and post-therapeutic GBM cases treated with temozolomide+radiotherapy with a recurrence period greater than one year. Matched normal, initial and post-therapeutic samples for 27 patients and 1 patient (GBM046) with a matched normal and two post-therapeutic samples.
Deep targeted sequencing of 56 genes associated with clonal haematopoiesis and haematological malignancy in peripheral blood-derived DNA from 385 older adults, each sampled 2-5 times over ~13 years.