Raw fastq files from WGS sequencing of CLL and matching blood normal for the ICGC Techval Benchmark1 study. Sequence data was provided to multiple centers for independent analysis and comparison.
Genetic risk scores (GRS) quantify polygenic disease risk into a single measure and can aid in disease classification. GRS studies have focused primarily on adult populations of recent European ancestry. We aimed to assess the utility of GRS in classification of diabetes type among racially/ethnically diverse youth in the United States. We used data from SEARCH baseline visits and follow up visits for which key data elements were available. 2260 participants are included. Phenotypic Data to be submitted here include: Self-reported sex, race and ethnicity Diabetes type determined by provider Etiologic diabetes type T1D (antibody positive, or antibody negative/missing and insulin sensitive) T2D (antibody negative and insulin resistant) Diabetes duration (years) Age at diagnosis (years) Measures from baseline, SEARCH 3 (follow-up) and SEARCH 4 (follow-up): Age Height Weight Waist circumference HbA1c Fasting/non-fasting glucose C-peptide Triglycerides HDL LDL Systolic Blood Pressure Diastolic Blood Pressure Measures from SEARCH 3 and 4:Arterial stiffness as characterized by PWVRetinopathy Measures from SEARCH 4 only: LV structure and diastolic function measurements from echocardiogram GAD autoantibody IAD autoantibody ZnT8 autoantibody We performed genotyping using the Illumina Multi-Ethnic Global Array (MEGA) array with 1,697,069 genotyped variants including 748,291 with minor allele frequency <0.01. Genotyping and preliminary quality control checks were performed at the Colorado Center for Personalized Medicine. After additional quality control, 2,238 samples and 900,743 variants remained for analyses. Samples genotyped on the MEGA array were categorized using SEARCH etiologic type and consisted of predominantly type 1 diabetes cases (n=2,051) but also those with other diabetes (n=133 type 2 diabetes, n=52 other diabetes including monogenic diabetes; genetic confirmation with either a genetic clinical test or test performed as an ancillary study to SEARCH). The median reported age at DNA collection was 11.2 years (interquartile range 7.6 – 14.1) with a minimum age of 1.9 years and maximum of 21.9 years. We used additional data genotyped on the Affymetrix 500K imputation scaffold chip with 239,279 genotyped variants. This cohort consisted of predominantly type 2 diabetes cases (n=417) but also those with type 1 (n=104), and other diabetes types (n=16). After additional quality control, 537 samples and 235,967 variants remained. The median reported age at collection was 11.2 years (quartile range 8.1 – 14.2) with a minimum of 2.0 years and a maximum of 21.1 years. About 100 type 2 diabetes cases were also genotyped using the MEGA chip to ensure concordance between the data sets and facilitate strand alignment between the two chips before genotypic imputation. The data sets had n=230,228 genotyped variants in common and concordance between the genotypes was high (mean correlation r2 for SNPs used in GRS=0.95). We used the 1000 Genomes reference panel to impute each data set separately, resulting in a total of 34.5M and 27.8M well-imputed variants (r2>0.8) for the MEGA and Affymetrix data sets respectively. We combined high quality imputed variants for analysis.
This postmortem study examines molecular, genetic and epigenetic signatures in the brains of hundreds of subjects with or without mental disorders conducted by the DIRP NIMH Human Brain Collection Core (HBCC). The brain tissues are obtained under protocols approved by the CNS IRB (NCT00001260), with the permission of the next-of-kin (NOK) through the Offices of the Chief Medical Examiners (MEOs) in the District of Columbia, Northern Virginia and Central Virginia. Additional samples were obtained from the University of Maryland Brain and Tissue Bank (contracts NO1-HD-4-3368 and NO1-HD-4-3383) (http://www.medschool.umaryland.edu/btbank/ and the Stanley Medical Research Institute: http://www.stanleyresearch.org/brain-research/). Clinical characterization, neuropathological screening, toxicological analyses, and dissections of various brain regions were performed as previously described (Lipska et al. 2006; PMID: 16997002). All patients met DSM-IV criteria for a lifetime Axis I diagnosis of psychiatric disorders including schizophrenia or schizoaffective disorder, bipolar disorder and major depression. Controls had no history of psychiatric diagnoses or addictions. SNP array: Array-based genotyping was performed on most samples published in this collection. The number of SNPs assayed via Illumina chips varied between 650,000 and 5 Million. Cerebellar tissue was generally used for genotyping studies. # Diagnosis SNP Array 1 Anxiety Disorder 1 2 Autism Spectrum Disorder 13 3 Bipolar Disorder 114 4 Control 387 5 Eating Disorder (ED) 2 6 Major Depressive Disorder (MDD) 186 7 Obsessive Compulsive Disorder (OCD) 5 8 Post-Traumatic Stress Disorder (PTSD) 0 9 Schizophrenia 220 10 Other 7 11 Tic Disorder 3 12 Undetermined 1 13 Williams Syndrome 2 Table: Numbers of samples in each diagnostic category. DNA extraction: 45-80 mg of cerebellar tissue was pulverized for DNA extractions. The QIAamp DNA mini Kit (Qiagen) method was employed for tissue DNA extraction. The tissue was initially lysed using Tissue Lyser (Qiagen) and extractions were accomplished according to manufacturer's protocol. The DNA was captured in 500uL elution buffer. The concentrations were measured using Thermo Scientific's NanoDrop 1000/NanoDrop ONE. The mean yield was 128.85 uG (+/- 79.48), the mean ratio of 260/280 was 1.87 (+/- 0.105), and the mean ratio of 260/230 was 2.48 (+/-1.75). Genotyping methods: Three types of Illumina Beadarray chips were used: HumanHap650Y, Human1M-Duo, and HumanOmni5M-Quad (San Diego, California). The genotyping was done according to the manufacturer's protocol (Illumina Proprietary, Catalog # WG-901-5003, Part # 15025910 Rev.A, June 2011). Approximately, 400ng DNA was used and each DNA sample was QC tested for 260/280 ratio by nanodrop and DNA band intactness on 2% agarose gel. Briefly, the samples were whole-genome amplified, fragmented, precipitated and resuspended in appropriate hybridization buffer. Denatured samples were hybridized on prepared Bead Array Chips. After hybridization, the Bead Chip oligonucleotides were extended by a single fluorescent labeled base, which was detected by fluorescence imaging with an Illumina Bead Array Reader, iScan. Normalized bead intensity data obtained for each sample were loaded into the Illumina Genome Studio (Illumina, v.2.0.3) with cluster position files provided by Illumina, and fluorescence intensities were converted into SNP genotypes. Microarray: We generated RNA expression data using array technology for psychiatric subjects compared to non-psychiatric subjects as controls. We used tissues from three different brain regions i.e. hippocampus, dorsolateral prefrontal cortex (DLPFC), and dura mater for a large cohort of individuals (total number 552 subjects for hippocampus, 800 for DLPFC and 146 for dura). Total RNA was extracted from ~100 mg of tissue using the RNeasy kit (Qiagen) according to the manufacturer's protocol. RNA quality and quantity were examined using the Bioanalyzer (Agilent, Inc) and NanoDrop (Thermo Scientific, Inc), respectively. Samples with RNA integrity number (RIN) # Diagnosis DLPFC Hippo Dura 1 Anxiety Disorder 1 0 0 2 Autism Spectrum Disorder 14 6 0 3 Bipolar Disorder 90 49 0 4 Control 336 270 75 5 Eating Disorder (ED) 2 1 0 6 Major Depressive Disorder (MDD) 144 87 0 7 Obsessive Compulsive Disorder (OCD) 5 3 0 8 Post-Traumatic Stress Disorder (PTSD) 6 0 0 9 Schizophrenia 192 125 71 10 Other 5 6 0 11 Tic Disorder 3 3 0 12 Undetermined 1 1 0 13 Williams Syndrome 2 1 0 Table: Numbers of samples in each diagnostic category. RNA-Seq of Dorso-lateral prefrontal cortex: All brains were collected and the dorsolateral prefrontal cortical (DLPFC) samples dissected at the HBCC, DIRP, NIMH. Dorsolateral prefrontal cortex (DLPFC) specimens were dissected from right or left hemisphere of frozen coronal slabs. The study was funded by the DIRP, NIMH under contract (#HHSN 271201400099C) with Icahn School of Medicine at Mount Sinai,1106402 One Gustave L. Levy Place, Box 3500, New York NY 10029-6574. RNA extraction, library preparation and sequencing were performed under contract at Icahn School of Medicine. The Common Mind Consortium (CMC) provided project management support. RNA isolation: Total RNA from 468 HBCC samples was isolated from approximately 100 mg homogenized tissue from each sample by TRIzol/chloroform extraction and purification with the Qiagen RNeasy kit (Cat#74106) according to manufacturer's protocol. Samples were processed in randomized batches of 12. The order of extraction for schizophrenia, bipolar, and MDD disorders and control samples was assigned randomly with respect to diagnosis and all other sample characteristics. The mean total RNA yield was 24.2 ug (+/- 9.0). The RNA Integrity Number (RIN) was determined by 4200 Agilent TapeStation System. Samples with RIN DLPFC RNA-Seq quantified expression data are provided for 364 samples. Data were generated, QC'd, processed and quantified as follows: RNA library preparation and sequencing: All samples submitted to the New York Genome Center for RNAseq were prepared for sequencing in randomized batches of 94. The sequencing libraries were prepared using the KAPA Stranded RNAseq Kit with RiboErase (KAPA Biosystems). rRNA was depleted from 1ug of RNA using the KAPA RiboErase protocol that is integrated into the KAPA Stranded RNAseq Kit. The insert size and DNA concentration of the sequencing library was determined on Fragment Analyzer Automated CE System (Advanced Analytical) and Quant-iT PicoGreen (ThermoFisher) respectively. Schizophrenia Bipolar Control 89 65 210 Table: Numbers of samples in each diagnostic category. RNA-Seq of subgenual anterior cingulate cortex (sgACC): All the 200 post-mortem brain samples (61 controls; 39 bipolar disorder; 46 schizophrenia; 54 major depressive disorder) were collected by the HBCC, DIRP, NIMH. RNA Extraction and Quality Assessment: Tissue from sgACC was pulverized and stored at -80°C. Total RNA was extracted from 50-80 mg of the tissue using QIAGEN RNeasy Lipid Tissue Mini Kit (QIAGEN, Cat. # 74804) with DNase treatment (QIAGEN, Cat. # 79254). The RNA Integrity Number (RIN) for each sample was assessed with high-resolution capillary electrophoresis on the Agilent Bioanalyzer 2100 (Agilent Technologies, Palo Alto, California). The concentration of RNA and their 260/280 ratio (2.1+/- 0.032 SD) were determined with NanoDrop (Thermo Scientific). RNA sequencing: Stranded RNA-Seq libraries were constructed after rRNA depletion using Ribo-Zero GOLD (Illumina). RNA sequencing was performed at National Institute of Health Intramural Sequencing Center (NISC). Schizophrenia Bipolar Control MDD 46 39 61 54 Table: Numbers of samples in each diagnostic category. Whole Genome Sequencing: All brains were collected and dissected at the HBCC, DIRP, NIMH. This study generates whole genome sequencing data using sequencing of DNA in the dorsolateral prefrontal cortex (DLPFC), anterior cingulate cortex (ACC) or cerebellum of 443 individuals with schizophrenia, bipolar disorder and major depressive disorder and non-psychiatric controls. The study was funded by the DIRP, NIMH under contract (#HHSN 271201400099C) with Icahn School of Medicine at Mount Sinai,1106402 One Gustave L. Levy Place, Box 3500, New York NY 10029-6574. DNA extraction, library preparation and sequencing were performed under contract at Icahn School of Medicine. The Common Mind Consortium (CMC) provided project management support. All specimens were dissected from right or left hemisphere of frozen coronal slabs. DNA Library Preparation and Sequencing: All samples submitted to the New York Genome Center for WGS were prepared for sequencing in randomized batches of 95. The sequencing libraries were prepared using the Illumina PCR-free DNA sample preparation Kit. The insert size and DNA concentration of the sequencing library was determined on Fragment Analyzer Automated CE System (Advanced Analytical) and Quant-iT PicoGreen (ThermoFisher) respectively. A quantitative PCR assay (KAPA), with primers specific to the adapter sequence, was used to determine the yield and efficiency of the adaptor ligation process. Performed on the Illumina HiSeqX with 30X coverage. Schizophrenia Bipolar Control 115 78 230 Table: Numbers of samples in each diagnostic category. ChIP-Seq: All brains were collected and the dorsolateral prefrontal cortical (DLPFC) samples dissected at the HBCC, DIRP, NIMH. This study generates epigenetic data using sequencing of DNA after chromatin immunoprecipitation (ChIP-Seq) for marks H3K4me3 and H3K27ac in the dorsolateral prefrontal cortex (DLPFC). Dorsolateral prefrontal cortex (DLPFC) specimens were dissected from right or left hemisphere of frozen coronal slabs. The study was funded by the DIRP, NIMH under contract (#HHSN 271201400099C) with Icahn School of Medicine at Mount Sinai,1106402 One Gustave L. Levy Place, Box 3500, New York NY 10029,6574. Chromatin precipitation, library preparation and sequencing were performed under contract at Icahn School of Medicine. The Common Mind Consortium (CMC) provided project management support. Chromatin immunoprecipitation (ChIP) assays for histone marks H3K4me3 and H3K27ac were carried out using Native ChIP. Micrococcal Nuclease (MNase) (Sigma, N3755) treatment was used to digest chromatin into mononucleosomes. The following antibodies were used for chromatin pull-down: anti-H3K4me3 (Cell Signaling, Cat# 9751BC, lot 7) and anti-H3K27ac (Active Motif, Cat# 39133, Lot # 31814008). Histone modification-enriched genomic DNA fragments were recovered using Protein A/G magnetic beads (Thermo Scientific, 88803-88938 or Millipore 16-663), and then washed, eluted, and treated with RNAse A and proteinase K. Final ChIP DNA products were isolated using phenol-chloroform extraction followed by ethanol precipitation. The efficiency of each ChIP assay was validated using Qubit concentration measurement and qPCR for positive (GRIN2B, DARPP32) and negative (HBB) control genomic regions. Only ChIP assays that passed quality control were further processed for library preparation and sequencing; this included ChIP DNA that was not detectable on Qubit but showed a good signal and expected enrichment patterns in qPCR. HISTONE_MARK H3K27ac H3K4me3 Input Bipolar 56 4 7 Control 158 11 24 Schizophrenia 79 11 12 Table: Numbers of individuals in each assay grouped by histone mark or input.Long-Read Whole-Genome Sequencing (WGS) Cohort Description: Brain specimens were obtained from the Human Brain Collection Core (HBCC), part of the NIH NeuroBioBank. Samples were collected under protocols approved by the NIH CNS Institutional Review Board (IRB) (NCT03092687), with informed consent from next-of-kin (NOK). Collection was coordinated through the Offices of the Chief Medical Examiners (MEOs) in Washington, D.C., Northern Virginia, and Central Virginia. Clinical metadata and documentation are publicly available via the NIMH Data Archive (NDA) (Collection #3151) https://nda.nih.gov/edit_collection.html?id=3151 Eligibility Criteria No clinical diagnosis of major neuropsychiatric or neurodegenerative diseaseNo diagnosis of cognitive impairment during life All individuals were confirmed to be neurologically normal at time of deathDemographics Initial cohort size: 155 individuals Ancestry: All individuals self-identified as African or African-admixed Mean age at death: 44.2 years (range: 18–85 years) Sex distribution: 36.4% femaleSample Processing: Frozen frontal cortex tissue was dissected and processed according to the public protocol: https://www.protocols.io/view/processing-human-frontal-cortex-brain-tissue-for-p-kxygxzmmov8j/v2. High-molecular-weight DNA was extracted and libraries were prepared using the Oxford Nanopore Technologies (ONT) LSK-114 kit. Sequencing was performed using ONT PromethION flow cells (R10.4.1 chemistry) Data Processing and Quality Control: Basecalling: Conducted using Guppy v6.38 Read Alignment: Reads were aligned to the GRCh38 reference genome using minimap2 Sample Identity Verification: Sample identity was validated by comparing ONT-derived SNP calls with matched short-read WGS genotypes to ensure concordance and prevent sample swaps Variant Calling and Phasing: Reads were base-called with Guppy v6.38. Reads were aligned to GRCh38 using minimap2. We verified sample identity by cross-checking ONT SNV calls with the existing short-read WGS genotypes, confirming no sample switches. The napu pipeline (https://github.com/nanoporegenomics/napu_wf) produced; haplotype-resolved assemblies, joint small-variant (SNV/indel) calls, and multi-caller structural-variant sets, all reported on GRCh38 and phased where possible. Raw signal data were basecalled to obtain 5-methyl-cytosine (5mC) status; methylation tags were added to the phased BAM files. Genome-wide methylation summaries are provided in BED format.Dataset Filtering and Exclusions: All 155 samples underwent sequencing and SNP-based ancestry inference 8 samples were excluded due to ancestry inconsistent with African or African-admixed background 1 sample was excluded due to insufficient sequencing quality Final Sample Set: 146 high-quality samples from individuals of African or African-admixed ancestry were retained for downstream analyses See PMID: 39764002 for further analysis detailsDiagnosis#SamplesControl155Table: Diagnostic Summary.Note: The data derived from HBCC resources were removed from dbGAP and are now available in the NIMH Data Archive (NDA). They include genotypes, short read whole genome sequencing (WGS), epigenetics (DNA methylation, ChIP-seq for histones), RNA expression (qPCR, microarray, RNA-seq, single nucleus RNA-seq) of various brain regions in cases with schizophrenia, bipolar disorder, major depression, substance use disorders and normative controls. Please access our NDA collection (https://nda.nih.gov/edit_collection.html?id=3151) for further detail.
We conducted a prospective cohort study of adults (RESERVE-U-C19) admitted to Entebbe Regional Referral Hospital (ERRH), a national COVID-19 referral center, across three viral variant-driven phases of the pandemic in Uganda. In this cohort, which reflects the entire spectrum of SARS-CoV-2 infection, we integrated clinical data with serum proteomics (N=306) and whole-blood transcriptomics (N=100) to determine a range of host responses associated with COVID-19 severity, SARS-CoV-2/HIV co-infection, and circulating viral variants. Beyond the prognostic importance of myeloid cell-driven immune activation and lymphopenia, we show that multifaceted impairment of host protein synthesis and extensive redox imbalance define core metabolic signatures of severe COVID-19, with central roles for IL-7, IL-15, and lymphotoxin-α in COVID-19 respiratory failure. While prognostic signatures were generally consistent in SARS-CoV-2/HIV-coinfection, type I interferon responses uniquely scaled with COVID-19 severity in persons living with HIV. Throughout the pandemic, COVID-19 severity peaked during phases dominated by A.23/A.23.1 and Delta B.1.617.2/AY variants, with Delta phase COVID-19 distinguished by exaggerated pro-inflammatory myeloid cell and inflammasome activation, NK and CD8+ T-cell depletion, impaired host protein synthesis, and upregulation of viral cell entry and trafficking pathways. Data from this study available through dbGaP include raw RNA-Seq data from whole-blood of adult participants (N=100).
Rhabdoid tumors (RTs) predominantly affect young children and are among the deadliest pediatric solid tumors. Despite multimodal therapy consisting of surgery, radiation, and chemotherapy, children with these tumors have median survival of less than one year. RTs can arise throughout the body, including the central nervous system (CNS) where they are called atypical teratoid rhabdoid tumors (AT/RTs), and in extra-CNS locations such as the kidneys and other soft tissues where they are designated malignant RTs (MRTs). We previously identified MDM2 and MDM4 as therapeutic vulnerabilities in RTs and showed that treatment with the MDM2 inhibitor idasanutlin increased survival in mice bearing MRT xenografts. However, the therapeutic potential of idasanutlin in CNS RT tumors is unknown. Single agent therapies are prone to resistance and show limited clinical benefit on their own. Therefore, we sought to identify combination strategies incorporating idasanutlin that would be effective in both CNS and extra-CNS RTs. Here, we show that the XPO1 inhibitor selinexor increased nuclear retention of p53 and potentiated idasanutlin-induced p53 pathway activation and cytotoxicity in AT/RT and MRT cell lines in vitro. Importantly, combination therapy limited acquired resistance through TP53 mutation. In vivo, combination therapy was well-tolerated, reduced tumor burden, and increased survival in orthotopic models of both AT/RT and MRT. Our results demonstrate that combining idasanutlin with selinexor is a promising therapeutic strategy for children with rhabdoid tumors.
In developed countries, ~10% of individuals are exposed to systemic chemotherapy for cancer and other diseases. Many chemotherapeutic agents act by increasing DNA damage in cancer cells, hence triggering cell death. However, there is limited understanding of the extent and consequences of collateral DNA damage to normal tissues. To investigate the impact of chemotherapy on mutation burdens and cell population structure of a normal tissue we sequenced blood cell genomes from 23 individuals, aged 3-80 years, treated with a range of chemotherapy regimens. Substantial additional mutation loads with characteristic mutational signatures were imposed by some chemotherapeutic agents, but there were differences in burden between different classes of agent, different agents of the same class and different blood cell types. Chemotherapy also induced premature changes in the cell population structure of normal blood, similar to those of normal ageing. The results constitute an initial survey of the long-term biological consequences of cytotoxic agents to which a substantial fraction of the population is exposed during the course of their disease management, raising mechanistic questions and highlighting opportunities for mitigation of adverse effects.
GeneChip HTA 2.0 data of primary renal cell carcinoma (RCC) related to Reustle et al, Genome Med 12:2020 32. Preprocessing of microarray data was performed using Robust Multi-array Average (RMA).
Buccal epithelial cells of chimeric twins were isolated using laser-capture microdissection. Cells were pooled (13-60 cells) per batch to create a genomic DNA library using an NEB low-input kit.
We analyzed 264 plasma samples collected between June 2016 and September 2021 from 63 epithelial ovarian cancer patients using tumor-guided plasma cell-free DNA analysis to detect residual disease after treatment.
ATAC-seq performed in LCLs derived from constitutional MLH1 epimutation carriers and non-carrier relatives to profile alterations in chromatin accessibility associated with constitutional MLH1 epimutation.