Paired RNA-seq of bisulfite treated VDH01 samples control or depleted for NSUN3 (4 replicates each). The samples were prepped with NEBNext Ultra II DNA library prep kit and sequenced on MiSeq. Paired RNA-seq of fCAB treated vdh01 samples control or depleted for NSUN3(4 replicates each). the samples were prepped with NEBNext Ultra II DNA library prep kit and sequenced on MiSeq. Paired RNA-seq of fCAB treated VDH01 samples (3). the samples were prepped with NEBNext Ultra II DNA library prep kit and sequenced on MiSeq. Paired RNA-seq of bisulfite treated VDH01 samples (3). the samples were prepped with NEBNext Ultra II DNA library prep kit and sequenced on MiSeq.
Our objective was to establish a liquid biopsy-based monitoring strategy for pediatric high-risk neuroblastomas that are harboring genomic TERT rearrangements at diagnosis. TERT rearrangement breakpoints are detected by a hybrid capture-based neuroblastoma DNA panel sequencing (published in PMID: 34442335) in tumor material and are reflected in cell-free tumor DNA and can serve as robust biomarkers for disease activity. Within the dataset, 5 tumors of 4 pediatric patients with a neuroblastoma were DNA sequenced. Provided are FASTQ data files, bam and bambai files, as well as breakpoint spanning and encompassing read (enspan) bam and bambai files. Sequencing bam data files are aligned to GRCh37.p13 reference genome (processed).
Includes 4 Datasets: 1. 35 Control Plasma WGS samples sequenced on NovaSeq V1.0 Chemistry at the New York Genome Center. Denoted by CTRL-2XX naming scheme. 2. Plasma WGS from 17 patients with Small Cell Lung Cancer. Samples extracted at either Pretreatment or Postoperative at weeks 2 or 3. Denoted by SCLC-XX naming scheme. 3. Plasma WGS from a synthetic mixing study of a high-burden melanoma plasma sample with plasma bag, at estimated tumor concentrations of 10e-3, 10e-4, 10e-5, and 10e-6 for 2 replicates. Denoted by SM-repX naming scheme. 4. Assorted Plasma, Tumor, and Normal WGS from patients with NSCLC expressing high-burden for use in model training. Denoted by NSCLC-2XX naming scheme.
This dataset contains Chromium single-cell RNA-seq data and demultiplexing support for the PCA Atlas study EGAS00001008332. Included data objects: - 16 Chromium scRNA-seq FASTQ runs (EGAR accessions) for captures PCa1–PCa16. - 16 aligned BAM analyses (one Cell Ranger possorted_genome_bam.bam per capture). - 16 capture-level genotype VCF analyses derived from the Axiom UK Biobank array, used as demultiplexing panels. - demux_map.csv (EGAZ00001945520): donor–capture–sample mapping for genotype-based demultiplexing. - chromium.csv (EGAZ00001945537): run-level mapping linking captures, EGA samples, runs (EGAR), experiments (EGAX) and BAM analyses. Together these objects provide a complete view of the Chromium scRNA-seq data and the genotype-based demultiplexing support needed to reproduce donor assignment and downstream analyses.See also https://zenodo.org/records/17372603
Background and Rationale for the Childhood Cancer Survivor Study (CCSS) Over the last several decades, advances in treatments for childhood and adolescent cancer have substantially improved survival following diagnosis. These improvements gave rise to the responsibility for investigating long-term treatment-associated morbidity and mortality. Early efforts to describe late effects were largely conducted through single-institution and limited consortia studies. However, by the mid-1980s, it became increasingly clear that these approaches had inherent limitations, including small sample size, convenience sampling, incompletely characterized populations, and limited length of follow-up. To overcome these limitations, the CCSS was proposed and funded by the National Cancer Institute (NCI) as a U01 grant in 1994. Subsequently, the strengths of the CCSS, including an efficient and extensive infrastructure, plus expanding database and biorepository, were recognized and appreciated. Thus, in consultation with the NCI, the CCSS was converted to a U24 (resource grant) funding mechanism to serve the scientific community in 2000. The overarching goal of the CCSS resource is to increase the conduct of innovative and high impact research related to pediatric cancer survivorship. CCSS has been used extensively by researchers from a wide range of disciplines to address a broad spectrum of topics. Strengths of the resource include its large size, comprehensive annotation of treatment exposures, ongoing longitudinal follow-up with characterization of a wide array of participant characteristics and outcomes, and an established biorepository. Design of the Childhood Cancer Survivor Study The Childhood Cancer Survivor Study (CCSS) is a multi-institutional, multi-disciplinary collaborative research resource comprised of a retrospective hospital-based cohort of survivors of childhood cancer and a comparison sibling cohort. Eligible survivors from 31 participating institutions were diagnosed between 1970 and 1999, prior to age 21 years, with selected common pediatric cancers (leukemia, central nervous system tumors, Hodgkin lymphoma, non-Hodgkin lymphoma, kidney tumors, neuroblastoma, soft tissue sarcoma, or bone tumors). All patients who survived five years from the date of diagnosis were eligible, regardless of disease or treatment status. The baseline questionnaire was completed by 24,368 survivors and 5,039 siblings recruited to serve as a comparison group. To date, participants have completed three general follow-up surveys, as well as a number of specialized surveys on specific topics (e.g. health care, insurance, screening practices, men's and women's health issues, adolescent health, sleep and fatigue). In addition, biological samples (buccal cells, saliva and/or blood) have been collected for over 11,000 participants. Full descriptions of the design and characteristics of the CCSS have been previously published (Robison et al; Leisenring et al.), and available data and samples are described at https://ccss.stjude.org/develop-a-study/gwas-data-resource.html. Treatment Data in the Childhood Cancer Survivor Study A key feature of CCSS is the availability of detailed treatment data, which were collected by abstraction of medical records for each individual member of the cohort. Detailed abstraction included dates of therapy, protocol information, and specific details regarding surgery, chemotherapy and radiation. Quantitative dose details were collected for 22 specific chemotherapeutic agents, including alkylating agents, anthracyclines, platinum compounds and epipodophyllotoxins. In addition to individual agent doses, algorithms have been created to calculate cumulative doses of all drugs in a specific class, such as anthracyclines (doxorubicin, daunomycin and idarubicin) or platinum agents (cisplatinum and carboplatinum). Data abstracted for surgeries included dates and both the names and corresponding International Classification of Diseases (9th revision) code. For radiation treatment data, all relevant records were sent to the Radiation Physics Center at M.D. Anderson Cancer Center for detailed abstraction and dosimetry. Initial body region dosimetry was performed for all participants, followed by more detailed dosimetry as needed for specific studies. Genomics Data in the Childhood Cancer Survivor StudyThe NCI's Division of Cancer Epidemiology and Genetics and CCSS investigators collaborated to conduct genomics studies (SNP array genotyping and whole exome sequencing) using samples from the CCSS Biorepository. Studies included all cohort participants with available DNA regardless of sex or ancestry when the genomics studies were initiated. Phenotype Data in the Childhood Cancer Survivor Study Vital status and cause of death for both participants and non-participants is determined via linkage with the National Death Index (NDI). Identification of subsequent neoplasms is based on self-report, followed by validation using medical records, or via NDI. A wide array of additional health outcomes have been ascertained via a comprehensive set of questions on the CCSS questionnaires, covering potential adverse events across a range of organ systems (hearing/vision/speech, urinary, hormonal, heart and circulatory, respiratory, digestive, brain and nervous systems). In addition to health outcomes, longitudinal data have been collected on demographics, health behaviors, family history, screening practices, insurance status, and a range of psychosocial and neurocognitive factors. A full listing of available variables and copies of the CCSS questionnaires are available at http://ccss.stjude.org. Research Areas in the Childhood Cancer Survivor Study Extensive use by the research community has resulted in over 265 published manuscripts on a wide range of topics, including associations between treatment factors and mortality, subsequent neoplasms, chronic health conditions, cardiac events, neurocognitive sequelae, psychosocial factors, fertility, and health status. Additional topics have included health behaviors, screening practices, health care access and utilization, statistical and exposure assessment methodology, and development of risk prediction models. A full listing of published manuscripts using CCSS data is available on the CCSS website at https://ccss.stjude.org/published-research/publications.html. The Childhood Cancer Survivor Study as a Resource for Investigators The CCSS is an NCI-funded resource (U24 CA55727) to promote and facilitate research among long-term survivors of cancer diagnosed during childhood and adolescence. Interested investigators are encouraged to develop research ideas and propose projects within CCSS, whether or not they are from a participating CCSS institution. The CCSS is now accepting proposals to collaborate with CCSS and NCI investigators in the use of genomics data and corresponding outcomes-related data to address innovative research questions relating to potential genetic contributions to risk for treatment-related outcomes. Any researcher, or group of researchers, qualified to conduct genetic research can submit a proposal. There are no restrictions relative to country, institution, or prior involvement in CCSS. A full description of the process for developing a proposal for genetic research in CCSS can be found at https://ccss.stjude.org/develop-a-study/gwas-data-resource.html, along with listings of approved proposals.
Cleft palate (CP) is a common craniofacial structural birth defect caused by the incomplete closure of the palate (the structure separating the oral and nasal cavities), resulting in feeding, speech, and hearing problems. CP accounts for 33% of all orofacial clefts (OFCs) or approximately 1 in 1500 babies born worldwide. Although CP is commonly grouped with other types of orofacial clefts (e.g. cleft lip or cleft with cleft palate), CP is embryologically and epidemiologically distinct, suggesting a unique etiology. The risk of CP recurrence in first degree relatives is over 50-fold higher than the population risk, suggesting a strong genetic component. However, there have been a dearth of genetic studies for CP. Three well-powered genome-wide association studies and meta-analysis have revealed only two associated loci, neither of which account for a large portion of the genetic heritability in any population. The goal of this study is to elucidate the genetic architecture of CP by performing whole genome sequencing of case-parent trios in a well-phenotyped, multi-ethnic cohort.
The INCLUDE (INvestigation of Co-occurring conditions across the Lifespan to Understand Down syndromE) Project is an NIH-wide collaboration that seeks to improve health and quality-of-life for people with Down syndrome. The INCLUDE Project Data Coordinating Center and partners created the INCLUDE Data Hub, a centralized data resource that allows access to large-scale clinical and multi-omics datasets specific to Down syndrome and supports collaborative, cloud-based analysis to accelerate scientific discoveries related to Down syndrome and its co-occurring conditions.Down syndrome (DS) is the most common chromosomal abnormality in livebirths with an incidence of 1 in 700 in the US. To better understand the pathophysiology of DS, this proposal will generate and analyze sequence data on 777 pediatric DS patients from the Children's Hospital of Philadelphia (CHOP), as well as 321 mothers and 148 fathers. We anticipate that the information derived from this deeply-phenotyped cohort will allow for improved understanding of the pathophysiology and molecular mechanisms underlying DS-associated comorbidities, which may inform on new practices for treatment or innovative future therapies.
Non-coding regions comprise most of the human genome and harbor a significant fraction of risk alleles for neuropsychiatric diseases, yet their functions remain poorly defined. We created a high-resolution map of non-coding elements involved in human cortical neurogenesis by contrasting chromatin accessibility and gene expression in the germinal zone and cortical plate of the developing cerebral cortex. To obtain a high resolution depiction of chromatin structure and gene expression in developing human fetal cortex, we dissected the post-conception week (PCW) 15-17 human neocortex into two major anatomical divisions to distinguish between proliferating neural progenitors and post mitotic neurons: (1) GZ: the neural progenitor-enriched region encompassing the ventricular zone (VZ), subventricular zone (SVZ), and intermediate zone (IZ) and (2) CP: the neuron-enriched region containing the subplate (SP), cortical plate (CP), and marginal zone (MZ). Tissues were obtained from three independent donors and three to four technical replicates from each tissue were processed for ATAC-seq to define the landscape of accessible chromatin and RNA-seq for genome-wide gene expression profiling.
A large research cohort of Geisinger Abdominal Aortic Aneurysm (AAA) patients was created by enrolling and consenting patients of the Geisinger Department of Vascular Surgery. Consented patients provide blood, serum and DNA samples for research and authorize use of data in their medical record for research. They also complete a data questionnaire that asks information about family history of AAA and other vascular diseases, as well as information on known or suspected AAA risk factors, including smoking history, body mass index, hypertension, type 2 diabetes, and atherosclerotic disease. The AAA cases are 78% male, with a mean age of 74 years; 39% have undergone surgical or endovascular repair; more than 85% or current or past smokers; approximately 20% of cases report a positive family history of AAA. Geisinger AAA patients undergo regular imaging studies, typically every 6 months, to monitor the progression of aneurysm expansion. This allows the growth rates of their aneurysm to be calculated. DNA samples from a total of 910 AAA patients were used for whole genome genotyping; these results are included in the dbGaP.
Orofacial clefts (OFCs) are the most common craniofacial structural birth defect in humans caused by incomplete formation of the upper lip and/or the palate. Over 4,000 children in the Philippines are born with an OFC every year, and the Filipino OFC birth prevalence rate of 1 in 500 live births is one of the highest in the world. Genome-wide association studies support a multifactorial etiology for OFCs, but common variants only account for up to ~25% of the heritable risk in any one population. Sequencing studies in diverse populations is essential to fully understand the genetic architecture of OFCs. The study goals of the Kids First project are to sequence a collection of case-parent trios from the Philippines, in order to identify de novo mutations and inherited rare variants, and understand the population differences in common variant signals in the Philippines versus other populations. Approximately 373 trios from the Philippines were selected for whole genome sequencing (WGS) as part of this Gabriella Miller Kids First project.