Hearing loss in adults from South Carolina
Hearing loss in adults from South Carolina
Dataset
EGAD00001000204
Childhood Cancer Data Initiative (CCDI): Texas Pediatric Patient Derived Xenograft
The DNA/RNA samples of patients for which PDX models were developed were from a population of children, adolescents, and young adults from South Texas. Consent was obtained for collection of malignant and non-malignant tissue samples for genetic analysis and for the development of models to study new ways to treat cancer. For additional details, please refer to these publications: PMID: 37915590; PMID: 37990009.
Study
phs003215
AYA glioma NGS
Gliomas are a major cause of cancer-related death in adolescents and young adults (AYA, ages 15-39 years). Different molecular alterations drive gliomas in children and adults leading to distinct biology and clinical consequences. At the convergence of these age groups - the AYA population – little is known about the implications of pediatric vs adult-type alterations. To address this, we analyzed a population-based cohort of 1456 clinically and molecularly characterized gliomas aged 0-39 years. Potentially targetable pediatric-type alterations were found in 31% of AYA gliomas and conferred superior outcomes compared to adult-type alterations.
This submission includes the generated NGS data for children, adolescent, and young adults with gliomas. Data includes targeted panel of glioma genes and low-pass WGS (1x coverage).
Study
EGAS50000000383
Transcriptome analysis of adolescents and young adults with Acute Lymphoblastic Leukemia
The primary purpose of the research is identification of novel chromosome translocations in adolescents and young adults (AYA) with Acute Lymphoblastic Leukemia (ALL). We searched for novel chromosome translocations in 99 philadelphia-chromosome negative ALL cases (including 76 AYA cases) by RNA-seq using next-generation sequencers, and have discovered some novel chromosome translocations.
Study
JGAS000047
T2D-GENES: Exome sequencing
The Type 2 Diabetes Genetic Exploration by Next-generation sequencing in multi-Ethnic Samples (T2D-GENES) is a large collaboration effort between the University of Michigan, Broad Institute, the Wellcome Trust Centre for Human Genetics, University of Chicago, NIDDK, Texas Biomedical Research Institute, University of Texas Health Science Center, University of North Carolina, Blood Systems Research Institute, and University of Mississippi Medical Center. The goal of this study is to identify genetic variants for type 2 diabetes through multiethnic sequencing studies. In this study, we use next generation sequencing to assess the role of common, low frequency, and rare variants on type 2 diabetes risk. We preformed exome sequencing in 6,546 T2D cases and 6,457 control individuals of multiethnic origin, including individuals of African American, East Asian, South Asian, Hispanic, and European descent.
Study
EGAS00001001460
IDH- and H3-wildtype high-grade gliomas in teenagers and young adults
Whole exome sequencing of high grade gliomas occurring in teenagers and young adults between the ages of 13 and 30. This study combines methylation array profiling, whole exome sequencing and fusion panels sequencing to provide and integrated diagnosis of tumours in this age group.
Study
EGAS50000000641
ADHD Genomic Association Study
Studies at Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, Netherlands, has recruited participants with ADHD and participants without ADHD, which included matched healthy controls and family members of participants with ADHD. ADHD participants met DSM-IV-TR criteria for ADHD. DNAs of these participants, 1005 samples, were anonymously genotyped with Illumina PsychArray. These samples were not part of the PGC-ADHD group. After removing family members, 820 case/control independent samples, including 211 ADHD cases, went through genomic association analyses. Results of ADHD risk genetic markers were compared with genetic markers associated with brain structure alterations in ADHD in adults and adolescents.
Study
phs001869
Urethral Microbiome of Adolescent Males
Urethral microbiome of adolescent males is designed to characterize the microbial communities resident in the urethra of young men, to identify differences in these communities as a function of race/ethnicity, circumcision status, sexual exposures, and uro-genital symptoms. We collect detailed sexual behavior and symptoms data using cellular telephones with Internet access. Specimens are routinely collected at monthly intervals, and intermittently following reported symptoms, specific sexual exposures, or identification of a sexually transmitted infection. We also collect periodic samples from the penile coronal sulcus to better characterize its relationship to the urethral micriobial communities. Participants are ages 14 - 17 at enrollment, and prior history of sexual exposure is not required for participation. Parental permission is obtained for each participant. The planned duration of followup is up to 4 years allowing for prospective observation of both physical and behavioral maturation from middle adolescence into young adulthood. The overall objectives of the project are to better characterize the healthy male urethral microbiome, and to use this information to better understand acquisition of urethritis and sexually transmitted infections, as well as chronic genital pain and prostatitis syndromes that become common among young adults.
Study
phs000259
The Immunogenetics of Measles Immunity - Measles (MMR) vaccination (NIAID/NIH)
To examine the role of gene polymorphisms (including functional studies of SNPs) on variations in vaccine-induced immune responses, in approximately 3,000 healthy school-age children, adolescents and young adults/adults, who were vaccinated with measles-containing vaccine (MMR). To achieve greatest power to detect SNPs associated with measles-specific immune response phenotypes, we pooled our data across three genotyping platforms and three cohorts (US cohort, San Diego cohort and Rochester cohort).
Study
phs001630
Whole Genome Sequences of 18th century African descended individuals from Charleston, South Carolina.
This study presents Whole Genome Sequencing results from the Anson Street African Burial Ground Project, which is a community-based initiative aimed at understanding the histories of 37 Ancestors found during construction in Charleston, South Carolina. Here we report fastq files for all 37 Ancestors. DNA was extracted at the University of Tennessee-Knoxville following Dabney et al. 2013, and dual index libraries prepared using a modified NEBNext Ultra II kit with partial USER enzyme digestion. Libraries were then enriched for human genomic DNA (MyBaits) and sequenced on Illumina Platforms.
Study
EGAS00001006693
Bogalusa Heart Study (BHS-BioLINCC)
Objectives: To investigate the early natural history of cardiovascular disease in a cohort of children and young adults in a biracial, semirural community.Background: The Bogalusa Heart Study has been a long-term epidemiologic study. The investigators have identified and followed black and white participants for nearly 40 years, and have described the incidence and prevalence of biologic and behavioral cardiovascular disease risk factors from childhood through adulthood. Their participation has enabled the study to not only document differences between males and females, but also between blacks and whites. The results from the Bogalusa Heart Study have clearly documented that the genesis of atherosclerosis has its basis in childhood, and that prevention can and must begin at the early ages.The Bogalusa Heart Study had been funded over the years by the Specialized Centers of Research (SCOR) Program. The SCOR program was initiated by NHLBI in 1970 to expedite the development and application of new knowledge essential for improved diagnosis, treatment, and prevention of arteriosclerosis, hypertension, pulmonary disease, and thrombosis. In 1984 a Demonstration and Education Component was added to the parent SCOR of the Bogalusa Heart Study in order to translate the experience gained in epidemiological studies into an intervention study designed to retard the development of cardiovascular risk factors in children. Beginning in 1997, the study was supported by the cooperative agreement mechanism. Participants: The Bogalusa dataset includes 11,796 participants that attended at least one of seven cross-sectional pediatric exams and/or the 1995-96 adult examination. Subjects ranged in age from 3 to 20 years at the pediatric exams and 20-37 at the time of the adult exam. Approximately 6,000 have more than one examination constituting a dynamic cohort. Design: The initial survey in 1973-1974 was restricted to children ages 2 1/2 to 14. A physical examination was conducted and information was collected on anthropometric data, hemoglobin, blood pressure, serum lipids, and health history. Over 3,500 children participated. The second cross-sectional survey of 1976-1977 and subsequent surveys expanded the eligible population to include all children ages 5-17 years. The second survey of over 4,000 children also included information on salt intake, smoking, health beliefs, and attitudes, and for girls ages 8-17, menstrual history and oral contraceptive use. The third survey of over 3,500 participants in 1978-1979 also collected anthropometric measurements on skinfold thickness and two measurements of heart rate. The fourth survey of over 3,300 participants in 1981-1982 added data on alcohol use, Type A behavior, peer networks and dieting habits.The Bogalusa Heart Study continued to use a cross-sectional and longitudinal design with the general cross-sectional survey of approximately 3,700 Bogalusa children ages five to seventeen in 1988-1989 in the sixth screen and additional longitudinal studies to recall children in defined subgroups for more intensive evaluation. Half of the 12,000 participants screened since 1973 had been studied three or more times. The Post High School Study examined young adults ages 21 through 30 who previously were examined as children ages five through fourteen in the first Bogalusa Heart Study screening in 1973-1974. The population included approximately 4,603 young adults originally screened and any other children or adolescents examined for the first time in any subsequent surveys. The cardiovascular phenotypes include obesity, blood pressure, lipids, lipoproteins, apoproteins, homocysteine, glucose-insulin, fibrinogen, plasminogen activator inhibitor-1 and von Willebrand Factor. Environmental risk factors consist of sociodemographic characteristics, tobacco and alcohol use, oral contraception, physical activity, cognitive and physical function, and quality of sleep and diet. Subclinical morbidity includes echo-Doppler measurements of cardiac-carotid structure and function.
Study
phs004173
Genomic and Transcriptomic Landscape of Fibrolamellar Hepatocellular Carcinoma
Fibrolamellar hepatocellular carcinoma (FL-HCC) is a rare liver tumor primarily affecting adolescents and young adults. Little is known of the molecular pathogenesis. To characterize the disease we performed RNA sequencing and whole genome sequencing on FL-HCC tumors and adjacent normal tissue. The results demonstrate few consistent differences on the chromosomal level and many hundreds of alterations in the expression of RNA transcripts.
Study
phs000709
Whole Genome Sequences of 18th century African descended individuals from Charleston, South Carolina.
This study presents Whole Genome Sequencing results from the Anson Street African Burial Ground Project, which is a community-based initiative aimed at understanding the histories of 37 Ancestors in Charleston, South Carolina. Here we report fastq files for all 37 Ancestors. DNA was extracted at the University of Tennessee-Knoxville following Dabney et al. 2013, and dual index libraries prepared using a modified NEBNext Ultra II kit with partial USER enzyme digestion. Libraries were then enriched for human genomic DNA (MyBaits) and sequenced on Illumina Platforms.
Dataset
EGAD00001009643
Hearing_loss_in_adults_from_South_Carolina_
DNA from ten individuals from South Carolina with progressive hearing loss will be subjected to exome sequencing
Study
EGAS00001000157
Myeloma Research Laboratory, University of Adelaide and South Australian Health and Medical Research Institute
Dac
EGAC00001000849
Add Health: Longitudinal Study of a Nationally Representative Sample of Adolescents in Grades 7-12 in the United States during the 1994-95 School Year, Followed into Adulthood with Five Interviews/Surveys in 1995, 1996, 2001-02, 2008, and 2016-18
The National Longitudinal Study of Adolescent to Adult Health [Add Health] is an ongoing longitudinal study of a nationally representative U.S. cohort of more than 20,000 adolescents in grades 7-12 (aged 12-19 years) in 1994 followed into adulthood with five interviews/surveys in 1995, 1996, 2001-02, 2008, and 2016-18. Add Health was designed to understand how social environments and behaviors in adolescence are linked to health and achievement outcomes in young adulthood. Add Health contains unprecedented environmental, behavioral, psychosocial, biological, and genetic data from early adolescence and into adulthood on a large, nationally representative cohort with unprecedented racial, ethnic, socioeconomic, and geographic diversity. Add Health has a large, multidisciplinary user base of over 50,000 researchers around the world who have published over 3,400 research articles. Add Health is housed at the Carolina Population Center of the University of North Carolina at Chapel Hill. Add Health datasets are distributed according to a tiered data disclosure plan designed to protect the data from the risk of direct and indirect disclosure of respondent identity. Add Health's large sample size, population diversity and rich longitudinal database of psychosocial, physical, and contextual data will permit investigation of an exceptionally broad range of phenotypes with known genetic variation. Prospective longitudinal measures are available to document change over time in each of these phenotypes, as well as change in the social environment and life experiences, making the Add Health sample ideal for understanding genetic linkages with health and behavior across the life course. The original design of Add Health included important features for understanding biological processes in health and developmental trajectories across the life course of young people, including an embedded genetic sample with more than 3,000 pairs of adolescents with varying biological resemblance (e.g., twins, full sibs, half sibs, and adolescents who grew up in the same household but have no biological relationship), testing of saliva and urine for sexually transmitted infections and HIV, and biomarkers of cardiovascular health, metabolic processes, immune function, renal function, and inflammation. Add Health therefore has critical objective indicators of health status and disease markers in young adulthood, well before chronic illness or its complications emerge in later adulthood. Because DNA has been collected on the full sample at Wave IV, it is possible to link genetic profiles with social, behavioral, and biological measures over time from adolescence into adulthood. Add Health sampled the multiple environments in which young people live their lives, including the family, peers, school, neighborhood, community, and relationship dyads, and provides independent and direct measurement of these environments over time. Add Health contains extensive longitudinal information on health-related behavior, including life histories of physical activity, involvement in risk behavior, substance use, sexual behavior, civic engagement, education, and multiple indicators of health status based on self-report (e.g., general health, chronic illness), direct measurement (e.g., overweight status and obesity), and biomarkers. No other data resource with this expanse of genotype and phenotype data on a large nationally representative longitudinal sample with race, ethnic, socioeconomic, and geographic diversity exists. A complete reference guide on study design and accomplishments can be found on the Add Health website: https://cdr.lib.unc.edu/concern/articles/6t053j27s.
Study
phs001367
Carolina Breast Cancer Study (CBCS)
The Carolina Breast Cancer Study (CBCS), phase III, is a population-based study aiming to understand the social and molecular correlates of breast cancer outcomes, particularly among Black and young women. CBCS recruited first primary breast cancer cases from 44 North Carolina counties between 2008-2013. All cases were between age 20 and 74 and self-identified as female. Black women and young women were systematically oversampled by randomized recruitment such that the study includes approximately 50% Black women and approximately 50% women diagnosed before age 50. A total of 2,998 participants were enrolled. In the ten years following diagnosis, women completed surveys approximately every 1-2 years and diagnoses of subsequent distant local recurrence and second primary tumors were ascertained via linked medical records. We identified records of women who developed secondary breast tumors (including second primary tumors, contralateral breast cancers, and locoregional recurrences), and these participants were oversampled for targeted DNA sequencing using the UNCseq panel (~1200 genes).CBCS germline pathogenic mutations cannot be investigated because participants were consented between 2008 and 2013 with specific language prohibiting this. However, germline data can be used to call somatic mutations or to standardize somatic genomic data.For sharing of study data not included here, please visit unclineberger.org for instructions on submitting a data use agreement.
Study
phs003725
Targeted Long-Read Sequencing of the Ewing Sarcoma 6p25.1 Susceptibility Locus
Ewing Sarcoma (EwS) is rare but the second most common cancer in children, adolescents and young adults. Recent studies suggest germline variation in or around GGAA microsatellites may interact with fusion oncoprotein binding causing changes in binding affinity and potentially impacting enhancer activity on nearby target genes. This study aims to better understand germline variation around EwS susceptibility regions by performing long-range haplotype sequencing of these regions using PacBio sequencing. Our analysis focused on characterizing variation in GGAA microsatellite length as well as germline variants in or around the microsatellite and their association with EwS risk.
Study
phs003159
300-Obese: clinical cohort of obese individuals, Nijmegen, the Netherlands
300-Obese cohort was recruited at the Radboud University Medical Center (RUMC), Nijmegen, the Netherlands. The cohort comprises 377 participants included by the following criteria: age>55 years, BMI>27 kg/m2. The cohort data includes gut microbiome, NMR serum metabolomics, deep cardiovascular phenotyping and broad range of phenotypic information.
Study
EGAS00001003508
Association of Age at Diagnosis and Genetic Mutations in Patients with Neuroblastoma
Neuroblastoma is diagnosed over a wide age range from birth through young adulthood, and older age at diagnosis is associated with a decline in survivability. We performed whole genome sequencing of DNA from diagnostic tumors and their matched germlines from 40 patients with metastatic neuroblastoma obtained between 1987 and 2009. To identify genetic lesions that are associated with age at diagnosis in patients with metastatic neuroblastoma. Age groups at diagnosis included infants (0-18 months), children (18 months-12 years), and adolescents and young adults (>12 years). To confirm the findings from this discovery cohort, validation testing using tumors from an additional 64 patients obtained between 1985 and 2009 was also performed. Formalin-fixed paraffin-embedded tumor tissue was used for immunohistochemistry and fluorescent in situ hybridization. Telomere lengths were analyzed using the whole genome sequencing data, quantitative polymerase chain reaction and fluorescent in situ hybridization
Study
EGAS00001000213
ICR TYA WES
Whole exome sequencing of high grade gliomas occurring in teenagers and young adults between the ages of 13 and 30. This study combines methylation array profiling, whole exome sequencing and fusion panel sequencing to provide an integrated molecular description of brain tumours in this age group.
Dataset contains 96 BAM files aligned with BWA to GRCh37 from 88 tumours. Tumour DNA was isolated from FFPE material in most cases. 8 germline sequences from peripheral blood DNA are also included.
Dataset
EGAD50000000904
Experimental and Clinical Studies of Presbycusis
This study aimed to investigate genes and variants associated with adult-onset progressive hearing loss, a common and complex disease with a strong genetic component.Participants enrolled in an ongoing longitudinal study of age-related hearing loss at the Medical University of South Carolina (MUSC), dating from 1987. Pure tone thresholds at 0.25, 0.5, 1.0, 2.0, 3.0, 4.0, 6.0 and 8.0 kHz were obtained for each ear of each person, along with questionnaire responses concerning noise exposure history. Based on the audiometry, participants were classified into 5 groups as "Older-Normal", "Metabolic", "Sensory", "Unclassified" and "Unselected" by their hearing loss.Exome sequencing was carried out using the Agilent SureSelect X2 Target Enrichment System (version 5) and the Agilent SureSelect Human All Exon V5 kit, which included 5' and 3' UTRs. DNA was sheared using the Covaris S220 focused ultrasonicator. Libraries were sequenced on the Illumina HiSeq 2500. Variant loads per gene were calculated and compared between groups. Individual variants affecting hearing thresholds were also identified. Analyses were followed up in a second cohort. Several genes and variants were identified as novel candidates associated with both better and worse hearing. Data available through dbGaP include whole exome sequencing (WES) files and audiometric phenotyping.
Study
phs003327
Development and Validation of Organoids from Fibrolamellar Carcinoma Human Cells
Fibrolamellar hepatocellular carcinoma (FL-HCC) is a rare liver tumor affecting adolescents and young adults with no history of primary liver disease or cirrhosis. We collected fibrolamellar tumor tissue and the adjacent non-tumor liver from patients at the time of surgery. We only used tumor tissue that was validated as fibrolamellar by the presence of the DNAJB1-PRKACA fusion gene. The freshly resected tissue was dissociated and grown as organoids. This yielded 21 patient-derived organoid lines. From the tumor tissue we developed twelve organoids from metastases and three from the liver tumor. Additionally, we generated six organoid lines from the adjacent non-tumor liver. The tumor organoids were all shown to recapitulate the histologic morphology, immunohistochemistry, and transcriptome of the tumor tissue from which they were derived.
Study
phs002439
Research in Adaptive Interests, Skills, and Environment
The Research on Adaptive Interest, Skills, and Environments (RAISE) project is a coordinated set of pilot studies funded and otherwise supported by the Center for the Study of Adolescent Risk and Resilience (C-StARR) at Duke University. The initial data collection effort (Adolescent T1; 2015; NAHDAP 100737) was conducted by telephone and focused on self-regulation and related skills during early adolescence and their role in early instances of health risk behavior. The survey was comprised of brief measures of constructs in six areas of interest to C-StARR investigators: (1) background and home environment; (2) self-regulation and personality; (3) physical and mental health; (4) problem behaviors; (5) academics and school; and (6) technology use. The representative sample of 2104 adolescents was drawn from the population of fifth through eighth graders enrolled in North Carolina public schools during the 2014-2015 school year.In 2016, a subsample of 395 participants from the Adolescent T1 survey (selected on the basis of having resided within 100 miles of study personnel hubs in Durham, North Carolina and Sylva, North Carolina) participated in an additional home visit study that involved the collection of biological data and measures of cognitive ability and executive functioning. The molecular data made available to dbGaP were obtained from saliva samples that were collected (a) in person from 377 of 395 home visit study participants and (b) by mail from 60 participants who had participated in the Adolescent T1 survey but had not participated in the home visit study. The resulting data file was imputed to a 1000G phase 3 panel and is comprised of N = 437 individuals and 7,193,135 markers after filtering. Data were filtered with MAF ≥ 0.01, HWE p-value > 10-6, and genotype rate > 95%.
Study
phs003982
RNA-seq, WGS and WES of Hepatocellular carcinomas, enriched in fibrolamellar carcinomas
DNAJB1-PRKACA fusion is a specific driver event in fibrolamellar carcinoma (FLC), a rare subtype of hepatocellular carcinoma (HCC) occurring in adolescents and young adults. In older patients, molecular determinants of HCC with mixed histological features of HCC and FLC (mixed-FLC/HCC) remain to be discovered.This study aims to molecularly caracterize mixed fibrolamellar hepatocellular carcinoma (mixed FLC/HCC) within the molecular liver tumors diversity, in respect of pure FLC and different HCC subtypes. This study reunites WGS data (n=3, Tumoral/Non-Tumoral pairs), WES data (n=43, Tumoral/Non-Tumoral pairs) and RNAseq data (n=66, Tumoral only). On a trascriptomic level, most of the mixed-FLC/HCC RNAseq clustered in a robust subgroup of tumors all showing mutation or translocation inactivating BAP1 that codes for the BRCA1 associated protein-1.
Study
EGAS00001003837
Childhood Cancer Data Initiative (CCDI): Molecular Characterization Initiative
The National Cancer Institute's (NCI) Childhood Cancer Data Initiative (CCDI) focuses on the critical need to collect, analyze, and share data to address the burden of cancer in children, adolescents, and young adults (AYAs). The Molecular Characterization Initiative (MCI) will further the CCDI's goals by providing access to better diagnostic tests for pediatric and AYA patients. The molecular characterizations of solid tumors, soft tissue sarcomas, and rare diseases are performed in a CLIA-certified setting as results may be used to screen for and/or confirm clinical trial eligibility, direct treatment, or otherwise contribute to the conduct of the trial. The following molecular characterization assays were performed:Tumor/Normal Whole Exome sequencing Methylation arraysArcher fusion panel Deidentified clinical reportsNote: New data will be added periodically, please check the CCDI Hub Explore site for updates.
Study
phs002790
SNV and indel calls from 8086 individuals in the British Autozygosity Populations BioResource dataset
This includes variant calls (single nucleotide variants and small insertions/deletions) from 8086 (mostly British Pakistani/British Bangladeshi) individuals from the following studies:
1. 3781 British Pakistani/British Bangladeshi adults from East London Genes and Health
2. 2791 British South Asian mothers from Born in Bradford
3. 1428 British South Asian adults from Birmingham
4. 86 individuals (mixed ancestries) from families with rare diseases, from Queen Mary University London
All of the Birmingham and most of the Born in Bradford samples were previously sequenced as part of PMID: 26940866.
Mapping was done with bwa-mem and variant calling was carried out with GATK HaplotypeCaller. We removed variant sites for which the following was true:
SNPs: "QD < 2.0 || FS > 30 || MQ < 40.0 || MQRankSum < -12.5 || ReadPosRankSum < -8.0"
Indels: "QD < 2.0 || FS > 30 || ReadPosRankSum < -20.0"
Dataset
EGAD00001004581
Genome-wide Identification of Variants Affecting Early Human Brain Development
This is a genome-wide association study (GWAS) of global brain tissue volumes in human infants. The published study for this project includes 561 infants, and 239 parents gave consent for data sharing through dbGaP. An intronic single-nucleotide polymorphism (SNP) in IGFBP7 (rs114518130; GeneID: 3490) met genome-wide significance for gray matter volume (P=4.15x10-10). An intronic SNP in WWOX (rs10514437; GeneID: 51741) neared genome-wide significance for white matter volume (P=1.56x10-8). Additional loci with small P-values include psychiatric GWAS associations and transcription factors expressed in the developing brain. Genetic risk scores for schizophrenia and ASD, and the number of genes affected by rare copy number variants (CNV burden) did not predict global brain tissue volumes. Integrating these results with large-scale GWAS in adolescents [Philadelphia Neurodevelopmental Cohort (PNC)] and adults [Enhancing Neuro Imaging Genetics through Meta-Analysis version 2 (ENIGMA2)] suggested minimal overlap between common variants impacting brain volumes at different ages.
Study
phs001122
A Case-Controlled Study for Genotype-Phenotype Associations in Multiple Sclerosis (MS)
This is a multi-centre, case-controlled study to develop a dataset containing 1000 MS cases and 1000 matched controls and to associate DNA sequence (allelic) variations with MS phenotypes. Study subjects were enrolled through a prospective effort initiated in 2003. Three MS clinical centres were involved in subject recruitment and biological specimen collection using identical inclusion/exclusion criteria, two in Europe (Vrije Universiteit Medical Center, Amsterdam; and University Hospital Basel) and one in the US (University of California San Francisco). This study recruited subjects of northern-European ancestry with a diagnosis of MS (McDonald et al., 2001), with dissemination in time and space. Patients with Clinically Isolated Syndromes (CIS) were also included if they fulfilled 3 of the 4 Barkhof criteria for dissemination in space as per application of the McDonald criteria (McDonald et al., 2001). While recruitment predominantly included subjects with a relapsing onset of MS, individuals with all clinical subtypes of the disease participated, including clinically isolated syndrome (CIS), relapsing remitting MS (RRMS), secondary progressive MS (SPMS), primary progressive MS (PPMS), and progressive relapsing MS (PRMS). The control group consisted of unrelated individuals, primarily spouses/partners, friends, and other volunteers. Control subjects were of northern-European ancestry and matched as a group, proportionally with cases according to age (±5 years) and gender. A familial history or current diagnosis of MS as well as a relation to another case or control subject were considered exclusionary for this group. Protocols were approved by the Committees on Human Research at all Institutions and informed consent was obtained from all participants prior to participation in the study. Primary Study Objective:To identify DNA sequence variations (genotype) and flanking sequences that are associated with clinical factors (phenotype) which differ between study subjects with and without MS. Secondary Study Objectives: To develop a clinical dataset including quantitative measures of 1000 well-characterized cases with MS, and 1000 ethnically matched controls. To identify other genotype-phenotype associations in MS study subjects such as magnetic resonance imaging (MRI) measures of disease burden and/or severity. To identify or confirm candidate surrogate markers of neurodegeneration using a variety of techniques including biochemical assays, blood transcriptome analysis, plasma proteomics and MRI*. GenotypingGenotyping of the complete dataset was performed at the Illumina facilities using the Sentrix® HumanHap550 BeadChip. *MRI results are not available on dbGaP.
Study
phs000171
New Brain Tumor Entities Emerge from Molecular Classification of CNS-PNETs
Primitive neuroectodermal tumors of the central nervous system (CNS-PNETs) are highly aggressive, poorly differentiated embryonal tumors occurring predominantly in young children but also affecting adolescents and adults. Herein we demonstrate that a significant proportion of institutionally diagnosed CNS-PNETs display molecular profiles indistinguishable from those of various other well defined CNS tumor entities, facilitating diagnosis and appropriate therapy for patients with these tumors. From the remaining fraction of CNS-PNETs we identify four new CNS tumor entities, each associated with a recurrent genetic alteration and distinct histopathological and clinical features. These new molecular entities, designated “CNS neuroblastoma with FOXR2 activation (CNS NB-FOXR2)”, “CNS Ewing sarcoma family tumor with CIC alteration (CNS EFT-CIC)”, “CNS high grade neuroepithelial tumor with MN1 alteration (CNS HGNET-MN1)”, and “CNS high grade neuroepithelial tumor with BCOR alteration (CNS HGNET-BCOR)”, will enable meaningful clinical trials and the development of therapeutic strategies for patients affected by poorly differentiated CNS tumors.
Study
EGAS00001001632
A Genome-wide Association Study (GWAS) of Risk for Osteosarcoma
This is a genome-wide association study (GWAS) of osteosarcoma, the most common primary bone malignancy. Osteosarcoma typically occurs in adolescents and young adults. It occurs at increased frequency in several inherited cancer predisposition syndromes but the genetic contribution to sporadic osteosarcoma is largely unexplored. The objective of this study was to identify genetic risk factors for osteosarcoma by conducting a genome-wide association study. We developed collaborations with multiple institutions in order to attain the necessary sample size required to discover novel loci in the genome associated with osteosarcoma using the GWAS approach. Genomic DNA (either blood or buccal in source) derived from osteosarcoma cases was obtained from each participating institution or research group. De-identified blood or buccal cell DNA samples from osteosarcoma cases were derived from existing biobanks at the collaborative institutions. Control subjects were derived from existing NCI cohorts and matched by gender and ethnicity.
Study
phs000734
A Genome-Wide Association Study of Heroin Dependence
This collaboration of Australian and American investigators aims to identify genes associated with liability for heroin dependence. The project uses a case-control design in which cases met lifetime DSM-IV criteria for heroin dependence. Controls included assessed individuals who did not meet DSM-IV heroin dependence criteria and unassessed general population controls. Cases and controls were obtained from the several large investigations including: The Comorbidity and Trauma Study, Heroin Dependence in Western Australia, the OZ-ALC Study, a Twin Study of Mole Development in Adolescence, and ongoing genetic studies of substance dependence conducted by investigators at Yale and collaborating institutions. These projects are briefly described below. The Comorbidity and Trauma Study (PI: Elliot Nelson), a retrospective case-control study examining genetic and environmental factors contributing to heroin dependence liability. The study was funded by the National Institute on Drug Abuse (NIDA), and was run in collaboration with Washington University, the Queensland Institute of Medical Research (QIMR), and the National Drug and Alcohol Research Centre (NDARC), University of New South Wales. Case participants were recruited from maintenance clinics in the greater Sydney area. Control participants were recruited from employment centres and community centres, open street malls, and local press servicing the same geographical area as the opioid maintenance treatment clinics and either denied recreational use of opioids or had used these drugs recreationally fewer than 11 times lifetime. The prevalence in these individuals of non-opioid licit drug dependence and illicit drug dependence as well as childhood trauma exposure and other psychiatric disorders is elevated considerably versus estimates of similar measures in Australian general population samples. Participants provided blood samples as a source of DNA and completed a comprehensive psychiatric diagnostic interview based on the Semi-Structured Assessment of the Genetics of Alcoholism - Australia (SSAGA-OZ) augmented with sections drawn from other instruments assessing childhood trauma exposure, family history, and screening for borderline personality disorder. Heroin Dependence in Western Australia (PI: Sybille Schwab) is a study focusing both on genetic contributions to heroin dependence and response to naltrexone treatment of the disorder. Participants completed a clinical assessment and provided blood samples during their treatment at the Perth Naltrexone Clinic now name as the Fresh Start Recovery Programme. Funding for the project was provided by the Australia Government's National Health and Medical Research Council (Grant # 513862; PI: Sybille Schwab) Affected subjects from ongoing genetic studies of substance dependence conducted by investigators at Yale (PI: Joel Gelernter) and collaborating institutions were collected in the course of several NIDA-funded studies. Those included in the current set were assessed by means of the SSADDA (Semi-Structured Assessment for Drug Dependence and Alcoholism). All are opioid dependent European-Americans, and all list heroin as the opioid must used. Most were collected at Yale University School of Medicine or University of CT School of Medicine under the supervision of Drs Joel Gelernter and Henry Kranzler. Control subjects were also collected in the course of several NIDA- and NIAAA-funded studies. Those included in the current set were assessed by means of the SSADDA (Semi-structured Assessment for Drug Dependence and Alcoholism). Most were collected at Yale University School of Medicine or University of Connecticut School of Medicine under the supervision of Drs Joel Gelernter and Henry Kranzler. The OZ-ALC Study (PI: Andrew Heath) consists of a large group of twins and their family members ascertained from the general population Australian Twin Registry who have participated in ongoing research projects. For the control investigation, we have selected individuals who do meet criteria for illicit drug dependence who have had GWAS genotyping with the Illumina Human CNV370-Quad. Inclusion of individuals with alcohol dependence or nicotine dependence was minimized. For a more detailed description of the study, please see: http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000181.v1.p1 The Twin Study of Mole Development in Adolescence (PI: Nick Martin) is an ongoing investigation of melanocytic naevi funded by the Australian Government's National Health and Medical Research Council (Grant # 389891; PI: Nick Martin). For the current project, unassessed parents of these twins will serve as a control group. These individuals will either have been previously genotyped with the Illumina Human 610-Quad BeadChip or will be genotyped as part of the current project. Parents have largely survived the period of risk for heroin dependence and, by virtue of their participation in this research, are very likely to have a prevalence of heroin dependence lower than that in the general population (i.e., <0.7%). In order to understand the immune modulatory effect of opioids across peripheral blood immune populations, a second study (PI: Christine S. Cheng) analyzed single cell RNA-seq profiles from the PBMCs of a subset of case participants and control participants from the Comorbidity and Trauma Study (PI: Elliot Nelson). The study assessed and identified transcriptional changes between opioid dependent and control samples across naive and LPS stimulated immune populations.
Study
phs000277
NHLBI TOPMed - NHGRI CCDG: UCSF Atrial Fibrillation Study
The University of California San Francisco (UCSF) Cardiovascular Research Institute (CVRI) Resource in Arteriosclerosis and Metabolic Disease is an ongoing multi-ethnic study of adults ≥ 18 years of age which was started in 1989 and now includes 28,000 participants recruited from the UCSF medical system. Within the Resource lies data and biospecimens from nearly 1,000 patients presenting to the electrophysiology laboratory for electrophysiology procedures that were densely phenotyped for electrophysiologic characteristics with biospecimens collected from various intra and extra-cardiac chambers. Phenotyping of all participants was achieved via interview and review of medical records. A subset of 113 participants with early-onset atrial fibrillation underwent WGS as a part of the TOPMed Program.
Study
phs001933
NHLBI TOPMed: Pulmonary Hypertension and the Hypoxic Response in SCD (PUSH)
During Visit One, the PUSH Study will perform echocardiography on 600 children and adolescent with patients with sickle cell disease (SCD) and 100 control children and adolescents at three Field Centers, namely Howard University, Children's National Medical Center and University of Michigan. Patients or their parents will be approached and asked to give informed consent. If they appear to have a difficult reading, reading of the consent will be offered. Patients or their parents not appearing to comprehend the consent will not be eligible. As a part of this visit, each participant or parent will sign informed consent, complete a Participant Contact Information Form, complete a Medical History Form, undergo physical examination with completion of a Physical Examination Form and have blood drawn. Each participant must have echocardiography performed with measurement of Tricuspid Regurgitant Jet Velocity (TRV). In addition, attempts will be made 1) to perform a six-minute walk test, 2) to collect information from a recent (within six months) Transcranial Doppler Study (TCD) or to perform TCD, and 3) to perform pulmonary function tests. Study personnel will review all forms for completeness and conduct phlebotomy. Blood will be shipped to the Central Lab. Results of all procedures and tests will be transmitted to the Data Manager at Howard University. Sequencing was only done on sickle cell participants.
Study
phs001682
Coronary Artery Risk Development in Young Adults (CARDIA-BioLINCC)
Data Access NOTE: Please refer to the “Authorized Access” section below for information about how access to the data from this accession differs from many other dbGaP accessions.Related Studies: Other CARDIA data available include:phs000236.v2.p2 PAGE: CALiCo: Coronary Artery Risk Development in Young Adults(CARDIA) phs000309.v3.p2 The CARDIA-GENEVA Study phs000399.v1.p2 NHLBI GO-ESP: Heart Cohorts Exome Sequencing Project (CARDIA) phs000613.v1.p2 NHLBI CARDIA Candidate Gene Association Resource (CARe) phs003675.v1.p1 CARDIA Multi-omics Obesity & CVD Substudy – Year 20 Untargeted Metabolomics Data phs001612.v3.p2 NHLBI TOPMed: CARDIA (Coronary Artery Risk Development in Young Adults) phs003045.v1.p1 Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Coronary Artery Risk Development in Young Adults Study (CARDIA) phs000285.v4.p3 CARDIA_Cohort Available Data: Available data for request include exam data through Year 30 and outcomes and death data through 2022. Additionally, data from 28 ancillary studies are available for request.Objectives: The original objectives of CARDIA were to document levels of risk factors for coronary artery disease and potential determinants of these risk factors in young adults; to study the interrelationships of risk factors and lifestyles and to document behavioral and environmental changes during the transition from young adulthood to middle age; to compare cross-sectional and longitudinal data on age-related trends in cardiovascular disease risk factors; and to compare levels and evolution of risk factors between men and women, blacks and whites, and in groups of differing socioeconomic status. Goals of the study have evolved to emphasize understanding determinants of left ventricular mass, emerging obesity and hypertension, and sequelae of hypertension in pregnancy.Background: CARDIA is designed to increase understanding of contributors to changes in cardiovascular disease (CVD) risk factors during the critical years of transition from young adulthood to middle age. CARDIA was funded initially in 1983 for a five-year cycle that included two rounds of examinations. Contract renewals have allowed for subsequent re-examinations.Participants: Black and white men and women; ages 18-30 years at entry with a range of attained education; original sample size: 5,115.Design: CARDIA is a population-based observational study of 5,115 participants aged 18-30 years recruited in 1985-1986. The sample was designed to achieve approximately balanced subgroups of race, gender, education (high school or less and more than high school) and age (18-24 and 25-30). Forty percent of the cohort had no more than a high school education. A total of nine examinations have been completed in the cohort with examination cycles at year 2 of the project and years 5, 7, 10 , 15, 20, 25, and 30.In addition to standard measurements of blood pressure, anthropometry, blood lipids, smoking behavior, physical activity, diet, pulmonary function, and many psychological factors, CARDIA has other included measurements (in subsets or in the full cohort) to obtain unique information on other aspects of risk factor development and early morbidity. These have included: graded exercise treadmill testing; echocardiography, particularly for measurement of left ventricular mass; cardiovascular reactivity; serum cotinine; Lp(a), apoE phenotype, apolipoprotein A1 and B; homocysteine; skin reflectance; body composition by dual X-ray absorptiometry; glucose tolerance testing; vascular resistance and compliance; and plasma renin activity and sympathetic nervous system activity.
Study
phs003739
Genomic Analysis of Fibrolamellar Hepatocellular Carcinoma
Fibrolamellar hepatocellular carcinoma (FL-HCC) is a very rare and distinct subtype of hepatocellular carcinoma that occurs in the previously healthy livers of adolescents and young adults. The low incidence rate has kept FL-HCC an understudied cancer, thus the list of FL-HCC markers is limited and the genetic alterations driving its growth have remained unknown. In order to gain a better understanding of the molecular mechanisms underlying FL-HCC initiation and progression, we performed an in-depth genomic analyses of one tumor followed by immunohistochemistry validation on seven other tumors. We showed the expression of neuroendocrine markers in FL-HCC. In addition, DNA/RNA sequencing revealed that common cancer pathways are not visibly altered in FL-HCC but identified two novel structural variants, both resulting in fusion transcripts. Through in vitro studies, we demonstrated the oncogenic properties of these fusion transcripts. These experiments further highlight the tumorigenic role of gene fusions in the etiology of pediatric solid tumors and identify both candidate biomarkers and possible therapeutic targets for this lethal pediatric disease.
Study
phs000828
Idiopathic Multicentric Castleman Disease Whole Genome Sequencing Project
Multicentric Castleman disease (MCD) is a rare lymphoproliferative disorder driven by excessive production of inflammatory cytokines. Patients can exhibit a wide range of clinical features from mild flu-like symptoms to multiple organ failure and death. Human herpesvirus-8 (HHV-8) is a well-established cause of MCD in individuals who are immunosuppressed due to human immunodeficiency virus infection or other causes. On the other hand, it is not known what causes MCD in otherwise healthy individuals with no evidence of exposure to HHV-8 (idiopathic MCD or iMCD). Although iMCD is most common in 40-60 years old adults, it affects people of all ages and early-onset (childhood or young adulthood) cases have been reported. Germline mutations may underlie iMCD in such cases. Patients with early-onset iMCD and their family members were enrolled in the study. To identify germline mutations that may underlie iMCD, whole-genome sequencing was performed in patients and their biological parents from whom DNA samples were available.
Study
phs001706
NHLBI TOPMed: Outcome Modifying Genes in Sickle Cell Disease (OMG)
Sickle cell disease (SCD) is caused by homozygosity for a single mutation of the beta hemoglobin gene. Despite the constancy of this genetic abnormality, the clinical course of patients with SCD is remarkably variable. SCD can affect the function and cause the failure of multiple organ systems through the pathophysiologic processes of vaso-occlusion and hemolysis. These pathophysiological processes are complex and expected to impact multiple organ systems in a variety of ways. This study, therefore, was designed to identify genetic factors that predispose SCD patients to develop specific end-organ complications and to experience more or less severe clinical courses. We enrolled > 700 patients with Hb SS, Hb S-beta0 thalassemia and HbSC being followed primarily at three southeastern U.S. regional institutions (Duke University Medical Center, University of North Carolina Medical Center, and Emory University Medical Center). Medical information obtained included the presence or absence of specific targeted outcomes (overall disease severity as well as specific types of end organ damage). Clinical data include medical status (history, physical, examination, and laboratory results) and information regarding potentially confounding environmental factors. Limited plasma samples are available for correlative studies (e.g. of cytokine levels, coagulation activation). Targeted SNP for candidate gene analysis as well as GWAS has been performed on most samples. Whole genome sequencing has been conducted through the TOPMed Consortium. The subjects in this analysis were collected as part of a larger study, "Outcome Modifying Genes in Sickle Cell Disease" (OMG-SCD) aimed at identifying genetic modifiers for sickle cell disease. More information about the study can be found in Elmariah et al. (2014), PMID: 24478166. Clinical and genetic data have been used to identify genetic characteristics predisposing patients with SCD to a more or less severe overall clinical course as well as to individual organ-specific complications. It is anticipated that identification of such genetic factors will reveal new therapeutic targets individualized to specific complications of SCD, leading to improved outcomes and increased life expectancy for patients with SCD.
Study
phs001608
Whole-genome, whole-exome and transcriptome sequencing of pancreatic ductal adenocarcinomas from young adults (NCT MASTER)
Whole-genome, whole-exome and transcriptome sequencing of pancreatic ductal adenocarcinomas from young adults reveals recurrent NRG1-fusions in KRAS wild-type tumors.
Dataset
EGAD00001004068
South Asia Rheumatic Heart Disease Genetics Network
Rheumatic heart disease (RHD) is a major cause of cardiovascular death and disability amongst young adults. Building on a sample collection established for and genotyped in several earlier successful candidate gene studies, this case-control study investigated host genetic susceptibility to RHD using a genome-wide approach. The current dataset comprises over 850 individuals recruited in Uttar Pradesh, India, including cases of RHD based on echocardiographic diagnosis and controls recruited on the basis of normal echocardiograms. For this analysis all available samples were genotyped using the Illumina HumanCore-24 BeadChip platform. Efforts are now underway to establish new sample collections in other areas of India as well as Pakistan and later other neighbouring countries.
Study
EGAS00001003565
Genomic Landscape of Pediatric Germ Cell Tumors
Germ cell tumors (GCTs) are cancers of the testis, ovary and extragonadal sites that occur in infants, children, adolescents, and adults. Post-pubertal (type II) malignant GCTs may present as a seminoma, non-seminoma or with mixed histologies. In contrast, pre-pubertal (type I) GCTs are limited to (benign) teratomas and (malignant) yolk sac tumors (YST). Epidemiologic and molecular data have shown that pre- and post-pubertal GCTs arise by distinct mechanisms. Dedicated studies of the genomic landscape of type I and II GCTs in children and adolescents are lacking. Here we present an integrated genomic analysis of extracranial GCTs across the age spectrum from 0-24 years.
Study
phs002009
GeneScreen, a Population Based, Targeted Genomic Screening Study
The GeneScreen research study was based at the University of North Carolina (UNC) at Chapel Hill. It brought together researchers from different disciplines, including genetics, clinical medicine, bioethics, sociology, anthropology, psychology, and public health, to study the best ways to offer targeted genetic screening to the general adult population. DNA samples were collected from people who joined GeneScreen to look at a targeted panel of 17 genes. It tested for genetic differences (called mutations) that can cause one of 11 rare, but preventable or treatable conditions, including certain types of cancer and heart disease. Each condition has specific medical advice people can follow with their doctors to prevent or treat a serious health problem. The GeneScreen study was approved by the UNC Institutional Review Board and by the Kaiser Permanente Center for Health Research Institutional Review Board. GeneScreen is funded by the National Human Genome Research Institute of the National Institutes of Health.
Study
phs001817
Immune Response to Life-Threatening Respiratory Infections in Children and Young Adults
This is a cross sectional study describing age-related differences in the nasal mucosal immune response to SARS-CoV-2. We collected nasal mucosal samples from six groups: healthy children, children infected with SARS-CoV-2, children infected with respiratory syncytial virus, children infected with influenza virus, healthy adults, and adults infected with SARS-CoV-2. Samples were obtained by mucosal scrape biopsy of the inferior turbinate using a sterile plastic curette. We performed bulk RNA-sequencing to obtain gene expression data. Clinical data was extracted from the electronic health record including: demographics, medical history, clinical presentation, duration of symptoms, intensive care unit admission, hospital and intensive care unit length of stay, type and duration of respiratory support at sample collection and peak illness. Our analysis described the host response to SARS-CoV-2 in the nasal mucosa in children and adults and compared it to the host response to respiratory syncytial virus and influenza virus.
Study
phs002579
Long-Term Outcome in Offspring and Mothers of Dexamethasone-Treated Pregnancies at Risk for Classical Congenital Adrenal Hyperplasia Owing to 21-Hydroxylase Deficiency
Summary The purpose of this study is to perform the first long-term follow up study both of adolescents and young adults with a history of prenatal treatment with dexamethasone and of their mothers and to test for adverse medical or behavioral side effects. The emphasis will be on the outcome of this prenatal treatment in those fetuses who are not affected with steroid 21-hydroxylase deficiency (21OHD) form of congenital adrenal hyperplasia (CAH) and are either heterozygotes or homozygote-unaffected. Prenatal treatment of 46,XX fetuses with 21OHD (via administration of dexamethasone to the pregnant mother) has been shown to reduce the masculinization of the genitalia and, thereby, the later need for 'corrective' feminizing genital surgery and potential impairment of sexual functioning. Along with the suppression of excess prenatal androgen production and reduction of prenatal masculinization of the genitalia in 46,XX fetuses with 21OHD, prenatal dexamethasone treatment may reduce the behavioral masculinization that is well documented in untreated 46, XX patients with 21OHD. Those fetuses who are partially treated until diagnosis of an unaffected status (heterozygous or homozygous-unaffected) will be studied as well. This latter group is of importance because these fetuses are being unnecessarily treated, but we have no way of diagnosing the unaffected status before the 8th week of gestation when treatment must begin for the female fetus who is affected. As treatment is necessary only until term in the female fetus affected with classical CAH, male fetuses and unaffected or heterozygous female fetuses do not require treatment. Thus, only one out of eight fetuses will require prenatal treatment until term. Findings of adverse effects of glucocorticoid treatment in non-human mammals [1, 2] have raised concerns among other clinicians and investigators about potential adverse side effects of such treatment on the developing human. Thus, this study will address this concern and investigate the potential adverse side effects of prenatal treatment. Detailed Description Females with classical CAH owing to 21-hydroxylase deficiency are born with ambiguous genitalia due to the production of excess androgens in utero. Prenatal treatment with dexamethasone was inaugurated in 1978 by Maguelone Forest and has been the standard of practice in the United States since 1986. Dexamethasone, which crosses the placental barrier, suppresses the fetal adrenal gland production of androgens, thus preventing ambiguous genitalia in the affected female. Children of both sexes are prenatally treated as soon as pregnancies at risk are confirmed. Treatment in females with 21OHD continues to term, but is discontinued in males and unaffected females. To date, 685 pregnancies have been diagnosed, of which 366 fetuses were treated. These investigators will study prenatally treated adolescents and adults 12 years and older with respect to medical and behavioral outcomes (see Table 5). In addition, mothers of children prenatally treated for varying periods of time for suspected 21OHD will be studied for long-term side effects of dexamethasone treatment administered during pregnancy. The long-term outcome in these children and their mothers has never before been studied.
Study
phs001317
Sickle Cell Disease Implementation Consortium Registry (SCDIC Registry-BioLINCC)
Data Access NOTE: Please refer to the "Authorized Access" section below for information about how access to the data from this accession differs from many other dbGaP accessions.Objectives: To understand the barriers to care and other factors related to reduced healthcare utilization after the transition from pediatric to adult SCD care which may contribute to increased morbidity and mortality.Background: Fifty years ago, it was rare for individuals with SCD to live beyond childhood. Advances in care delivery and treatment have more than doubled the life expectancy of individuals with sickle cell disease (SCD) since 1972. Beginning in the 1970s, measures such as newborn screening, prophylactic administration of penicillin, and immunization against bacterial infections decreased complications and morbidity, increasing the length and quality of life of children with SCD. Blood transfusions are currently the only proven way to prevent some of the major complications of SCD, especially recurrent stroke. They are also used frequently to help manage some of the acute complications of SCD. More recently, the use of hydroxyurea (HU) as a therapeutic agent to increase fetal hemoglobin has been shown to further reduce the debilitating symptoms of and improve survival in SCD. L-glutamine and crizanlizumab are additional treatments to reduce pain crises and voxelotor is approved to lower the risk of anemia and improve blood flow. In the absence of a widely accessible cure, treatment for SCD is usually aimed at avoiding crises, relieving symptoms, and preventing complications. Many of the advances in treatment have not translated into an increase in longevity or quality of life for adolescents and adults because of disparities in access to routine primary health care.Individuals with SCD experience a markedly increased mortality beginning in the second decade of life. The third and later decades of life are frequently associated with severe chronic pain progressive organ damage and frequent hospitalizations. The provision of evidence-based and expert opinion-based care in SCD is complicated by the difficulties that many patients experience in obtaining access to the health care system and in receiving long-term care from knowledgeable providers. A registry of 2,400 people with SCD was established to provide a rich resource to study the natural history of SCD and to understand barriers to care during the transition from pediatric to adult care. Participants: 2,438 participants with a confirmed diagnosis of SCD were enrolled into the registry over an 18-month period. At the time of enrollment, all participants were 15-45 years of age, English speaking, and without a bone marrow transplant.Design: The SCDIC Registry is a longitudinal observational cohort study. At enrollment, data collection included a medical record abstraction of clinical history, the most recent lab results, and a participant survey which included several patient reported outcomes from PROMIS, ASCQ-Me and NeuroQoL. Similar surveys were administered at annual follow-up. Four to five years after enrollment, a second record abstraction was completed with all clinical events that occurred since enrollment and another set of recent labs. There were many research questions proposed to be answered with the study data thus there was no primary outcome.Conclusions: As noted previously, there were no primary outcomes determined at the start of the study. Research questions included examination of co-morbidities, healthcare utilization, and patient reported outcomes. Key findings from the Registry include:Executive dysfunction, learning difficulties, and poor comprehension significantly associated with poor activities of daily living skills. Executive dysfunction also associated with HU non-adherence. Continuing HU after conception increased odds of miscarriage or stillbirth compared to women who were not using HU. Continuing HU after conception increased odds of low birth weight in infants. Compared to adolescents, young adults had significantly more severe pain, organ dysfunction, mental health disorders, sleep problems, and barriers to medical care. Young adults were significantly less likely to see a sickle cell specialist in the past 2 years. Females had higher pain frequency and severity, more pain episodes, anxiety, depression, hospitalizations, and higher fetal hemoglobin levels. Males had more respiratory, musculoskeletal, genitourinary, and cardiovascular complications, more skin ulcers, and more use of HU. Among 128 subjects that died during the study period, iron overload, pulmonary hypertension, and depression, were significant predictors of the risk of death in multivariate analyses.
Study
phs004203
Modelling Multi-Dimensional ClinOmics for Precision Therapy of Children and Adolescent Young Adults with Relapsed and Refractory Cancer: A Report from the Center for Cancer Research
The Center for Cancer Research (CCR), of the intramural NCI undertook a multidimensional clinical genomics study of children and adolescent young adults with relapsed and refractory cancers who were enrolled on other therapeutic trials to determine the feasibility of a genome guided precision therapy protocol in these patients.Note: NCIPM001blood - This sample was previously submitted with study phs002207, with sample ID NCIPM001blood_E and SRR ID SRR18544311.
Study
phs001052
The Atherosclerosis Risk in Communities (ARIC) Study
The Atherosclerosis Risk in Communities (ARIC) Study, sponsored by the National Heart, Lung and Blood Institute (NHLBI), is a prospective epidemiologic study conducted in four U.S. communities. The four communities are Forsyth County, NC; Jackson, MS; the northwest suburbs of Minneapolis, MN; and Washington County, MD. ARIC is designed to investigate the etiology and natural history of atherosclerosis, the etiology of clinical atherosclerotic diseases, and variation in cardiovascular risk factors, medical care and disease by race, gender, location, and date. ARIC includes two parts: the Cohort Component and the Community Surveillance Component. The Cohort Component began in 1987, and each ARIC field center randomly selected and recruited a cohort sample of approximately 4,000 individuals aged 45-64 from a defined population in their community. A total of 15,792 participants received an extensive examination, including medical, social, and demographic data. These participants were reexamined every three years with the first screen (baseline) occurring in 1987-89, the second in 1990-92, the third in 1993-95, and the fourth and last exam was in 1996-98. Follow-up occurs yearly by telephone to maintain contact with participants and to assess health status of the cohort. In the Community Surveillance Component, currently ongoing, these four communities are investigated to determine the community-wide occurrence of hospitalized myocardial infarction and coronary heart disease deaths in men and women aged 35-84 years. Hospitalized stroke is investigated in cohort participants only. Starting in 2006, the study conducts community surveillance of inpatient (ages 55 years and older) and outpatient heart failure (ages 65 years and older) for heart failure events beginning in 2005. ARIC is currently funded through January 31, 2012. This study is part of the Gene Environment Association Studies initiative (GENEVA, http://www.genevastudy.org) funded by the trans-NIH Genes, Environment, and Health Initiative (GEI). The overarching goal is to identify novel genetic factors that contribute to atherosclerosis and cardiovascular disease through large-scale genome-wide association studies of well-characterized cohorts of adults in four defined populations. Genotyping was performed at the Broad Institute of MIT and Harvard, a GENEVA genotyping center. Data cleaning and harmonization were done at the GEI-funded GENEVA Coordinating Center at the University of Washington.
Study
phs000090
Glioma International Case Control Study (GICC)
The main goals of the GICC Study were: 1) to identify novel genetic risk variants for glioma, as well as validate variants implicated by previous genome-wide association studies of glioma; and 2) to explore biologically relevant gene-gene and gene-environment interactions in glioma susceptibility. The GICC Study includes participants from the following centers: Brigham and Women's Hospital (Boston, Massachusetts), Case Western Reserve University (Cleveland, Ohio), Columbia University (New York, New York), the Danish Cancer Society Research Centre (Copenhagen, Denmark), Duke University (Durham, North Carolina), the University of Texas MD Anderson Cancer Center (Houston, Texas), Memorial Sloan Kettering Cancer Center (New York, New York), the Mayo Clinic (Rochester, Minnesota), NorthShore HealthSystem (Chicago, Illinois), Umea University (Umea, Sweden), the University of California, San Francisco (San Francisco, California), the University of Southern California (Los Angeles, California), and the Institute of Cancer Research (London, United Kingdom).
Study
phs001319
Genome-Wide Association Study of HCC in Non-Asian USA Population
This GWAS of HCC relied on existing biological and data resources from 7 USA sites and one Canadian site which allows for the integration of genetic and environmental data. Total of 2199 case patients and 1103 non cancer controls were genotyped. Case patients are defined as patients with pathological or radiological diagnosis of HCC with and without chronic hepatitis C virus infection. Study was restricted to Caucasian population without prior infection with chronic HBV infection. All DNA samples were extracted from peripheral blood. Study population of the participated sites are: Site-1: University of Texas MD Anderson Cancer Center A hospital based-case-control study was initiated at UT MD Anderson Cancer Center in 2000 and approved by the Institutional Review Board of The University of Texas M. D. Anderson Cancer Center. Study design was previously described in detailed (1-3). Written informed consent for an interview and for a biological sample was obtained from each participant. At time of diagnosis prior to treatment exposure. Case patients were recruited from the population of patients with newly diagnosed HCC who were evaluated and treated at the institution's gastrointestinal medical and surgical oncology outpatient clinics. Inclusion criteria were as follows: a pathologically confirmed diagnosis of HCC, U.S. residency, and the ability to communicate in English. Patients with concurrent or past history of other types of cancers were excluded. Control subjects were healthy spouses of patients with cancers other than liver, gastrointestinal, lung, or head and neck (smoking-related cancers) undergoing treatment at our institution. Eligibility criteria for control subjects were the same as those for patients, except that control subjects could not have ever had cancer. A short, structured questionnaire was used to screen potential control subjects on the basis of eligibility criteria. Control subjects and patients were recruited simultaneously. Case patients and control subjects were interviewed by well-trained interviewers who followed a written protocol to guide ascertainment and reduce surveillance, interviewer, and recall bias. No proxy interviews were conducted. The interviewers used a structured and validated questionnaire to collect information about demographic characteristics and potential risk factors for HCC such as personal smoking history, alcohol use, medical history, occupational history, and family history of cancer. Blood samples from cases and controls were tested for HBV and HCV. HCV antibodies, hepatitis B surface antigen, and antibodies to hepatitis B core antigen were detected by use of a third-generation enzyme-linked immunosorbent assay (ELISA) (Abbott Laboratories, North Chicago, IL). Important clinical information were retrieved from patients' medical records. Total of 1188 case patients and 278 controls from MD Anderson study were genotyped. Site-2: Mayo Clinic Case patients and controls included in the study from Mayo clinic were recruited as part of Mayo International Hepatobiliary Neoplasia Register and Biorepository. Only USA patients and controls were participated. All cases and controls signed informed consent indicating their willingness to participate in genetic studies. Epidemiological and clinical data were collected from participants and retrieved from medical records. Total of 522 HCC case patients and 182 controls were genotyped in this study. Site-3: Toronto University Molecular Epidemiology of Hepatobiliary Tumors (HBT study, CAPCR 09-0289) is a hospital-based study that includes 1710 patients diagnosed with liver, bile duct or gallbladder adenocarcinoma (Hepatocellular carcinoma - HCC; cholangiocarcinoma - CCA; and gall bladder adenocarcinoma -GBCa) recruited at University Health Network (UHN) between 2009 and 2018. HBT study is a prospective study of cancer patients that collects self-reported demographic and epidemiological data, medical records/clinical information and blood samples for research purposes. Eligible patients recruited between 2009 and March 15 2016 were selected to participate in the Genome-Wide Association Study in Patients with Hepatocellular Carcinoma (GWAS in HCC, CAPCR 15-9943-CE), a sub-study of the HBT study. Eligible prospective patients recruited after March 15 2016 signed the Addendum to the HBT consent form and became eligible to participate in the GWAS in HCC study. Total of 271 cases and 21 controls were genotyped from Toronto University Site-4: University of Pittsburgh The design of the study has been described previously (4-6). Briefly, this population-based study enrolled 120 HCC patients and 230 matched controls from black, Hispanic, and non-Hispanic white residents of Los Angeles County who were between 18 and 74 years of age at diagnosis from January 1995 through December 2001. HCC Cases were identified through the Los Angeles County Cancer Surveillance Program. We sought to recruit up to two control subjects per case from the neighborhoods where HCC patients resided at the time of diagnosis, who were matched to the index case by sex, age (within 5 years), and race (Hispanic white, non-Hispanic white, black). Blood samples (plasma and buffy coat), medical and lifestyle factors were collected from all consenting participants. Total of 65 cases and 60 controls were genotyped in this study Site-5: South Western Dallas Under IRB approval, HCC Cases and controls were prospectively collected since 2015 from the outpatient Liver clinics from Parkland Health and Hospital System and UT Southwestern Medical Center, two large health systems in Dallas TX. A Total of 31 cases and 29 controls were genotyped from site 5. Site-6: Columbia University HCC patients from Columbia are those recruited as part of the Herbert Irving Comprehensive Cancer Center Database Shared Resource which seeks to recruit all cancer patients for potential future studies. The PI is Dr. Katherine Crew AAAL5871. This resource collects sociodemographic, lifestyle and clinical data on patients as well as a blood sample. Those who indicated on their consent form that they would participate in genetic studies were included. Total of 79 case patients with HCC were genotyped from Columbia University. Site-7: University of Michigan The study population included from University of Michigan were included from a prospective study of with chronic HCV infection recruited in Ann Arbor Patients who had under- gone liver transplantation, known coinfection with HIV, life expectancy <12 months due to extra-hepatic illnesses, or were receiving HCV treatment at enrolment, were excluded. Protocol, surveys, and data forms were developed where each enrolled patient completed the same questionnaire All patients provided written informed consent before enrolment in the study. The study was approved by the institutional review board or ethics committee at the University of Michigan. Detailed study description was previously published (7). Total of 44 cases and 347 controls were genotyped Site-8: Veteran Administration Medical Center in Houston This research including written informed consent form was jointly approved by Institutional Review Boards for the Baylor College of Medicine and the Michael E. DeBakey VA Medical Center in Houston, Texas. Study details have been previously published (8). Briefly, we prospectively recruited consecutive HCV-infected veterans prior to their previously scheduled HCV clinic visit at a large tertiary care VA medical center between May 1, 2009 and December 31, 2012. Patients completed a research assistant (RA) administered survey interrogating medical and risk factor history including lifetime alcohol use, had anthropometric measurements taken, and completed a fasting venipuncture for performance of the FibroSure-ActiTest as a measure of hepatic pathology. We restricted our current analysis to individuals who were: (1) White male veterans between 18 and 70 years; (2) had no history of HCC, liver transplant, decompensated liver disease including ascites, dementia, or psychosis; (3) were serologically-confirmed to have chronic HCV and to be negative for both HIV and active HBV infection; (4) were not currently receiving anti-HCV pharmacotherapy; and (5) had FibroSURE testing shown F3/F4 fibrosis consistent with cirrhosis.
Study
phs001744
Breast Cancer - Very young women
This project will focus on sequencing of breast cancer from young age cancer patients characterized by age at diagnosis under 35. Based on the incidence rate by Korean National Cancer Incidence Database, breast cancer is the most common solid tumors among young adults aged 15-34 excepting thyroid cancer (Cancer statistics in Korea (2012)). This project just started from the mid of 2015. National cancer center in Korea are collecting more than 100 tumor specimens from breast cancer patients aged 20-34. Genetic features of young patients will be investigated with somatic mutation pattern, structural variations, and canonical pathways.
Study
EGAS00001001908
Sexual dimorphism in human immune system aging
We isolated peripheral blood mononuclear cells (PBMCs) from 172 community-dwelling healthy volunteers (91 women, 81 men) ages spanning adult lifespan: 54 young (ages 22-40: 23 men, 31 women), 59 middle-aged (ages 41-64: 31 men, 28 women), and 59 older subjects (65+: 27 men, 32 women). PBMCs were profiled using ATAC-seq (54 men, 66 women), RNA-seq (41 men, 34 women), and flow cytometry (62 men, 67 women). Flow cytometry data is publicly available, genomic data for 121 individuals are provided here. These datasets revealed immune system aging signatures in men and women and showed in which ways aging differentially affects the immune systems of men and women. A subset of these samples have been generated and analyzed previously and can be found at EGA (EGAS00001002605). This dataset on dbGaP include EGA samples as well.
Study
phs001934
CSER: North Carolina Clinical Genomic Evaluation by Next-Gen Exome Sequencing 2
The North Carolina Clinical Genomic Evaluation by Next-gen Exome Sequencing, 2 (NCGENES 2) study is part of a larger consortium project investigating the clinical utility, or net benefit of an intervention on patient and family well-being as well as diagnostic efficacy, management planning, and medical outcomes. A clinical trial will be implemented to compare (1) first-line exome sequencing to usual care and (2) participant pre-visit preparation to no pre-visit preparation. The study will use a randomized controlled design, with 2x2 factorial design, coupled with patient-reported outcomes and comprehensive clinical data collection addressing key outcomes, to determine the net impact of diagnostic results and secondary findings.
Study
phs002110
Whole Exome Sequencing of controls performed at the Broad Institute on a cohort from Bristol, UK
These data were generated as part of a collaboration between University of Bristol, UK and the Stanley Center at the Broad Institute. This project sequenced and analysed the whole exomes of 2969 samples from the Avon Longitudinal Study of Parents and Children (ALSPAC). ALSPAC is a population-based pregnancy cohort with participants acting as ‘controls’ for this project. Genomic DNA from each sample was sequenced to a mean depth of 20x. The exome used Twist capture and samples were sequenced on Illumina NovaSeq 6000 machines producing CRAM files. ALSPAC recruited pregnant women in the Avon County of south-west England between 1991 and 1992. Of the 15,447 pregnancies, 14,901 children were alive at 1 year of age. DNA was extracted from child blood samples taken between ages 7 and 24.
Dataset
EGAD50000000716
Genomic and Transcriptomic Profiling of Patients with Malignant Pleural and Peritoneal Mesothelioma: The NCI Cohort
Malignant mesothelioma is an aggressive cancer with limited treatment options and poor prognosis. Better understanding of mesothelioma genomics and transcriptomics could advance novel therapies. We performed whole-exome sequencing (WXS) of germline and tumors of 122 patients, and RNA-sequencing (RNA-Seq) of tumor samples of 100 patients with pleural, peritoneal, and tunica-vaginalis mesothelioma.The biospecimens used in this study were collected as a part of the Mesothelioma Natural History Protocol (https://clinicaltrials.gov/ct2/show/NCT01950572), wherein, samples were collected from subjects with malignant mesothelioma as listed in the eligibility criteria below and written consent from each participant was obtained before sample acquisition.
Study
phs002207
Is the Gut Important in Multiple Joint Osteoarthritis? A Multimodal Investigation in Humans and Pet Dogs
The Gut Health in Multiple Joint Osteoarthritis (MJOA) Study leverages data from parallel community-based cohorts in humans and in pet dogs to elucidate the role of altered microbiota in MJOA. One hundred Johnston County Health Study human participants were 35 to 70 years of age at enrollment (2022-2023), self-identified as Hispanic, White, or Black, and lived in Johnston County, North Carolina. Demographic, clinical information, multiple joint radiographs, and stool samples for microbiome profiling by 16S rRNA gene sequencing were obtained from all participants. Similar data were collected from an independent group of pet dogs (N=115) from the local community, at the North Carolina State University (NCSU) College of Veterinary Medicine. The central hypothesis of the study is that intestinal permeability, with or without dysbiosis, is a major driver in the development and worsening of MJOA.
Study
phs003980
Longitudinal_Cambridge_Study_COVID19
Investigating tissue immune responses to SARSCoV-2 in live and deceased donors
This project is part of a longitudinal clinical study that will be run from Cambridge University Medical School (Joseph Cheriyen) studying covid-19 infected patients. The Clatworthy project would use single cell approaches to study severe fatal versus non-fatal in old and old versus young patients and will also provide some information on male versus female. This work will provide single cell resolution insights into immune response to the virus, providing vital information on the biology of covid-19 infection.
This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Study
EGAS00001004488
Hydroxyurea to Prevent Organ Damage in Children with Sickle Cell Anemia (BABY HUG) Phase III Clinical Trial and Follow-Up Observational Studies I and II
Sickle cell anemia is associated with substantial morbidity from acute complications and organ dysfunction beginning in the first year of life. In 1995, the Multicenter Study of Hydroxyurea (MSH) (dbGaP phs002348) demonstrated that, in adults, hydroxyurea is effective in decreasing the frequency of painful crises, hospitalizations for crises, acute chest syndrome, and blood transfusions by 50%. The phase I/II study of hydroxyurea in children (HUG KIDS) demonstrated that children have a response to hydroxyurea similar to that seen in adults in terms of increasing fetal hemoglobin levels and total hemoglobin, and decreasing complications associated with sickle cell anemia. In addition, this study demonstrated that the drug does not adversely affect growth and development between the ages of 5 and 15. A pilot study of hydroxyurea (HUSOFT) given to children between the ages of 6 months and 24 months demonstrated that the drug was well tolerated and that the fetal hemoglobin levels rose and remained elevated compared to baseline with continued hydroxyurea administration.
A Special Emphasis Panel (SEP) met on April 12, 1996 to review the results of the MSH trial and the progress to date of the HUG KIDS study. The SEP recommended that NHLBI undertake the BABY HUG trial.
The BABY HUG Randomized Controlled Trial concluded that hydroxyurea treatment in very young children seemed to have an acceptable safety profile and to reduce complications of sickle cell anemia. However, more data were needed on the long-term safety of hydroxyurea use in very young children. As a result, follow-up studies were initiated. The Follow-Up Study II provided longer follow-up than Follow-Up Study I, and included more assessment types than Follow-Up Study I.The BABY HUG program consisted of three related studies, each of which has associated datasets and bio-specimens.
A randomized controlled trial comparing hydroxyurea to placebo in very young children with sickle cell anemia (BABY HUG Randomized Controlled Trial)
The first observational follow-up study of children from the randomized controlled trial (BABY HUG Follow-Up Study I). All children in Follow-Up Study I were offered the option of taking open-label hydroxyurea, with treatment decisions made by the family and the clinical team caring for the child.
The second observational follow-up study of children from BABY HUG Follow-Up Study I. All children in Follow-Up Study II were offered the option of taking open-label hydroxyurea, with treatment decisions made by the family and the clinical team caring for the child.The purpose of the Randomized Controlled Trial was to determine if hydroxyurea can safely prevent early end organ damage in very young children with sickle cell anemia.
The purpose of the BABY HUG Follow-up Study I was to provide structured follow-up of the children enrolled in the BABY HUG Randomized Controlled Trial, in order to characterize the long-term toxicities and unexpected risks (if any) associated with treatment with hydroxyurea at an early age.
The objective of Follow-Up Study II was to obtain additional data about the long-term safety and efficacy of hydroxyurea use in children with Sickle Cell Anemia through at least the first decade of life.Instructions for requesting individual-level data are available on BioData Catalyst at https://biodatacatalyst.nhlbi.nih.gov/resources/data/. Apply for data access in dbGaP. Upon approval, users may begin accessing requested data in BioData Catalyst. For questions about availability, you may contact the BioData Catalyst team at https://biodatacatalyst.nhlbi.nih.gov/contact.
Study
phs002415
eMERGE Network's Multi-Center Pilot of Pharmacogenetic Sequencing in Clinical Practice
eMERGE-PGx is a multi-site test of the concept that sequence information can be coupled to electronic medical records (EMRs) for use in healthcare. The promise of personalized medicine - health care guided by each individual's biological characteristics - is being fostered by increasingly powerful and economical methods to acquire clinically relevant biomarkers from large numbers of people. One therapeutic area that seems especially ripe for an early test of the personalized medicine concept is pharmacogenomics (PGx) - the idea that individual variation in drug response includes a genomic component. Drug response variation is an accepted feature of virtually all drug treatments, and contemporary molecular biologic tools continue to identify key genes mediating drug metabolism, transport, and targets. Importantly, common variation in these genes is an increasingly well-recognized contributor, sometimes with large effects, to variation in drug responses. As a result, recommendations for genotype-guided therapy are increasing. These evidence-based recommendations, if implemented in health care practice, could reduce adverse drug events and improve time to therapeutic response. Through eMERGE-PGx, we are developing strategies for the optimal implementation of genetic sequence data into the clinical environment with the ultimate goal of improving patient care. Site and participants include: Children's Hospital of Pennsylvania (CHOP): The Center for Applied Genomics (CAG) at the Children's Hospital of Philadelphia (CHOP) is a high-throughput, highly automated genotyping and sequencing facility equipped with state-of-the-art genotyping and sequencing platforms. Children who are treated at the Children's Hospital Healthcare Network and their parents may be eligible to take part in a major initiative to collect more than 100,000 blood samples, covering a wide range of pediatric diseases. The PGx population selected for sequencing with the PGRNseq panel at CHOP is 1,650 children from CAG's biorepository with well-documented drug-related severe adverse events (SAEs) or EHR-based drug response profiles. SAEs were extracted from EPIC records and from CHOP's Adverse Event (AE) database, which documents every AE at CHOP. These AEs are classified by a medical review panel according to the causal relationship with the suspected drug into 'doubtful', 'possible', and 'probable'. Individuals with events classified as probable, severe and objective, were selected for sequencing. The drugs more frequently associated with adverse events are antibiotics, antineoplastics, immunosuppressants and psychotropic drugs. This cohort constitutes 50% of the target population. The remaining subjects were selected using EHR-based algorithms that we have developed and validated at CAG for identifying patients not responding to ADHD medication (primarily atomoxetine) and patients refractory to antiepileptic treatment from responders. Cincinnati Children's Hospital Medical Center/Boston's Children's Hospital (CCHMC/BCH): 811 CCHMC samples were obtained from children, adolescents or young adults exposed to medication or at risk for needing medication of study interest. 55% of participants were exposed to one or more opioids and their DNA source was a CCHMC study-specific biobank; while 27% of participants were at risk for needing an opioid for surgical pain management and were newly recruited. The remainder of the cohort was exposed to methylphenidate and their DNA samples were obtained from a CCHMC study-specific biobank. The focus of Boston Children's Hospital eMERGE PGx project is on individuals with epilepsy. Samples were taken from a current pharmacogenomics study already in place through which DMET analysis was run and used as confirmation for PGRN-Seq results. A total of 109 samples were sent for PGRN-Seq analysis at University of Washington. The remaining 141 epilepsy samples were from Children's Hospital of Philadelphia and underwent testing with PGRN-Seq at CHOP. Geisinger Health System: A research cohort of adult Geisinger Clinic patients was enrolled from community-based primary care clinics of the Geisinger Health System. Patients were eligible for enrollment if they were a primary care patient of a Geisinger Clinic physician and were scheduled for a non-emergent clinic visit. All data are from Geisinger patients who consent to participate in the MyCode project. MyCode participants agree to provide biological samples for broad research use, including genomic analysis, and for linking of sample data to information in the participant's Geisinger health record. The consent also permits sharing of de-identified data for research purposes. Group Health(GH)/University of Washington (UW): Potential GH participants for the PGx project were enrolled in the eMERGE Network through the Northwest Institute of Genetic Medicine (NWIGM) biorepository, and provided the appropriate consent to receive clinically relevant genetic results (N~6300). Participants were eligible if aged 50 - 65 years old at the time of their enrollment into the NWIGM repository, living, enrolled in GH's integrated group practice, and had completed an online Health Risk Appraisal. The selection algorithm was based on several data sources from the EHR at Group Health: 1. Demographics - participants with self-reported race as Asian or African ancestry were prioritized and selected to enrich for non-European ancestry; 2. Diagnosis and procedure codes - participants were selected if found to have a history of hypertension, atrial fibrillation (AF,) or congestive heart failure (CHF). Participants with a history of arrhythmia were added if the entire selection algorithm did not generate 900 individuals. We also enriched for participants with EHR evidence of actionable indications related to PGRNSeq genes. Participants were selected if found to have an ICD9 code for malignant hyperthermia, hypertension, atrial fibrillation, congestive heart failure or long QT syndrome (LQTS); 3. Laboratory values - if a participant had any laboratory event of creatine kinase (CK) > 1000, and were dispensed statins within 6 months of the event, then they were selected; and 4. Medications - participants were excluded if ever on carbamazepine or had a current regimen of warfarin. Essentia Institute of Rural Health, Marshfield Clinic, Pennsylvania State University (Marshfield): For this study, 750 subjects were selected and enrolled into PGx based on Vanderbilt's algorithm designed to enrich for patients who are most likely to receive one of three common drugs (Clopidogrel, Warfarin or Simvastatin) in the next 2-3 years. These patients were sent a letter of invitation and description of the PGx project. Follow-up phone calls were made, and interested subjects came in for a one time meeting to discuss the project and go through the informed consent with the research coordinator. If they were interested they signed the consent and HIPAA forms and gave blood. Subjects were chosen and enrolled into PGx independently of previous biobank participation. Mayo Clinic: The Right Drug, Right Dose, Right Time - Using Genomic Data to Individualize Treatment (The RIGHT Protocol) enrolled 1013 patients to test the hypothesis that prescribers could deliver genome-guided drug therapy at the point-of-care by using pharmacogenomic data preemptively integrated in the electronic medical record. Complete details regarding the study population have been previously described (Bielinski et al., 2014). Icahn School of Medicine at Mount Sinai School (Mt Sinai): Our study site is the Primary Care Associates (PCA) practice group of the Mount Sinai Faculty Practice Associates (FPA) of the Mount Sinai Medical Center in New York City. This practice has 12 physician providers. All patient encounters are documented and managed with EpicCare ambulatory electronic medical record. Active PCA Patients eligible for enrollment fulfilled the following criteria: a) age 50 or older receiving clinical care at Mount Sinai FPA PCA practice with at least one practice encounter within 18 months prior to commencement of enrollment; b) no history or current use of clopidogrel, warfarin, or simvastatin. Eligible patients were invited to participate through de novo recruitment by letter sent by their provider. Interested patients were screened for eligibility and enrolled to participate in the eMERGE PGX study on site by a dedicated research coordinator. In addition to de novo enrollment from clinical practice, patients of FPA PCA who had previously enrolled in Mount Sinai's BioMe Biobank program AND fulfilled eligibility criteria as stated under a) and b) were identified by chart review and samples sequenced at CIDR using PGRNseq platform (N=300). PGRNseq data from 291 samples passed stringent quality control and are included in the current data set. Furthermore, 56 of these patients carrying known and validated 'actionable' variants affecting prescribing of clopidogrel, warfarin, and/or simvastatin were enrolled in the eMERGE PGX study following invitation through recontacting by the Principal Investigator of the BioMe Program. Northwestern University: Participants for this study were recruited from the General Internal Medicine (GIM) clinic at Northwestern Medical Group (NMG). Patients were selected for invitation to participate if they had been seen a minimum of two times over the last four years, having a high likelihood to receive a prescription for warfarin, Plavix, or a statin, and are seeing a physician who has agreed to allow their patients to be contacted for the study. We utilized an algorithm developed at Vanderbilt and tailored to our population which uses our EHR to estimate the probability that individuals will receive a prescription for warfarin, Plavix, or a statin in the next three years. Participants were sent a letter explaining the study prior to their GIM appointment and offered participation at the time of their visit. Participants were consented on-site and blood drawn after consent was obtained. The GIM clinic consists of 39 primary care physicians who provide approximately 80,000 patient encounters per year. As with any large primary care clinic, a significant proportion of patients in GIM clinic suffer from a variety of chronic health conditions, such as diabetes, hypertension, and coronary artery disease. Over 50,000 individuals have been seen by GIM doctors in the past 5 years; 11,562 of these patients have evidence of a statin prescription in the EHR, 3,436 have evidence of a warfarin prescription, and 1,872 have evidence of a Plavix prescription. Vanderbilt University: The more than 1000 participants enrolled into Vanderbilt's eMERGE PGx study were newly recruited from the Cardiology and Internal Medicine Clinics and the Hillsboro Medical Group within Vanderbilt University Medical Center (VUMC). Patients were selected based on a predictive algorithm estimating the patient's likelihood of receiving Clopidogrel, Warfarin, and/or Simvastatin. The algorithm identifies primarily older middle-aged patients, and the mean age of the study group is 74. The cohort is approximately 45% female with 75% of subjects self-identified as EA and 24% as AA. Subjects were consented in person by study personnel following a routine clinic visit and an introduction to the study staff by their doctor. VUMC is a comprehensive health care facility dedicated to patient care, research, and the education of health care professionals. Translational research into the causes and treatment of disease as well as studying fundamental biological properties is the primary focus of discovery at Vanderbilt. Clinical research is conducted in Vanderbilt University Hospital, the Nashville Veterans Administration Hospital, Meharry General Hospital and in their associated outpatient clinics. These hospitals and clinics, all associated with the Vanderbilt system, each have full time Vanderbilt faculty and medical housestaff and provide clinical care and participate in research programs. The Vanderbilt Clinic is comprised of more than 95 adult outpatient specialty practices and received over 1.5 million ambulatory visits in 2012-13. The Vanderbilt Heart and Vascular Institute offers a comprehensive heart program offering diagnosis, medical treatment, minimally invasive therapies, surgical intervention and disease management, tailored to each individual's unique needs. All programs within the Vanderbilt Clinic have survival figures that surpass the national average.
Study
phs000906
CSER: South-Seq: DNA Sequencing for Newborn Nurseries in the South
SouthSeq, part of the Clinical Sequencing Evidence-Generating Research (CSER2) effort, aims to perform genome sequencing for infants in nurseries at hospitals in Alabama, Mississippi, Louisiana, and Kentucky. SouthSeq aims to sequence 600 newborn babies presenting with congenital anomalies, or other signs of a rare genetic disorder. The primary goals of SouthSeq are to provide early diagnoses to children with rare genetic conditions, to demonstrate the utility of genome sequencing as a frontline test with potential to improve intermediate and long-term clinical outcomes, and to equip non-genetics providers to deliver test results in order to facilitate the conduction of genome sequencing outside of major medical centers where patients may not have access to a genetics provider. SouthSeq also aims to ensure that genomic testing is applied across diverse communities; recruitment efforts focus on patients from rural and minority populations that have been historically under-represented in both medicine and genomics research. Sequencing and analysis for SouthSeq are being conducted at the HudsonAlpha Institute for Biotechnology in Huntsville, Alabama (https://www.hudsonalpha.org/). Enrollment of infants with signs of genetic disease is approved at five clinical sites, including nurseries at the University of Alabama at Birmingham, the University of Mississippi Medical Center, Woman's Hospital in Baton Rouge, Children's Hospital in New Orleans, and the University of Louisville. Both biological parents are enrolled when available so that follow-up testing can be performed to determine inheritance. Further, participant families may choose to opt-into return of secondary findings identified in the proband, which focus on pathogenic and likely pathogenic variation identified within ACMG SFv3.0 genes.
Study
phs002307
eMERGE: Northwestern (NUgene) WGS
The NUgene Project is a biorepository with longitudinal medical information from participating patients at Northwestern Medicine affiliated hospitals and outpatient clinics. Participants' DNA samples are coupled with data from a self-reported questionnaire (2 versions were used, 1 before and 1 after February 2006, both are included) and continuously updated data from our Electronic Medical Record (EMR) representing actual clinical care events. Northwestern has a state-of-the art, comprehensive inpatient and outpatient EMR system of over 3 million patients. NUgene has broad access to participant data for all outpatient visits as well as inpatient data via a consolidated enterprise data warehouse. NUgene participants consent to distribution and use of their coded DNA samples and data for a broad range of genetic research by third-party investigators. The electronic MedicalRecords and Genomics (eMERGE) Network is a consortium of 9 clinical sites with EMR linked DNA biobanks, including Northwestern University and its NUgene biobank, funded by the NHGRI (National Human Genome Research Institute) to investigate the use of electronic medical record systems for genomic research. The goal of eMERGE is to conduct genome-wide association studies in approximately 100,000 individuals using EMR-derived phenotypes and DNA from linked biorepositories. Using electronic phenotyping methods, the consortium has been and is using DNA samples from all participating sites to explore the genetic determinants of approximately 80 phenotypes, including both diseases and traits, for which the electronic phenotyping algorithms have or are being published on PheKB.org. Thus, for the eMERGE network, ~900 NUgene subjects were selected, as described in the study inclusion/exclusion criteria section below, to be genotyped using whole genome sequencing (WGS), for use in the eMERGE network as eMERGE subjects. The criteria used below were selected so that these subjects will likely meet the criteria to be cases and/or controls, of diverse races and/or ethnicities, for at least 1 of the ~80 phenotypes being studied within eMERGE, and could be recontacted for future research if needed.
Study
phs001191
Longitudinal Study of the Porphyrias
This is an observational, longitudinal study of approximately 800 individuals with various types of porphyria. Those participants will be evaluated annually for 5 years, or longer if they agree. Participation in this project will include: Participating in annual visits or contacts. Providing samples, including blood, buccal cells (cells from inside of the mouth), saliva, and urine. The type of samples and amounts may vary from person to person, depending on the type of porphyria. Giving permission for samples to be stored and used for porphyrias research. Granting permission to obtain your medical records. Providing information about your medical history and family history. Completing a questionnaire about your porphyria symptoms and quality of life.
Study
phs001278
Risk Assessment of Cerebrovascular Events (RACE) Study
This study includes 1,220 cases with young onset stroke (stroke before age 60 years) who are participants of the larger RACE study. Risk Assessment of Cerebrovascular Events (RACE) is an on-going existing case-control study of stroke now involving over 5000 imaging confirmed cases of stroke and 5000 controls, recruited from seven centers in Pakistan. The study is aimed to investigate the genetic, biomarker and lifestyle determinants of stroke and its subtypes. Cases are eligible for inclusion in the study if they: (i) are aged at least 18 years; (ii) present with a sudden onset of neurological deficit respecting a vascular territory with sustained deficit at 24 hours verified by medical attention within 72 hours after onset (onset is defined by when the patient was last seen normal and not when found with deficit); and (iii) the diagnosis is supported by CT/MRI; and (iv) present with a Modified Rankin Score < 2 prior to the stroke. Findings from patient's history, 12-lead ECG and CT or MRI of the brain. The mandatory procedures for inclusion in this investigation are: (i) clinical verification of cerebrovascular event within 72 hours of onset; (ii) neuroimaging CT (non-contrast) or MRI (MRI is not a mandatory investigation but recorded whenever ordered by the attending physician); and (iii) 12-lead ECG. All other ancillary investigations ordered by the attending physician are recorded as well. The TOAST classification method is used to classify ischemic stroke based on aetiology whereas the Oxfordshire classification is used to classify stroke neuro-anatomically. Control participants for this subset of young onset stroke were individuals enrolled in the Pakistan Risk of Myocardial Infarction Study (PROMIS), a case-control study of acute MI based in Pakistan. RACE capitalizes on the genetic data (including information on GWAS) that has already been collected from the healthy participants enrolled in PROMIS. RACE and PROMIS share similar methodology of recruitment. Participants from both these investigations are derived from similar catchment areas, hence providing an attractive opportunity for RACE to utilize PROMIS controls as common controls for genetic investigations. Controls in PROMIS were recruited following procedures and inclusion criteria as adopted for RACE cases. In order to minimize any potential selection biases, PROMIS controls selected for this stroke substudy were frequency matched to RACE cases based on age and gender and were recruited in the following order of priority: (1) non-blood related or blood related visitors of patients of the out-patient department; (2) non-blood related visitors of stroke patients; (3) patients of the out-patient department presenting with minor complaints (e.g. back pain, minor gastric complaints). Control subjects from the PROMIS study were genotyped at the Wellcome Trust Sanger Institute on the Illumina 660W Quad array. The Center for Non-Communicable Diseases, Pakistan, serves as the coordinating center for both RACE and PROMIS. More information on these research investigations can be found at www.cncdpk.com. This young onset stroke component to the RACE study was funded through the Gene Environment Association Studies initiative (GENEVA, www.genevastudy.org as one of three studies designed to assess the genetics of young onset stroke and modification of genetic effects by smoking. GENEVA is part of the trans-NIH Genes, Environment, and Health Initiative (GEI). Genotyping of 1,220 young onset stroke cases was performed at the Johns Hopkins University Center for Inherited Disease Research (CIDR). Data cleaning and harmonization were done at the GEI-funded GENEVA Coordinating Center at the University of Washington. This study is part of the Gene Environment Association Studies initiative (GENEVA, http://www.genevastudy.org) funded by the trans-NIH Genes, Environment, and Health Initiative (GEI). The overarching goal is to identify novel genetic factors that contribute to stroke through large-scale genome-wide association studies of cases and controls recruited within Pakistan. Genotyping was performed at the Johns Hopkins University Center for Inherited Disease Research (CIDR). Data cleaning and harmonization were done at the GEI-funded GENEVA Coordinating Center at the University of Washington.
Study
phs000456
Data of the Institute of Pathology, University Medical Center Freiburg, Germany
DAC for proteomics data deposited in EGA by the proteomics laboratory of the Institute of Pathology, University Medical Center Freiburg, Germany
Dac
EGAC50000000048
Single Cell Analysis of Human Airways in Healthy and Asthma volunteers
Bronchoscopies were collected from healthy and asthma volunteers. Cohort inclusion criteria for all subjects were: age between 40 – 65 years and history of smoking < 10 pack years. For the asthmatics, inclusion criteria were: age of onset of asthmatic symptoms ≤12 years, documented history of asthma, use of inhaled corticosteroids with(out) β2-agonists due to respiratory symptoms and a positive provocation test (i.e. PC20 methacholine ≤8mg/ml with 2-minute protocol). For the non-asthmatic controls, the following criteria were essential for inclusion: absent history of asthma, no use of asthma-related medication, a negative provocation test (i.e. PC20 methacholine >8 mg/ml and adenosine 5'-monophosphate >320 mg/ml with 2-minute protocol), no pulmonary obstruction (i.e. FEV1/FVC ≥70%) and absence of lung function impairment (i.e. FEV1 ≥80% predicted). Asthmatics stopped inhaled corticosteroid use 6 weeks prior to all tests. All subjects were clinically characterised with pulmonary function and provocation tests, blood samples were drawn, and finally subjects underwent a bronchoscopy under sedation. If a subject developed upper respiratory symptoms, bronchoscopy was postponed for ≥6 weeks.
Fibreoptic bronchoscopy was performed using a standardised protocol during conscious sedation. Six macroscopically adequate endobronchial biopsies were collected for this study, located between the 3rd and 6th generation of the right lower and middle lobe. Extracted biopsies were processed directly thereafter, with a maximum of one hour delay.
The medical ethics committee of the Groningen University Medical Center Groningen approved the study, and all subjects gave their written informed consent.
Dataset
EGAD00001005064
Multiomic profiling of pleomorphic rhabdomyosarcoma
Rhabdomyosarcoma (RMS) describes rare soft-tissue tumors that exhibit features of skeletal muscle differentiation. The most common subtypes in children are alveolar and embryonal rhabdomyosarcoma, with the alveolar subtype characterized by PAX3/7 fusions. A lesser known and rarer subtype, pleomorphic rhabdomyosarcoma (PRMS), occurs most frequently in adults vetween the ages of 40 and 50. This pleomorphic subtype is often misdiagnosed and little is known about its molecular characterization. Here, we conducted comprehensive genomic, transcriptomic, and methylation profiling of these tumors.
Study
EGAS00001007230
Hematopoietic Cell Transplant for Sickle Cell Disease (HCT for SCD)
The Center for International Blood and Marrow Transplant Research (CIBMTR) is a hematopoietic cell transplant registry that was established in 1972 at the Medical College of Wisconsin. The overarching goal of the registry is to study trends in transplantations and to advance the understanding and application of allogeneic hematopoietic cell transplantation for malignant and non-malignant diseases. Included in this dataset are children, adolescents and young adults with severe sickle cell disease who received an allogeneic hematopoietic cell transplant in the United States and provided written informed consent for research.Hematopoietic cell transplant for sickle cell disease is curative. Offering this treatment for patients with severe disease is challenging as only about 20-25% of patients expected to benefit have an HLA-matched sibling. Consequently, several transplantations have utilized an HLA-matched or mismatched unrelated adult donor and HLA-mismatched relative. Transplantation strategies have also evolved over time that has included transplant conditioning regimens of varying intensity, grafts other than bone marrow and novel approaches to overcome the donor-recipient histocompatibility barrier and limit graft-versus-host disease. The data that is available for sickle cell disease transplants have been utilized to report on outcomes after transplantation and compare outcomes after transplantation of grafts HLA-matched related, HLA-mismatched related, HLA-matched and HLA-mismatched unrelated donors. Collectively, these data have advanced our knowledge and understanding of hematopoietic cell transplant for this disease. These data can also serve as "contemporaneous controls" for comparison with other more recent curative treatments like gene therapy and gene editing.Data available for request include allogeneic hematopoietic cell transplants for sickle cell disease (Hb SS and Hb Sβ thalassemia) in the United States from 1991 to 2019. Follow-up data through December 2020 are available.Instructions for requesting individual-level data are available on BioData Catalyst at https://biodatacatalyst.nhlbi.nih.gov/resources/data/. Apply for data access in dbGaP. Upon approval, users may begin accessing requested data in BioData Catalyst. For questions about availability, you may contact the BioData Catalyst team at https://biodatacatalyst.nhlbi.nih.gov/contact.
Study
phs002385
Spit for Science
Spit for Science™ is a university-wide research opportunity with the scientific goal of understanding how genetic and environmental factors come together to influence substance use and emotional health across the college years and beyond. To address this goal, we have comprehensively and longitudinally studied eligible incoming freshmen (18 years or older) from a diverse urban university, assessing a wide range of risk and protective factors, including both biological susceptibility and environmental risk, and a variety of outcome measures, particularly alcohol use and other substances, and difficulties with emotional health. Our goal is to assess how risk and protective factors dynamically interact to contribute to behavior health outcomes over time and plan to use findings from the project to inform prevention and intervention efforts and aid in university policy and programming in ways that can support and promote student success. Methods: In the fall of their freshman year, first-time college students over the age of 18 are invited to complete an online survey containing broad questions about personality and behavior, as well as family, friends, and experiences growing up. Students can also provide a saliva sample and participate in the DNA component of the project (participation in the DNA component is not a requirement for participation in the survey). Each subsequent spring students are asked to complete a follow-up survey, enabling researchers to study patterns of substance use and emotional health across the college years.
Study
phs001754
Population Architecture using Genomics and Epidemiology (PAGE): Causal Variants Across the Life Course (CALiCo): Coronary Artery Risk Development in Young Adults (CARDIA)
CALiCo CARDIA The Coronary Artery Risk Development in Young Adults (CARDIA) Study is a study examining how heart disease develops in adults. It began in 1986 with a group of 5115 black and white men and women aged 18-30 years. The participants were selected so that there would be approximately the same number of people in subgroups of race, gender, education (high school or less and more than high school) and age (18-24 and 25-30) in Birmingham, AL; Chicago, IL; Minneapolis, MN; and Oakland, CA. These same participants were asked to participate in follow-up examinations during 1987-1988 (Year 2), 1990-1991 (Year 5), 1992-1993 (Year 7), 1995-1996 (Year 10), 2000-2001 (Year 15), and 2005-2006 (Year 20). A majority of the group has been examined at each of the follow-up examinations (90%, 86%, 81%, 79%, 74%, and 72%, respectively). While the specifics of each examination has differed somewhat, data have been collected on a variety of factors believed to be related to heart disease. These include conditions with clear links to heart disease such as blood pressure, cholesterol and other lipids. Data have also been collected on physical measurements such as weight and skinfold fat as well as lifestyle factors such as substance use (tobacco and alcohol), dietary and exercise patterns, behavioral and psychological variables, medical and family history, and other chemistries (e.g., insulin and glucose). In addition, subclinical atherosclerosis was measured via echocardiography during Years 5 and 10, computed tomography during Years 15 and 20, and carotid ultrasound during Year 20. A detailed description of the study and results from the first examination are summarized in Cutter et al (Controlled Clinical Trials, Volume 12, Number 1 [supplement], pages 1S-77S, 1991).
Study
phs000236
The PUWMa (
The PUWMa GWAS of ADHD is a multi-site collaboration initiated to conduct a family-based association study from existing research samples. The source sample of ADHD families was ascertained at Massachusetts General Hospital (MGH, N=309 trios), Washington University at St. Louis (WASH-U, N=272 trios), and University of California at Los Angeles (UCLA, N=156 trios). All offspring met criteria for DSM-IV-TR attention-deficit hyperactivity disorder with childhood onset.
Study
phs000358
Genetic sequencing of MODY patients.
Maturity-onset Diabetes of the young (MODY) is an early-onset, autosomal dominant form of non-insulin dependent diabetes. Genetic diagnosis of MODY can transform patient management. Earlier data on the genetic predisposition to MODY have come primarily from familial studies in populations of European origin. Using next generation sequencing, we carried out a comprehensive genomic analysis of 289 individuals from India that included 152 clinically diagnosed MODY cases to identify variants in known MODY genes. Our findings report that HNF1A and ABCC8 are among the most frequently mutated MODY genes in south India.
Study
EGAS00001001699
Gynecology and Lubricant Effects (GALE) Study
The GALE study is a longitudinal study of the association between lubricant use and the vaginal microbiota in women receiving transvaginal ultrasound (TVUS) at the Diagnostic Radiology and Nuclear Medicine Department at the University of Maryland School of Medicine. Participants undergo a clinical exam at enrollment (at least 5 days prior to TVUS) and self-collect daily, mid-vaginal samples in the week leading up to TVUS, including immediately before the procedure. At the TVUS appointment, the sonographer applies a single-use packet of Glide lubricating jelly to the probe. The product was selected for study as it was currently being used in the TVUS clinic at the University of Maryland Medical Center and is a clinical lubricant which is very similar in composition to many sex lubricants. Participants collect samples the evening after TVUS and present for a follow-up clinical exam within a week of TVUS. Participants continue to sample twice weekly for an additional 8 weeks (returning completed kits each week), complete daily health and behavior diaries, and attend one final clinical exam at week 10. The clinician evaluates participants for bacterial vaginosis (BV) at each clinical visit using Amsel's criteria, and also looks for signs of vulvovaginal candidiasis (VVC) and trichomonas using a wet mount.
Study
phs002211
Studies of a Targeted Risk Reduction Intervention through Defined Exercise (STRRIDE)
The STRRIDE study is the combination of three interventional trials to evaluate dose and intensity of physical activity on cardiometabolic health. The three component studies are Peripheral Effects of Exercise on Cardiovascular Health (STRRIDE I, ClinicalTrials.gov Identifier: NCT00200993), Effects of Resistence and Aerobic Exercise on Cardiovascular Health (STRRIDE II, ClinicalTrials.gov Identifier: NCT00275145) and Exercise Dose-Response Effects in Prediabetes (STRRIDE-PD, ClinicalTrials.gov Identifier: NCT00962962).
Study
phs001855
NIH Human Microbiome Project - Core Microbiome Sampling Protocol A (HMP-A)
This first clinical study of the Human Microbiome Project (HMP) addresses whether individuals share a core human microbiome. It involves broad determination of the microbiota found in five anatomical sites: the oral cavity, skin, nasal cavity, gastrointestinal tract and vagina. This study will enroll approximately 300 healthy male and female adults, 18-40 years old, from two geographic regions of the US: Houston, TX and St. Louis, MO. The participation of healthy individuals will create a baseline for discovery of the core microbiota typically found in various areas of the human body. The information from this initial study can then be used to help assess the changes in the complement of microbiota found on or within diseased individuals.
Study
phs000228
TMM whole genome analysis of 4566 Japanese individuals
Tohoku University Tohoku Medical Megabank Organization (ToMMo) and Iwate Tohoku Medical Megabank Organization (IMM) were founded to establish an advanced medical system to foster the reconstruction from the Great East Japan Earthquake. These organizations are developing a biobank that includes medical and genome information for supporting health and welfare in the Tohoku area. In the first stage, the part of our mission was to sequence the 4,000 individuals to construct Japanese whole-genome reference panel.
Study
JGAS000239
Whole genome sequencing of primary and metastatic Melanoma cases in an Australian cohort.
Melanoma is the fourth most common cancer in Australia and the leading
cause of cancer death in young adults. The Australian Melanoma Genome
Project (AMGP) is analysing whole genomes from melanomas. We include the results of whole
genome sequencing (WGS) for a number of datasets that include cutaneous, acral and mucosal melanoma subtypes.
Study
EGAS00001001552
DAC for DRS peripheral blood, University Medical Center Johannes Gutenberg University Mainz
This DAC is in relation to the DRS samples of peripheral blood of healthy control
Dac
EGAC50000000667
Medical University of Vienna - Nephrogene
Dac
EGAC00001002117
Genomic Landscape and Clonal Architecture in Pediatric Myeloid Neoplasms with Chromosome 7 Deletions
Full and partial loss of chromosome 7 (del7, 7q-) are prevalent cytogenetic abnormalities in pediatric myeloid neoplasms resulting in haploinsufficiency of key hematopoietic regulators on chromosome 7. While the genomic landscape of myeloid neoplasms in adults with del7 and 7q- has been described, pediatric patients with similar chromosome 7 losses have not been compiled into a single, comprehensive study utilizing modern sequencing strategies. To this end, we assembled a cohort of 108 pediatric and young adult patients representing a mixture of diagnoses. Using WGS, WES, and RNAseq, we compared the co-occurring mutations and gene expression levels in our cohort to those found in pediatric and young adult patients with two intact copies of chromosome 7. We also performed single-cell multi-omics on a subset of eight samples.
Study
EGAS00001008055
GWAS genotype data of Japanese
GWAS genotype data of the Japanese population (N=2,380) were collected at Osaka University and the affiliated medical institutes. GWAS genotyping was performed using Infinium Asian Screening Array (Illumina, USA).
Study
EGAS00001006423
Medical University of Vienna - AdameykoLab & Nephrogene
Dac
EGAC00001003383
Department of Clinical Genetics of Maastricht University Medical Center
Dac
EGAC00001002259
Khoe-San genome Project (KSGP)
The Khoe-San Genome Project (KSGP) includes whole genome data generated for 190 study participants from Southern Africa (Namibia and South Africa) representing people speaking a Namibian Khoe-San language (n=150, 12 groups) or regional non-Khoe-San groups (n=40), specifically defined here as Namibian Himba (n=10), South African Southern Bantu (n=12 and representing the Ubuntu Genome Project led by the University of Limpopo) or South African Coloured/European (n=18). Deep whole genome data has been generated for the purposes of cataloguing the extent of regional genetic diversity, creating population specific references and evolutionary trajectories, while creating a population-matched panel of normal (PoN) genetic diversity required for disease associated studies across the region.
Dac
EGAC50000000798
Identification of the dismal subtype of B-ALL with dysregulation of CDX2 and UBTF
PURPOSE: Advance in the knowledge of genomic basis of B-cell acute lymphoblastic leukemia (B-ALL) has changed the treatment approaches and diagnostic assays and improved their outcome. Despite the recent development of next generation sequencing approaches, there are still some cases that their drivers are unknown. We aimed to define and describe the characteristics of an additional B-ALL subtype by integrating genomic, transcriptomic, and epigenomic approach.
METHODS: More than 2,000 B-ALL cases of RNA-seq data from from Eastern Cooperative Oncology Group (ECOG) and the American College of Radiology Imaging Network (ACRIN) (n=764), the Children's Oncology Group (COG), Munich Leukemia Laboratory (MLL) (n=280), and other several collaborators were analyzed to identify the new subtype. Whole genome sequence was performed to detect mutations, stractural variants, and copy number alterations. The tridimentional analysis of chromatin was performed with HiChIP.
RESULTS: We identified the new high-risk B-ALL subtype “CDX2/UBTF” that exhibited unique gene expression profiles, which is enriched in female (77.3%), adolescents and young adults (52.6%), and relapse cohort (3.3%). CD10 negativity and IgM positivity are hallmarks of this subtype with NTRK3 expression that can be an effective marker at diagnostic approach and a potential targeted therapy with TRK and mulkinase inbitors, larotrectinib and entrectinib. Genetically, two different alterations define CDX2/UBTF. One is UBTF-ATXN7L3 fusions caused by deletion of 17q21.31, and the other is extopic CDX2 expression through the enhancer hijacking mechanism induced by deletion of 13q12.2. Other genomic features include gain of 1q and PAX5 rearrangement (PAX5-ZCCHC7), might induce upregulation of histone cluster genes and PAX5.
CONCLUSION: We described the novel B-ALL subtype “CDX2/UBTF” that has unique clinical and genomic characteristics.
Study
EGAS00001005863
Kids First Pediatric Research Study in Familial Predisposition to Hematopoietic Malignancies (SJFAMILY-HM)
The Gabriella Miller Kids First Pediatric Research Program (Kids First) is a trans-NIH effort initiated in response to the 2014 Gabriella Miller Kids First Research Act and supported by the NIH Common Fund. This program focuses on gene discovery in pediatric cancers and structural birth defects and the development of the Gabriella Miller Kids First Pediatric Data Resource (Kids First Data Resource). All of the WGS and phenotypic data from this study are accessible through dbGaP and kidsfirstdrc.org, where other Kids First datasets can also be accessed. Acute lymphoblastic leukemia (ALL) is the commonest childhood tumor and a leading cause of cancer death in children, adolescents and young adults. Hodgkin and non-Hodgkin lymphoma are also important hematologic malignancies (HM) that occur in children. Each are genetic diseases with growing evidence for a germline predisposition of both familial and sporadic cases, however the inherited genetic basis of ALL/lymphoma are poorly understood. Such knowledge is essential to gain mechanistic insight into the basis of tumor formation, and to guide genetic counseling and genetic management. Here we have assembled a large collection of familial HM kindreds, and extended recurrence cohorts of ALL and HL which will be used to identify the genetic basis of familial HM, examine the frequency of germline variants in sporadic ALL and HL, and to integrate inherited and somatic genomic data. These studies have high potential to provide fundamental new insights into the inherited genetic basis of HM, to provide important information to guide clinical management, and to provide an invaluable public resource of genomic data.
Study
phs001738
German early-onset prostate cancer cohort of the Pan-Prostate Cancer Genome (PPCG) project
Prostate cancer is the most frequent malignant tumor in males and the second most frequent cause of cancer-related death. Currently, in Germany, more than 60,000 prostate cancers are diagnosed every year. Although most of these patients are treated in a curative attempt, more than 10,000 German men die from prostate cancer annually. Owing to the demographic changes of our society, a further doubling of prostate cancer incidences during the next 20 years is expected. Prostate cancer is generally considered a tumor of elderly men. However, a fraction of prostate cancers are diagnosed at the age of 55 years or less. For several reasons, these “early onset prostate cancers” may represent a key entity for the understanding of prostate cancer biology. First, it is likely that early onset prostate cancers represent a distinct molecular subgroup of prostate cancer (PCa), potentially characterized by relatively small numbers of genetic changes, some of which may be particularly strong driver mutations for PCa development. Second, a fraction of prostate cancers in young individuals could represent classical prostate cancers that are detected at a very early stage and might therefore have accumulated molecular changes/mutations occurring that are most instrumental for prostate cancer early detection. Third, PCa with hereditary backgrounds are likely to accumulate in the age group below 55 years. A comparison with other sample sets (e.g. from other ICGC consortia), a systematic genomic analysis of young men with PCa could therefore lead to the detection of mechanisms for hereditary PCa. Fourth, a better understanding of these tumors is particularly relevant as finding optimal treatment regimens is most critical in young cancer patients. We are analyzing the entire genomic DNA sequences of at least 200 PCas (and matched non-tumorous DNA) of young men (≤ 55y), to at least 30 fold coverage, and integrate single-nucleotide variants and genomic structural variations with differential methylation, mRNA and miRNA expression data.Tissues were collected at the University Medical Center Hamburg-Eppendorf. Sequencing was performed at DKFZ and NCT (Heidelberg), EMBL (Heidelberg), and MPIMG (Berlin). Data management and bioinformatic data analyses is being conducted at DKFZ.
Study
EGAS00001003373
Data Access Commitee of The First Affiliated Hospital of Nanjing Medical University
Dac
EGAC00001001209
Oxidative DNA damage and ubiquitin proteasome system dysfunction underpins neurodegeneration in young adults with a DNA repair disorder
Deposited here are WGS data generated from differentiated neural stem cells (NSCs) of NXPG-32 patient (XPG patient with neurodegeneration) and a matched healthy control CTRL-33.
Dataset
EGAD50000000232
Data Access Commitee of Nanjing Medical University Affiliated Jiangsu Cancer Hospital
Dac
EGAC00001003206
Data Access Commitee of Nanjing Medical University Affiliated Jiangsu Cancer Hospital
Dac
EGAC00001001774
Data Access Commitee of Nanjing Medical University Affiliated Jiangsu Cancer Hospital
Dac
EGAC00001002086
Institute of Medical Genetics and Pathology, University Hospital Basel, Basel, Switzerland
Dac
EGAC00001002346
Targeting the DNA Repair Pathway in Ewing Sarcoma
Ewing’s Sarcoma is a bone and soft tissue tumor that primarily affects adolescents and young adults. With current therapies, 70% of patients with localized disease survive but survival for metastatic and recurrent disease is poor. Whole genome sequencing of 19 Ewing’s Sarcoma tumors showed that STAG2 was mutated in 10% (2/19) of the tumors and STAG2 protein was absent in 14% (13/106) tumors by immunohistochemical staining. Previous studies have shown that glioblastoma cells lacking STAG2 are more sensitive to poly-ADP ribose polymerase (PARP) inhibitors. We found that Ewing’s Sarcoma cell lines are sensitive to PARP inhibitors irrespective of STAG2 protein expression. Ewing Sarcoma cell lines are defective in double strand DNA break repair. PARP inhibitor cytotoxicity in Ewing’s Sarcoma cells was potentiated 10-1,000 fold by DNA damaging agents (irinotecan and temozolomide). To extend these studies in vivo, we developed an orthotopic Ewing’s Sarcoma mouse model and performed pharmacokinetic and pharmacodynamic studies with three different PARP inhibitors (BMN-673, olaparib and veliparib) in clinical development for pediatric cancer. Those data were used to design preclinical phase I studies to identify tolerable drug combinations for pilot efficacy testing (preclinical phase II). Based on the results of the preclinical phase I/II data, we performed a double blind, randomized, placebo controlled preclinical phase III trial with 274 mice in 15 treatment groups. Irinotecan administered in a low-dose protracted schedule optimized for pediatric patients was an effective DNA damaging agent to combine with olaparib and BMN-673 and was better tolerated than combinations with temozolomide. Combining olaparib or BMN-673 with irinotecan and temozolomide gave complete and durable responses in over 80% of the mice.
Study
EGAS00001000839
Massachusetts General Hospital/Eisai National Institute of Mental Health (NIMH) Genetics Initiative Alzheimer's Disease GWAS - Affymetrix GeneChip Human Mapping 500K Array Set
This study cohort was collected as part of the National Institute of Mental Health (NIMH) Alzheimer's disease Genetics Initiative Study. Recruitment of families, predominantly sibling, for the cohort was focused on local memory disorder clinics, nursing homes, and the surrounding community. Clinical evaluation was performed at the University of Alabama at Birmingham, Johns Hopkins University, and Massachusetts General Hospital/Harvard Medical School. Informed consent was obtained from all study participants, for participants with extensive cognitive impairment, a legal guardian or caregiver with power of attorney provided consent. To determine age of onset, a creditable informer was interviewed and medical records were evaluated.
Study
phs000483
Illumina HumanOmniExpress genotyping data from the TEENAGE (TEENs of Attica: Genes and Environment) study.
The TEENAGE study is a cross-sectional study comprising healthy Greek adolescents. The study was conducted according to the guidelines specified in the Declaration of Helsinki and all procedures involving human subjects were approved by the Institutional Review Board of Harokopio University and the Greek Ministry of Education, Lifelong Learning and Religious Affairs. The TEENAGE study target population comprised adolescent students aged 13 to 15 years attending the first three classes of public secondary schools located in the wider Athens area of Attica. The regions across Attica from which students were recruited were physically similar and all urban in nature. Sampling of regions and schools selected to participate was random. Students were enrolled from the wider urban region to ensure that they were coming from families with varying educational background and socio-economic status. Moreover, in order to exclude any vast economic disparities, all participating schools were exclusively public schools, which are free of charge. A total of twenty-four schools participated in the study out of 285 junior secondary public schools in Attica. Anthropometric, medical and dietary information was obtained. DNA samples were genotyped using Illumina HumanOmniExpress BeadChips (Illumina, San Diego, CA, USA) at the Wellcome Trust Sanger Institute, Hinxton, UK. Genotype calling algorithm used was Illuminus (Teo et al., 2007).
Study
EGAS00001000996
Department of Hematology and Oncology, Nagoya City University Graduate School of Medical Sciences
Dac
EGAC00001001166
Data Access Committee of the Medical Faculty of the University of Basel (MF-DAC)
Dac
EGAC00001002981
WES data from tumors and matching controls collected from 11 UTSW translocation renal cell carcinoma (tRCC) patients.
This study investigates the progression of translocation renal cell carcinoma (tRCC) based on Whole Exome Sequencing (WES) data from 11 patients. The samples were collected from affiliated hospitals of the University of Texas Southwestern Medical Center (UTSW), including Parkland Hospital and Children’s Medical Center. Tumor samples and their matched normal DNA were sequenced on the Illumina platform.
Study
EGAS50000000126
Center for Craniofacial and Dental Genetics (CCDG): Genetics of Orofacial Clefts and Related Phenotypes
The purpose of this study is to investigate the genetics of orofacial clefts (OFCs) in a large study population, and importantly, to incorporate subclinical phenotypic features into these studies. Orofacial clefts (OFCs) comprise a significant fraction of human birth defects (about 1/700 live births (Rahimov et al. 2012) and represent a major public health challenge, as individuals with these anomalies require surgical, nutritional, dental, speech, medical and behavioral interventions, thus imposing a substantial economic and personal burden (Berk and Marazita 2002*). The most common forms include OFCs of the lip alone (CL, Figure 1A), CL plus cleft palate (CL+CP, Figure 1B) or of the palate only (CP, Figure 1C). Individuals born with OFC may have their first surgical repair at age 3 months, but this initial surgery is just the beginning of a lifetime of health burdens. An individual born with an OFC has a hospital use rate increased for most ages (up to 233% increase for children ages 0-10 years and 16% for middle aged adults (Wehby et al. 2012). Healthcare costs for children with OFCs are estimated to be 800% greater compared with their unaffected peers (Boulet et al. 2009). Data from Denmark show that people born with CL with or without CP (CL/P) have an increased mortality up to age 55, which may be attributed to an increased risk of suicide and/or certain cancers (Christensen et al. 2004). The focus of most OFC genetic research has been CL and/or CP. Furthermore, the majority of OFC, i.e. about 70% of CL/P and 50% of CP is considered "nonsyndromic" (Jones 1988), i.e. isolated anomalies with no other apparent cognitive or structural abnormalities. Figure 1 Sample OFC Types A: Bilateral Cleft Lip; B:Cleft Lip plus Cleft Palate; C:Cleft Palate Alone The factors leading to the majority of nonsyndromic OFCs are still unclear, particularly at an individual family level. As is true for many complex traits, substantial progress in gene identification has occurred in the OFC field in the last two years (Dixon et al. 2011; Marazita 2012). Genome Wide Association Studies (GWAS) and sequencing studies to date by our research team and others have focused on genetic risk factors for overt CL/P and CPO-and have been very successful. A major finding from this work is that OFCs exhibit significant genetic heterogeneity, i.e., multiple genetic regions have been implicated (Beaty et al. 2010; Ludwig et al. 2012). Thus, approaches are needed to understand this genetic heterogeneity. Are there GxG interactions at work? Are there subsets of families, each due to a different gene? Our research group has shown that a promising approach to dissect the etiology of OFC is to focus on subclinical phenotypic features within entire cleft families (not just in affected cases, but also in their non-cleft relatives). These subtle features are believed to represent mild manifestations of the same underlying genetic susceptibility responsible for OFCs; as such, their inclusion in case-control and family-based genetic studies can help to clarify and refine the relationship between genotype and phenotype. The study population comprises a large number of families and individuals (~12,000 individuals) from multiple populations worldwide (Caucasians from the US and Europe, Asians from China and the Philippines, Mixed Native American/Caucasians from South America, and Africans from Nigeria and Ethiopia). There are cases, case families (nuclear families and extended kindreds), as well as controls with no history of OFC nor other developmental defects. *Berk NW, Marazita ML (2002) Costs of Cleft lip and Palate: Personal and Societal Implications. In: Wyszynski DF (ed) Cleft Lip and Palate: From Origin to Treatment. Oxford University Press, Inc., New York, pp 458-467.
Study
phs000774
Genome-Wide Association Study in Systemic Sclerosis
The Scleroderma Family Registry and DNA Repository (Registry) was initially developed as a registry and bio-specimen repository of patients with systemic sclerosis (scleroderma), family members and unaffected healthy controls. A case-control design was later adopted due to the lack of availability of many parents in this adult-onset disease, which precluded a linkage approach. In addition to collecting demographic data, the registry included the collection of disease-pertinent, cross-sectional, clinical information from medical records of affected participants. Registry participants were >/= 18 years of age at enrollment. On the basis of medical record review, all cases were verified by the principal investigator to meet the eligibility criteria. 800+ blood samples from verified affected cases of European ancestry were selected for a Genome Wide Association Study (GWAS). These genetic data, as well as phenotypic data, are available in dbGaP. DNA, serum and plasma from Registry participants, which are currently being stored in the University of Texas Health Science Center - Houston, Scleroderma Registry Genetic Repository, are available for research purposes. However, not all of the 800+ registry participants who submitted a blood sample currently have DNA available due to depletion of some samples. In addition to including SNP data from the genome-wide (first study release) and immunochip (second study release) scan on the 800+ samples, the dbGaP database provides a set of data tables with phenotypic information collected from the participants in the registry.
Study
phs000357
DAC - Dept. of Pathology LUMC
DAC of the Department of Pathology of Leiden University Medical Center
Dac
EGAC50000000376