The OPPERA project has two sub-studies investigating effects of genetic, physiological, psychological and clinical factors on the risk of developing painful temporomandibular disorder (TMD). OPPERA I baseline case-control study of chronic TMJD (dbGaP study accession phs000762). This sub-study uses a case-control study design to investigate characteristics associated with chronic TMD. The parent OPPERA phenotype file contains an indicator variable (PHASE1CASECONTROL) identifying participants in this sub-study. Data users should use the relevant case-classification variable (CHRNONICTMDCASE) and apply statistical methods appropriate for case-control studies when analyzing TMJD in this sub-study. OPPERA I prospective cohort for first onset of TMJD (dbGaP study accession phs000850). This sub-study uses a prospective cohort study design to investigate characteristics that predict risk of developing first-onset TMD. The parent OPPERA phenotype file contains an indicator variable (PROSPECTIVECOHORT) identifying participants in this sub-study. Data users should use the relevant case-classification variable (INCIDENTCASE) and the relevant time-in-study variable (FOLLOWUPYEARS) when analyzing TMJD in this sub-study. It is recommended that data users apply statistical methods appropriate for cohort studies of a dynamic population (for example, Cox regression models, where INCIDENTCASE is the censoring variable and FOLLOWUPYEARS is time in study).
Acute Myeloid Leukemia (AML) remains a clinical challenge since most patients diagnosed with AML will die from the disease. Some patients harbor treatment-refractory disease and many others relapse with disease that in many cases is resistant to treatments. Our study was designed to understand the molecular basis of disease progression in AML through assessing genomics signatures in patient specimens collected through an international collaboration which assembled samples from 138 AML patients which experienced disease relapse and normal hematopoietic cells (n=15). It is hoped that this resource will help researchers understand mechanisms of disease relapse in AML and contribute to the general pool of data available for analyses for this disease and general research use.
The Electronic Medical Records and Genomics (eMERGE) Network is a National Institutes of Health (NIH)-organized and funded consortium of U.S. medical research institutions. The primary goal of the eMERGE Network is to develop, disseminate, and apply approaches to research that combine biorepositories with electronic medical record (EMR) systems for genomic discovery and genomic medicine implementation research. eMERGE was announced in June 2007 and began its third phase in September 2015. eMERGE Phase III (June 2015 - June 2020) consists of 10 study sites, two central sequencing and genotyping facilities, and a coordinating center. Included in this study are: Human Reference Consortium (HRC) single nucleotide variants and 1000 Genomes structural variants imputed array data of 105,108 eMERGE participants from nine Phase III study sites and three Phase II study site collaborators. Corresponding demographics, body mass index measurements. Case/control status for the following phenotypes: Abdominal aortic aneurysm; Ace-Inhibitor/Cough; Attention Deficit Hyperactivity Disorder; Age-related macular disease; Appendicitis; Asthma; Atopic Dermatitis; Autism; Benign Prostatic Hyperplasia; Carotid artery disease as a Quantitative Measure; caMRSA; Cataract; Clostridium difficile colitis; Extreme Obesity; Chronic Kidney Disease; Chronic Kidney Disease and Type 2 Diabetes; Chronic Kidney Disease, Type 2 Diabetes and Hypertension; Colon Polyps; Cardiorespiratory Fitness; Dementia; Diverticulosis; Diabetic retinopathy; Gastroesophageal Reflux Disease; Glaucoma; Height; Heart failure; Hypothyroidism; Lipids; Ocular hypertension; Peripheral Arterial Disease; QRS duration; Red blood cell indices; Remission of Diabetes after ROUX-EN-Y gastric bypass surgery; Resistant hypertension; MACE while on Statins; Type 2 Diabetes; Venous Thromboembolism; White blood cell indices; and Zoster virus infection. Study sites and participants include: Boston Children's Hospital: The Gene Partnership (TGP) is a prospective longitudinal registry at Boston Children's Hospital (BCH) to study the genetic and environmental contributions to childhood health and disease, collect genetic information on a large number of children who have been phenotyped, and implement the Informed Cohort and the Informed Cohort Oversight Board (ICOB). The term "The Gene Partnership" reflects a partnership between researchers and participants. Children seen at BCH are offered enrollment, as are their parents and siblings. DNA is collected on all enrollees. BCH has a comprehensive EMR system, and virtually all inpatient and outpatient data are captured electronically. Clinical data in the BCH EMR is loaded in the i2b2 data warehouse which is available to investigators. Cases, phenotypes, and covariates are ascertained using the i2b2 database. Participants at BCH in TGP have consented to receive any research result and/or incidental finding that arises from studies using TGP that is approved by the Informed Cohort Oversight Board (ICOB) and is in accordance with the participants'preferences;results are returned through the Personally Controlled Health Record (PCHR). BCH and Cincinnati Children's Hospital Medical Center (CCHMC) have partnered as the Pediatric Alliance for Genomic and Electronic Medical Record (EMR) Research (PAGER) site for the eMERGE Phase II network for pediatric institutions, and the cohort for eMERGE at BCH is TGP. Children's Hospital of Pennsylvania (CHOP): The Center for Applied Genomics (CAG) at the Children's Hospital of Philadelphia (CHOP) is a high-throughput, highly automated genotyping and sequencing facility equipped with state-of-the-art genotyping and sequencing platforms. Children who are treated at the Children's Hospital Healthcare Network and their parents may be eligible to take part in a major initiative to collect more than 100,000 blood samples, covering a wide range of pediatric diseases. A large majority of participants consenting to prospective genomic analyses also consent to analysis of their de-identified electronic health records (EHRs). EHRs are longitudinal, with a mean duration of 6.5 years. Cincinnati Children's Hospital Medical Center/Boston's Children's Hospital (CCHMC/BCH): Cincinnati Children's Hospital Medical Center (CCHMC) is a pediatric institution dedicated to improving health and welfare of children and to the discovery and practical application of new genomic information to the ordinary care of children. CCHMC brings an extraordinary faculty to eMERGE III who are committed to gain a better understanding of the genesis of disease and to elucidate the mechanisms of diseases that afflict children, specifically pediatric disease phenotypes that will leverage the available eMERGE adult genomic data and electronic medical records (EMRs) to discover meaningful use results. Generation of EMR phenotype algorithms, informed by natural language processing, using heuristic and machine learning methods is ongoing. CCHMC has developed tools to evaluate adolescent return of results preferences, examined the ethical and legal obligations and potential to reanalyze results, and developed clinical decision support for phenotyping, test ordering, and returning sequencing results. The success of these eMERGE III studies is enhanced by the ongoing institutional investment made in the CCHMC Biobank, the comprehensive EMR (EPIC), the i2b2 de-identified medical record data warehouse, and hundreds of faculty and senior staff who make genomics or informatics an active focus of their research. Columbia University: Columbia University Medical Center/New York Presbyterian (CUMC/NYP) Hospital system is one of the nation's largest and most comprehensive hospital systems with over 2 million inpatient and outpatient yearly visits that serves a racially and ethnically diverse urban patient population. The Columbia University GENomic Integration with EHR (GENIE) research study contributed and shared phenotype and genotype data for individuals who were recruited as part of a diverse array of initiatives within the hospital, including Northern Manhattan Study (NOMAS), Pediatric Cardiac Genomic Consortium (PCGC), Caribbean Hispanics with Familial and Sporadic Late Onset Alzheimer's disease (AD), Alzheimer's Disease Sequencing Project (ADSP), and Genetics of Chronic Kidney Disease study. Some of these individuals had kidney or neurological problems, some were healthy adult volunteers with self-reported health status information from the medically underserved Northern Manhattan community, and others were pediatric patients with cardiac conditions. For the kidney disease cohort, patients with the diagnosis of Chronic Kidney Disease (CKD) and healthy controls were recruited to the Columbia University CKD biobank. For the NOMAS cohort, eligible participants were stroke-free, were 40 years old, and resided for at least 3 months in a Northern Manhattan household with a telephone. The PCGC study recruited parent-offspring trios with pediatric probands diagnosed with congenital heart defects (CHD). For the Caribbean Hispanics with Alzheimer's disease project, individuals from families affected by AD and with sporadic AD were recruited, along with unrelated controls. Samples for the ADSP study have been selected from well-characterized cohorts of individuals with AD diagnosis. Geisinger Health System: A research cohort of adult Geisinger Clinic patients was enrolled from community-based primary care clinics of the Geisinger Health System. Patients were eligible for enrollment if they were a primary care patient of a Geisinger Clinic physician and were scheduled for a non-emergent clinic visit. All participants provided written informed consent and HIPAA authorization. Consenting patients agreed to provide blood samples for broad biomedical research use, and permission to access data in their Geisinger electronic medical record for research. The enrollment rate was 90% of patients approached. The demographics of the cohort approximate those of the Geisinger Clinic outpatient population. Research blood samples were collected during an outpatient clinical phlebotomy encounter. Research blood samples are coded and stored in a central biorepository. Samples are linkable to clinical data in a de-identified manner for research via an IRB-approved data broker process. For genomic analysis, DNA is extracted from EDTA-anticoagulated whole blood. Partners Health Care (Harvard): The Partners HealthCare Biobank is a large research program designed to help researchers understand how people's health is affected by their genes, lifestyle, and environment. This large research data and sample repository provides access to high-quality, consented blood samples to help foster research, advance our understanding of the causes of common diseases, and advance the practice of medicine. For the Partners research community (Massachusetts General Hospital and Brigham and Women's Hospital), the Biobank provides: Banked samples (plasma, serum, and DNA) collected from consented patients Blood samples that were discarded after clinical testing in the Crimson Cores maintained in the Brigham and Women's Hospital and Massachusetts General Hospital Pathology Departments Sample handling and preparation services Link to the biobank data to the Partners Research Patient Data Registry (RPDR) a research instance of our electronic clinical chart Data access through our research portal. To date, over 60,000 Partners patients have given their consent to enroll, give a blood sample, receive research results and agreed to be re-contacted for additional research studies. The Biobank has enabled Partners investigators to compete for nationally recognized grants in personalized medicine such as a clinical electronic Medical Records and Genomics network (eMERGE) site and the national All of US program. The Biobank currently supports over 120 Partners investigators and over 100 million dollars in NIH research. Kaiser Permanente Washington with the University of Washington and the Fred Hutchinson Cancer Research Center: KPWA participants were enrolled in the eMERGE Network through the Northwest Institute of Genetic Medicine (NWIGM) biorepository, and provided the appropriate consent to receive clinically relevant genetic results (N~8,073.) NWIGM is based at the University of Washington and co-managed by the University of Washington and KPWA. The purpose of the NWIGM biorepository is to build infrastructure and resources to carry out a broad range of future genetic research. KPWA members enrolled in the biorepository are asked to provide informed consent to providing a DNA sample for storage in the NWIGM biorepository. The consent is purposefully broad to serve the dual purpose of reducing the burden on researchers who wish to use this biorepository and the IRB committees who will be responsible for reviewing these requests in the future. Participants were eligible if aged 50 - 65 years old at the time of their enrollment into the NWIGM repository, living, enrolled in KPWA's integrated group practice, and had completed an online Health Risk Appraisal. The selection algorithm was based on several data sources from the EHR at KPWA. 1) Demographics - participants with self-reported race as Asian ancestry were prioritized and selected to enrich for non-European ancestry. The KPWA eMERGE cohort includes n=1,245 members of Asian ancestry. 2) Participants were also selected for a history of colorectal cancer (n=1,002), in order to allow us to enrich germline pathogenic variants. Essentia Institute of Rural Health, Marshfield Clinic, Pennsylvania State University (Marshfield): The Marshfield Clinic Personalized Medicine Research Project is a population-based biobank in central Wisconsin with more than 20,000 adult subjects who provided written, informed consent to access their medical records and provided a blood sample from which DNA was extracted and plasma and serum stored. In addition to an average of 30 years of medical history data, a questionnaire about environmental exposures, including a detailed food frequency questionnaire, is available to facilitate gene/environment studies. Mayo Clinic: The Mayo Vascular Disease Biorepository is a disease-specific biobank for vascular diseases including peripheral arterial disease (PAD). PAD patients were identified from individuals referred to the non-invasive vascular laboratory for lower extremity arterial evaluation. Since 1997, laboratory findings have been recorded into an electronic database employing an in-house software package for data archiving and retrieval;this data becomes part of the Mayo EMR. Patients referred to the center with suspected PAD undergo a comprehensive non-invasive evaluation including the ankle-brachial index (ABI) - the ratio of blood pressure measured in the upper arms divided by blood pressure measured at the ankles. Controls subjects are identified from patients referred to the Cardiovascular Health Clinic for stress ECG. The prevalence of PAD in patients with normal exercise capacity who do not have inducible ischemia on the stress ECG , was <1%. Data regarding risk factors for atherosclerosis such as diabetes, dyslipidemia, hypertension, and smoking are ascertained from the EMR. Case control study of venous thromboembolism (PI John Heit) Controls from a case control study of pancreatic cancer (PI Gloria Petersen) Mayo Clinic Biobank. Icahn School of Medicine at Mount Sinai School (Mt. Sinai): The Institute for Personalized Medicine (IPM) Biobank Project is a consented, EMR-linked medical care setting biorepository of the Mount Sinai Medical Center (MSMC) drawing from a population of over 70,000 inpatients and 800,000 outpatient visits annually. MSMC serves diverse local communities of upper Manhattan, including Central Harlem (86% African American), East Harlem (88% Hispanic Latino), and Upper East Side (88% Caucasian/white) with broad health disparities. IPM Biobank populations include 28% African American (AA), 38% Hispanic Latino (HL) predominantly of Caribbean origin, 23% Caucasian/White (CW). IPM Biobank disease burden is reflective of health disparities with broad public health impact: average body mass index of 28.9 and frequencies of hypertension (55%), hypercholesterolemia (32%), diabetes (30%), coronary artery disease (25%), chronic kidney disease (23%), among others. Biobank operations are fully integrated in clinical care processes, including direct recruitment from clinical sites, waiting areas and phlebotomy stations by dedicated Biobank recruiters independent of clinical care providers, prior to or following a clinician standard of care visit. Recruitment currently occurs at a broad spectrum of over 30 clinical care sites. Northwestern University: The NUgene Project is a repository with longitudinal medical information from participating patients at affiliated hospitals and outpatient clinics from the Northwestern University Medical Center. Participants'DNA samples are coupled with data from a self-reported questionnaire and continuously updated data from our Electronic Medical Record (EMR) representing actual clinical care events. Northwestern has a state-of-the art, comprehensive inpatient and outpatient EMR system of over 2 million patients. NUgene has broad access to participant data for all outpatient visits as well as inpatient data via a consolidated data warehouse. NUgene participants consent to distribution and use of their coded DNA samples and data for a broad range of genetic research by third-party investigators. Vanderbilt University Medical Center: BioVU, Vanderbilt's DNA databank, was designed as an enabling resource for exploration of the relationships among genetic variation, disease susceptibility, and variable drug responses. BioVU acquires DNA from discarded blood samples collected from routine patient care. The biobank is linked to de-identified clinical data extracted from Vanderbilt's EMR, which forms the basis for phenotype definitions used in genotype-phenotype correlations. BioVU is currently the largest single site DNA collection world-wide, at >235,000 samples as of spring 2017.
This is a multi-center, randomized, double-blind, placebo-controlled trial of creatine monohydrate involving 553 ambulatory subjects with HD who are not requiring skilled care or institutionalization. Eligible subjects were randomized to receive up to 40 grams daily of creatine monohydrate or matching placebo. Randomization will be followed by a Titration Phase, a Maintenance Phase, an Optional Extended Maintenance Phase, and Washout. During the Titration Phase, which may last up to six months, study drug will be initially escalated in 5-gram increments starting at 5 g/day until the 40 g/day target dose is reached. Should subjects be unable to reach or stay at the maximum dose during the Titration Phase, they will remain on the highest tolerated dose (HTD) for which they have no significant side effects. During the Maintenance Phase, subjects will be followed prospectively and systematically. Subjects completing 36 months of follow-up on study drug will be eligible to continue in the study for an additional twelve months of follow-up in the Extended Maintenance Phase. The primary measure of efficacy is the annualized rate of change in TFC. The primary hypothesis of the study is that chronic treatment of early-stage HD subjects with creatine monohydrate will slow the progressive functional decline of HD.
Multiple sclerosis is a common, complex neurological disease, where almost 20% of risk heritability can be attributed to common genetic variants, including >230 identified by genome-wide association studies. Multiple strands of evidence suggest that the majority of the remaining heritability is also due to the additive effects of individual variants, rather than epistatic interactions between these variants, or mutations exclusive to individual families. Here, we show in 68,379 cases and controls that as much as 5% of this heritability is explained by low-frequency variation in gene coding sequence. We identify four novel genes driving MS risk independently of common variant signals, which highlight a key role for regulatory T cell homeostasis and regulation, IFNγ biology and NFκB signaling in MS pathogenesis.
The purpose of the clinical trial (NCT02690558) is to evaluate whether adding pembrolizumab (Keytruda) to the combination of gemcitabine and cisplatin will improve shrinkage of the tumor before having a cystectomy, for people with muscle-invasive bladder cancer (MIBC).
The impact of genetic variants on molecular pathways that give rise to neurodegenerative diseases such as Alzheimer's and Parkinson's is best elucidated in the appropriate cell types and molecular contexts. Existing studies have focused on bulk profiling of mixed cell types, but have ignored assaying genetic effects across development and cell differentiation. At the core of this proposal is the idea to use single-cell assays to study genetic effects during differentiation of dopaminergic and cortical neurons to identify the sequence of molecular events from variants to healthy and diseased cell states in a cell-specific manner. 1) This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute please see http://www.sanger.ac.uk/datasharing/ . This dataset contains all the data available for this study on 2020-05-18.
Extraskeletal myxoid chondrosarcoma (EMC) is an ultra-rare cancer. Though it has a favorable prognosis and an indolent course, it has high rates of local recurrence and metastasis to the lungs. EMC is most often characterized by a translocation involving the NR4A3 gene most often fusing with EWSR1, leading to constitutive expression of NR4A3, the biological significance of which is unknown. Our study presents a case of EMC with case-matched lung and advanced pelvic metastases. We conducted Whole Genome Sequencing to examine differences in the mutational landscape at two different stages of metastasis. This is the first study to our knowledge to analyze mega-base scale structural variants (SVs) in EMC. Our data validates copy number variants (CNVs) found in previous studies. While the primary tumor and lung metastasis had similar somatic variations and CNVs, the pelvic metastasis had more SVs with especially increased mutational burden of SVs in chromosome 2. This suggests that different molecular drivers appear in more advanced, relapsing EMC compared with the primary tumor and early lung metastasis. The sequencing data is available through dbGaP.
We will investigate the clinical implementation of genome sequencing (GS) for carrier screening to aid reproductive decision-making in adults. The study is organized into three inter-related Projects. The study population will include women and their partners who receive pre-conception testing. In Project 1, we will conduct a Randomized Clinical Trial to test clinical implementation of WGS compared with usual care. We will integrate WGS results with the electronic medical record (EMR), and measure outcomes from patient and physician perspectives. In Project 2, we will perform whole genome sequencing, including validation and interpretation of the identified variants to identify "actionable variants", meaning variants deemed worthy of reporting. This will include a robust approach using a Return of Results Committee (RORC) for determination of reportable results. In Project 3, we will evaluate the ethical and psychosocial implications of expanded carrier screening using WGS for the return of carrier status and secondary findings, and measure the impact on downstream healthcare utilization and cost. Two groups of women are identified through the electronic medical record. The first group has a clinical carrier screening test, usually Cystic Fibrosis (CF), ordered and completed as part of a preconception planning visit. The second group has a carrier screening test, usually CF, performed during pregnancy, they have delivered their baby and are planning a future pregnancy. Eligible women are randomized to either usual care or genome sequencing (GS). Women randomized into the usual care arm have already received their clinical genetic test. These women will not receive genome sequencing. Women in the GS arm will receive their carrier status results for about 700 gene/condition pairs approximately three months after randomization. If there are results to return, women come in for a genetic counseling session. If there are no results, a letter is sent in the mail. Incidental findings (IF) are returned about one month later. As with the carrier status results, all IF are returned in person, and if there are no results a letter is sent in the mail. If a participant is found to be a carrier of an autosomal recessive condition, her male partner is then eligible and invited to participate in the study and receive GS.
The NeuroGAP-Psychosis study was conducted in South Africa, Kenya, Uganda and Ethiopia as a response to the lack of genetic diversity in large genomic studies of psychiatric disorders. The study aims to expand knowledge of the genetic and environmental risk factors for neuropsychiatric disorders in Africa through large-scale sample collection and analysis. The initial design is a case-control study in which cases are individuals with schizophrenia or bipolar disorder and controls are age, sex and ancestry matched with individuals from the same geographic location. Cases are patients who are recruited from clinical facilities, both inpatient and outpatient. Controls are people who present for treatment for general medical conditions at health facilities. Data on phenotype, mental disorders, history of physical health problems, substance use and history of past traumatic events was collected from all participants. Additionally, a sample of saliva is collected for DNA extraction. DNA samples were sent to the Broad Institute for genotyping using Illumina Infinium Global Screening Array.