The failure to develop effective therapies for paediatric glioblastoma (pGBM) and diffuse intrinsic pontine glioma (DIPG) is in part due to their intrinsic heterogeneity. Analysis of 142 sequenced cases revealed multiple tumour subclones, spatially and temporally co-existing in a stable manner as observed by multiple sampling strategies. This dataset provides multi region sequencing of high grade gliomas and diffuse intrinsic pontine gliomas from 15 patients. DNA was extracted from FFPE sections in 2-13 regions of each tumour and sequenced with Agilent SureSelect whole exome sequencing. Germline DNA was also sequenced in 14 cases. Data was aligned to hg19 with bwa and is provided as 79 separate BAM files.
Our aim is to analyze the genome of human melanoma cell lines and short term culture from human melanoma samples in order to identify genes that confer drug resistance to clinically relevant targeted therapies. We will perform whole-exome sequencing, copy number variation analysis and methylome analysis in a collection of human melanoma cell lines and short term culture that will be then screened for drug sensitivity/resistance through a library of clinically relevant drugs and drug combinations. By the combined analysis of the genomic lesion and the drug sensitivity/resistance profile of different cell lines, we will look for genes whose mutation is associated to the sensitivity or resistance to a specific drug in different samples.
The goals of this study is to investigate the prevalence and heritability of age-related clonal haemopoeisis (ARCH) in healthy elderly individuals.We will use a bespoke bait set to pull down DNA regions of interest in whole blood samples combined with HiSeq at a deep level . By correlating findings from each individual to their respective twin we hope to elucidate whether heritable traits influence the development of ARCH. a. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/ . This dataset contains all the data available for this study on 2019-05-31.
Whole genome sequencing of 100 unrelated Kazakhs in order to impute genotypes into PE cases and controls from Kazakhstan and to provide genetic data and infrastructure for future genetic studies in Kazakhstan and Central Asia more generally and to fill a gap in worldwide information as Central Asia is not adequately represented in available genomic data. This dataset is one component of the InterPregGen FP7 project. DNA samples for this component were collected by InterPregGen Consortium collaborators at the Scientific Center of Obstetrics, Gynecology and Perinatology, Almaty, Kazakhstan (Gulnara Svyatova, Principal Investigator)
MESA The Multi-Ethnic Study of Atherosclerosis (MESA) is a study of the characteristics of subclinical cardiovascular disease (disease detected non-invasively before it has produced clinical signs and symptoms) and the risk factors that predict progression to clinically overt cardiovascular disease or progression of the subclinical disease. MESA researchers study a diverse, population-based sample of 6,814 asymptomatic men and women aged 45-84. Thirty-eight percent of the recruited participants are white, 28 percent African-American, 22 percent Hispanic, and 12 percent Asian, predominantly of Chinese descent. Participants were recruited from six field centers across the United States: Wake Forest University, Columbia University, Johns Hopkins University, University of Minnesota, Northwestern University and University of California - Los Angeles. Each participant received an extensive physical exam to determine coronary calcification, ventricular mass and function, flow-mediated endothelial vasodilation, carotid intimal-medial wall thickness and presence of echogenic lucencies in the carotid artery, lower extremity vascular insufficiency, arterial wave forms, electrocardiographic (ECG) measures, standard coronary risk factors, sociodemographic factors, lifestyle factors, and psychosocial factors. Selected repetition of subclinical disease measures and risk factors at follow-up visits allows study of the progression of disease. Blood samples have been assayed for putative biochemical risk factors and stored for case-control studies. DNA has been extracted and lymphocytes cryopreserved (for possible immortalization) for study of candidate genes and possibly, genome-wide scanning, expression, and other genetic techniques. Participants are being followed for identification and characterization of cardiovascular disease events, including acute myocardial infarction and other forms of coronary heart disease (CHD), stroke, and congestive heart failure; for cardiovascular disease interventions; and for mortality. In addition to the six Field Centers, MESA involves a Coordinating Center, a Central Laboratory, and Central Reading Centers for Computed Tomography (CT), Magnetic Resonance Imaging (MRI), Ultrasound, and Electrocardiography (ECG). Protocol development, staff training, and pilot testing were performed in the first 18 months of the study. The first examination took place over two years, from July 2000 - July 2002. It was followed by four examination periods that were 17-20 months in length. Participants have been contacted every 9 to 12 months throughout the study to assess clinical morbidity and mortality. MESA Family The general goal of the MESA Family Study, an ancillary study to MESA funded by a grant from NHLBI, is to apply modern genetic analysis and genotyping methodologies to delineate the genetic determinants of early atherosclerosis. This is being accomplished by utilizing all the current organizational structures of the Multi-Ethnic Study of Atherosclerosis (MESA) and Genetic Centers at Cedars-Sinai Medical Center and University of Virginia. In the MESA Family Study, the goal is to locate and identify genes contributing to the genetic risk for cardiovascular disease (CVD), by looking at the early changes of atherosclerosis within families (mainly siblings). 2128 individuals from 594 families, yielding 3,026 sibpairs divided between African Americans and Hispanic-Americans, were recruited by utilizing the existing framework of MESA. MESA Family studied siblings of index subjects from the MESA study and from new sibpair families (with the same demographic characteristics) and is determining the extent of genetic contribution to the variation in coronary calcium (obtained via CT Scan) and carotid artery wall thickness (B-mode ultrasound) in the two largest non-majority U.S. populations. In a small proportion of subjects, parents of MESA index subjects participating in MESA Family were studied but only to have blood drawn for genotyping. The MESA Family cohort was recruited from the six MESA Field Centers. MESA Family participants underwent the same examination as MESA participants during May 2004 - May 2007. DNA was extracted and lymphocytes immortalized for study of candidate genes, genome-wide linkage scanning, and analyzed for linkage with these subclinical cardiovascular traits. While linkage analysis is the primary approach being used, an additional aspect of the MESA Family Study takes advantage of the existing MESA study population for testing a variety of candidate genes for association with the same subclinical traits. Genotyping and data analysis will occur throughout the study. MESA Air The general goal of the Multi-Ethnic Study of Atherosclerosis and Air Pollution ('MESA Air') is to prospectively examine the relation between an individual level assessment of long-term ambient air pollution exposures (including PM2.5 and the progression of subclinical cardiovascular disease in a multi-city, multi-ethnic cohort. MESA Air will also prospectively examine the relationship between an individual level assessment of long-term ambient air pollution exposures and the incidence of cardiovascular disease, including myocardial infarction and cardiovascular death. MESA AIR is funded by a grant from the United States Environmental Protection Agency to the University of Washington and subcontracts from the UW to other participating institutions. MESA Air will assess if ambient air pollution is associated with changes over time in subclinical measures of atherosclerosis and plasma markers of inflammation, oxidative damage, and endothelial activation in a longitudinal data model, adjusting for age, race/ethnicity, socioeconomic status, and specific cardiovascular risk factors (such as diabetes, hypertension, smoking, and diet). The study will similarly assess if the incidence of cardiovascular events is associated with long-term exposure to ambient air pollution, using a proportional hazards model. The study includes refinement of statistical tools, and explores joint/independent effects of acute and long-term pollutant exposure in the occurrence of cardiovascular disease. The MESA Air study is built on the foundation of the ongoing MESA study. The parent MESA Study cohort is located in six geographic areas ('Field Centers') that capture tremendous exposure heterogeneity, comparable to or greater than the variability in locations of prior U.S. cohort studies. In addition to the six Field Centers, the study involves a Coordinating Center, a Central Laboratory, and Reading Centers for Computed Tomography (CT), ultrasound and air pollution data. The cohort for the MESA Air study currently includes 6226 subjects: 5479 enrolled in the parent MESA study; 257 recruited specifically for this study, and 490 recruited from the MESA Family study. The entire MESA Air cohort will be followed over a 10-year project period for the occurrence of cardiovascular disease events. On two occasions over the ten-year study period, 3600 subjects from the MESA Air cohort, residing in nine locales, will undergo computed tomography scanning to assess presence and extent of coronary artery calcification (CAC), and ultrasound of the carotid artery to determine intima-media thickness (IMT). We will also repeatedly assess plasma markers of inflammation, oxidative damage, and endothelial function in 720 subjects. MESA Air adds state-of-the-art air pollution exposure assessment information to the MESA cohort study, and introduces new subjects and outcome measures to achieve our aims. The study will assess long-term individual-level exposure to ambient air pollutants for each subject using community-scale monitoring, outdoor spatial variation, subject proximity to pollution sources, pollutants' infiltration efficiency, and personal time-activity information. The exposure models will be validated using detailed monitoring in a subset of subjects. The MESA Cohort is utilized in the following dbGaP substudies. To view genotypes, analysis, expression data, other molecular data, and derived variables collected in these substudies, please click on the following substudies below or in the "Substudies" box located on the right hand side of this top-level study page phs000209 MESA Cohort. phs000420 MESA SHARe phs000283 MESA CARe phs000403 MESA ESP Heart-GO
The focus of this study is to identify a new level of genetic variation, i.e. rare genetic variants with a population frequency less than 5%, usually less than 1%, which are believed to provide a stronger risk per variant than those studied to date in the large genome wide association studies (GWAS). To do this we are generating whole exome sequencing data on the Illumina HiSeq. Each HiSeq produces at least 600 billion base pairs of DNA sequence in one run. Whole exome sequencing sequence data, about 50 million base pairs, or about 1.5% of the total DNA of each person's genome, is generated. We are using this data to look for new DNA variations that give risk for Parkinson disease, as well as "modifiers", that may lead to having more severe or milder disease or later or earlier ages of onset.
Oncogenic alterations in EGFR frequently co-occur with additional genetic alterations in EGFR-driven lung adenocarcinoma (LUAD), but how specific combinations of mutations affect tumor phenotypes and responses to targeted therapy is yet unknown. We leveraged a genetically engineered mouse model of EGFR mutant/Trp53-deficient LUAD to study the consequences of inactivating 10 different tumor suppressor genes on the fitness and tyorosine kinase inhibitor (TKI) sensitivity of these tumors. We found that loss of Keap1 is associated with a reduced response to therapy. In patients, we found that mutations in the KEAP1/NFE2L2/CUL3 pathway are associated with a significantly shorter time to treatment failure for EGFR TKI therapy compared to matched patients with wild-type KEAP1/NFE2L2/CUL3 tumors. We also analyzed whole exome sequencing data from tumor specimens before TKI treatment and at the time of treatment resistance for mutations in the KEAP1 pathway and these data are being submitted here.
Acute myeloid leukemia is an aggressive clonal malignancy of the bone marrow that is the direct result of sequential acquisition of mutations in a single lineage of cells. In this study, we investigate a model in which this mutational acquisition occurs serially in long-lived self-renewing hematopoietic stem cells eventually resulting in frank acute myeloid leukemia. Coding mutations in multiple AML patients were identified using exome sequencing followed by sanger sequencing validation. The level of these mutations was then assessed in residual hematopoietic stem cells from each patient using targeted deep sequencing. These population-level estimates of mutant allele burden were then validated in single cell assays targeted to the identified mutations. This allowed for determination of the order of acquisition of the mutations that preceded the development of the leukemia. The results of this study identify pre-leukemic hematopoietic stem cell clones that could contribute to patient relapse and outcome.
The Finland-United States Investigation of NIDDM Genetics (FUSION) study is a long-term effort to identify genetic variants that predispose to type 2 diabetes (T2D) or that impact the variability of T2D-related quantitative traits. The initial effort involved linkage analysis of affected-sibling-pair (ASP) families based on over 5,000 individuals living in Finland, and association fine mapping based on these family members and additional T2D cases and controls. We completed a genome-wide association scan on 1161 T2D cases and 1174 normal glucose tolerant (NGT) controls. Individual-level data is available for the 919 T2D cases and 787 NGT controls who reconsented to the use of their data or are deceased (phs000100). In addition, we selected these 919 T2D cases and a matched set of 919 NGT controls (774 overlapping with GWAS) for targeted sequencing of 78 genes associated with glucose, insulin, and/or lipids. 400 of these T2D cases were also chosen for whole-exome sequencing (phs000702).
Lung cancer is frequently treated with paclitaxel in combination with several other agents; however, paclitaxel treatment is often ineffective or limited by treatment-related toxicities. Heritable variants in genes associated with absorption, distribution, metabolism, and elimination may predict paclitaxel clinical outcome and toxicity. We designed a prospective multi-institutional study that recruited 546 patients receiving therapy with a 5-year follow up. All patients were genotyped using the Drug Metabolizing Enzymes and Transporters (DMET) platform, which ascertains 1931 genotypes in 235 genes. Genotypes were compared to the progression-free survival of paclitaxel therapy and clinically-significant paclitaxel toxicities. Seven genetic variants were associated with paclitaxel PFS (in ABCB11, ABCC3, ABCG1, CYP8B1, NR3C1, FMO6P, and GSTM3), whereas four genetic variants (in VKORC1, SLC22A14, GSTA2, and DCK) were associated with paclitaxel toxicities. The present SNPs have limited clinical utility but suggest that certain genes are related to important paclitaxel pathways in lung cancer.