Many of the congenital anomalies and associated genomic variants identified during prenatal care are rare, requiring aggregation and harmonization to assure their appropriate clinical interpretation and to support the expanding utility of genomics in perinatal medicine. The primary step in accomplishing this is the creation of interoperability allowing collection, collaborative storage, dissemination and use of harmonized and vetted genomic and phenotypic data. The overall goal of this project is to generate a prenatal repository for both clinical care and research ensuring data are Findable; Accessible; Interoperable; and Reusable. The specific goals are: (1) collect and organize data contributions from multiple sites using the cloud-based Terra platform; (2) establish a governance structure for data contributions across sites; (3) develop and implement a pipeline for phenotype driven analysis and implementation of fetal specific Phenopackets on Terra; (4) generate a searchable callset of genomic variants and phenotype features on Terra; and (5) explore methods for facilitating broad access and sharing of longitudinal phenotype and genomic data facilitating genomic analysis in the prenatal space.
RNA004 is a method for direct RNA sequencing (DRS) that holds promise to enhance the detection of modopathies. To this end, we provide DRS data for a patient sample harboring heterozygous inactivating SNPs for the methyltransferase METTL5, which modifies m6A on 18S rRNA. Companion datasets are 1) EGAS50000001201 for DRS of a healthy control with intact METTL5 and 2) PRJEB74238 for IVT 18S rRNA vector data as unmodified control.
A total of 46 frozen tumor biopsies underwent WES analysis, while 41 blood samples were used as germline controls.Tumor samples qualified for whole exome sequencing (WES) if they contained ≥10% tumor cells. For bulk RNA-seq, 20 paired baseline and on-treatment frozen tumor biopsies were analyzed. Tumor samples qualified if they contained ≥30% tumor cells. For GeoMx spatial transcriptomic study we used 7 patients from the ICARUS LUNG01 trial focusing on tumour and immune cell compartments. We selected 7 paired baseline and on-treatment patient samples for analysis. We selected up to 4 regions of interest (ROI) per sample tissue. We identified tumour and immune cell areas of illumination (AOI) using CK and CD45 antibodies respectively
Environmental enteric dysfunction (EED), a sub-acute inflammatory condition of the small intestinal mucosa of unclear etiology, has been associated with a variety of environmental exposures and host factors and is implicated in growth faltering. Linear growth faltering usually occurs within the first two years of life and in most of the cases is irreversible demanding early diagnosis for treatment to be successful. The objectives of 'The Bangladesh Environmental Enteric Dysfunction Study (BEED)' are to investigate role of EED in malnutrition, examine the biology of EED to identify common biological pathways for potential interventions, to validate a system for histological scoring for EED and test the effectiveness of nutritional interventions in improving the growth parameters in children with stunting and or EED. In Bangladesh, participants are recruited from two age groups; a child cohort and a malnourished an adult cohort. In addition, two control groups are recruited for comparison consisting of a) children that are undergoing endoscopy as a part of their clinical care at the University of Virginia Health system (UVAHS) and b) an adult well-nourished control group recruited from the Gastroenterology Outpatient Department of Dhaka Medical College and Hospital in Bangladesh. The description of the study design and procedures of the study can be obtained from Mahfuz M, Das S, Mazumder RN, et al., 2017, PMID: 28801442.
Whereas non-invasive prenatal testing for aneuploidies (NIPT-A) is widely implemented, non-invasive prenatal testing for monogenic diseases (NIPT-M) is lagging. By capturing and targeted sequencing of 250000 polymorphic SNP loci from maternal plasma circulating cell-free DNA (cfDNA) and DNA from relatives, the fetal haplotype and chromosomal copy numbers are deduced. In all families tested, the cfDNA derived haplotypes are on average 97% concordant with the neonatal and embryo haplotype. This generic non-invasive prenatal diagnostic approach allows cost efficient scrutinizing the fetal genome for the presence of any inherited monogenic disease or trait.
Type 2 diabetes mellitus (T2D) is a significant health problem. Close to 20 million individuals in the United States have T2D, and 79 million aged 20 years or older are clinically pre-diabetic, with a 5-year conversion rate of 10% to 23% from prediabetes to T2D. In a collaborative effort to systematically understand diabetes and its etiology, the team is comprised of leading experts in research on both the human host as well as the microbiome, as properties of both are likely relevant in T2D development. For a better elucidation of mechanisms of onset and progression of T2D disease, the group is performing a detailed analysis of the biological processes that occur in the microbiome and human host by longitudinal profiling of patients at risk for T2D. Both microbiome and host profiles are being analyzed by state-of-the-art omics platforms, and these large-scale and diverse data sets will be integrated to determine the dynamic pathways that change during the onset and progression of T2D, especially during viral infections and other stresses. This longitudinal study is expected to reveal changes in the microbiome and host at an unprecedented level of detail, and identify molecules and pathways that play important roles in diabetes onset and progression.
The landscape of genetic alterations in lung adenocarcinoma derived from Asian patients is largely uncharacterized. Here we present an integrated genomic and transcriptomic analysis of 335 primary lung adenocarcinomas and 35 corresponding lymph node metastases from Chinese patients. Altogether 13 significantly mutated genes are identified, including themost commonly mutated gene TP53 and novel mutation targets such as RHPN2, GLI3 and MRC2. TP53 mutations are furthermore significantly enriched in tumours from patients harbouring metastases. Genes regulating cytoskeleton remodelling processes are also frequently altered, especially in metastatic samples, of which the high expression level of IQGAP3 is identified as a marker for poor prognosis. Our study represents the first large-scalesequencing effort on lung adenocarcinoma in Asian patients and provides a comprehensive mutational landscape for both primary and metastatic tumours. This may thus form a basis for personalized medical care and shed light on the molecular pathogenesis of metastatic lung adenocarcinoma.
Long-read whole genome sequencing, long-read long-range PCR sequencing, targetted long fragment long-read sequencing data of the CLCNKA/CLCNKB locus in 27 Bartter syndrome patients. Multiple different deletion alleles are identified and a new common transposition haplotype predisposing for CLCNKB deletions is described.
Startup of Framingham Heart Study. Cardiovascular disease (CVD) is the leading cause of death and serious illness in the United States. In 1948, the Framingham Heart Study (FHS) -- under the direction of the National Heart Institute (now known as the National Heart, Lung, and Blood Institute, NHLBI) -- embarked on a novel and ambitious project in health research. At the time, little was known about the general causes of heart disease and stroke, but the death rates for CVD had been increasing steadily since the beginning of the century and had become an American epidemic. The objective of the FHS was to identify the common factors or characteristics that contribute to CVD by following its development over a long period of time in a large group of participants who had not yet developed overt symptoms of CVD or suffered a heart attack or stroke. Design of Framingham Heart Study. In 1948, the researchers recruited 5,209 men and women between the ages of 30 and 62 from the town of Framingham, Massachusetts, and began the first round of extensive physical examinations and lifestyle interviews that they would later analyze for common patterns related to CVD development. Since 1948, the subjects have returned to the study every two years for an examination consisting of a detailed medical history, physical examination, and laboratory tests, and in 1971, the study enrolled a second-generation cohort -- 5,124 of the original participants' adult children and their spouses -- to participate in similar examinations. The second examination of the Offspring cohort occurred eight years after the first examination, and subsequent examinations have occurred approximately every four years thereafter. In April 2002 the Study entered a new phase: the enrollment of a third generation of participants, the grandchildren of the original cohort. The first examination of the Third Generation Study was completed in July 2005 and involved 4,095 participants. Thus, the FHS has evolved into a prospective, community-based, three generation family study. The FHS is a joint project of the National Heart, Lung and Blood Institute and Boston University. Research Areas in the Framingham Heart Study. Over the years, careful monitoring of the FHS population has led to the identification of the major CVD risk factors -- high blood pressure, high blood cholesterol, smoking, obesity, diabetes, and physical inactivity -- as well as a great deal of valuable information on the effects of related factors such as blood triglyceride and HDL cholesterol levels, age, gender, and psychosocial issues. Risk factors have been identified for the major components of CVD, including coronary heart disease, stroke, intermittent claudication, and heart failure. It is also clear from research in the FHS and other studies that substantial subclinical vascular disease occurs in the blood vessels, heart and brain that precedes clinical CVD. With recent advances in technology, the FHS has enhanced its research capabilities and capitalized on its inherent resources by the conduct of high resolution imaging to detect and quantify subclinical vascular disease in the major blood vessels, heart and brain. These studies have included ultrasound studies of the heart (echocardiography) and carotid arteries, computed tomography studies of the heart and aorta, and magnetic resonance imaging studies of the brain, heart, and aorta. Although the Framingham cohort is primarily white, the importance of the major CVD risk factors identified in this group have been shown in other studies to apply almost universally among racial and ethnic groups, even though the patterns of distribution may vary from group to group. In the past half century, the Study has produced approximately 1,200 articles in leading medical journals. The concept of CVD risk factors has become an integral part of the modern medical curriculum and has led to the development of effective treatment and preventive strategies in clinical practice. In addition to research studies focused on risk factors, subclinical CVD and clinically apparent CVD, Framingham investigators have also collaborated with leading researchers from around the country and throughout the world on projects involving some of the major chronic illnesses in men and women, including dementia, osteoporosis and arthritis, nutritional deficiencies, eye diseases, hearing disorders, and chronic obstructive lung diseases. Genetic Research in the Framingham Heart Study. While pursuing the Study's established research goals, the NHLBI and the Framingham investigators has expanded its research mission into the study of genetic factors underlying CVD and other disorders. Over the past two decades, DNA has been collected from blood samples and from immortalized cell lines obtained from Original Cohort participants, members of the Offspring Cohort and the Third Generation Cohort. Several large-scale genotyping projects have been conducted in the past decade. Genome-wide linkage analysis has been conducted using genotypes of approximately 400 microsatellite markers that have been completed in over 9,300 subjects in all three generations. Analyses using microsatellite markers completed in the original cohort and offspring cohorts have resulted in over 100 publications, including many publications from the Genetics Analysis Workshop 13. Several other recent collaborative projects have completed thousands of SNP genotypes for candidate gene regions in subsets of FHS subjects with available DNA. These projects include the Cardiogenomics Program of the NHLBI's Programs for Genomics Applications, the genotyping of ~3000 SNPs in inflammation genes, and the completion of a genome-wide scan of 100,000 SNPs using the Affymetrix 100K Genechip. Framingham Cohort Phenotype Data. The phenotype database contains a vast array of phenotype information available in all three generations. These will include the quantitative measures of the major risk factors such as systolic blood pressure, total and HDL cholesterol, fasting glucose, and cigarette use, as well as anthropomorphic measures such as body mass index, biomarkers such as fibrinogen and CRP, and electrocardiography measures such as the QT interval. Many of these measures have been collected repeatedly in the original and offspring cohorts. Also included in the SHARe database will be an array of recently collected biomarkers, subclinical disease imaging measures, clinical CVD outcomes as well as an array of ancillary studies. The phenotype data is located here in the top-level study phs000007 Framingham Cohort. To view the phenotype variables collected from the Framingham Cohort, please click on the "Variables" tab above. The Framingham Cohort is utilized in the following dbGaP substudies. To view genotypes, analysis, expression data, other molecular data, and derived variables collected in these substudies, please click on the following substudies below or in the "Substudies" section of this top-level study page phs000007 Framingham Cohort. phs000342 Framingham SHARe phs000282 Framingham CARe phs000363 Framingham SABRe phs000307 Framingham Medical Resequencing phs000401 Framingham ESP Heart-GO phs000651 Framingham CHARGE-S phs000724 Framingham DNA Methylation phs001610 Framingham T2D-GENES phs002558 Framingham ADSP phs002559 Framingham BRIDGET phs002560 Framingham Cholesterol phs002611 Framingham Post-Mortem Brain Tissue phs002938 Framingham Molecular QTL The unflagging commitment of the research participants in the NHLBI FHS has made more than a half century of research success possible. For decades, the FHS has made its data and DNA widely available to qualified investigators throughout the world through the Limited Access Datasets and the FHS DNA Committee, and the SHARe database will continue that tradition by allowing access to qualified investigators who agree to the requirements of data access. With the SHARe database, we continue with an ambitious research agenda and look forward to new discoveries in the decades to come.
Tumors represent dynamically evolving populations of mutant cells, and many advances have been made in understanding the biology of their progression. However, key unresolved questions remain about the conditions that support their initial transformation, which cannot be easily captured in patient populations but are instead modeled using transgenic cellular or animal systems. Here, we used extensive patient atlas data to define common features of the tumor DNA methylation landscape as they compare to healthy human cells and used this benchmark to screen 21 engineered human and mouse models for their ability to reproduce these patterns. Notably, we find that genetically induced cellular transformation can trigger global changes in DNA methylation levels that are consistent with extensive proliferation but rarely recapitulate the widespread de novo methylation of Polycomb targets as found in clinical samples. Our results raise pertinent questions about the relationship between genetic and epigenetic aspects of tumorigenesis and provide an important molecular reference for evaluating existing as well as newer tumor models.