Functional characterisation of CpG islands in human tissues
DNA methylation in bronchial biopsies of asthmatics, asthma in remission and healthy subjects
Social interactions and the overall psychosocial environment have a demonstrated impact on health, particularly for people living in disadvantaged urban areas. Here we investigated the effect of psychosocial experiences on gene expression in peripheral blood immune cells of 251 children with asthma in Metro Detroit. Participants were included from an ongoing longitudinal study Asthma in the Lives of Families Today (ALOFT; recruited from November 2010-July 2018. The ALOFT project was established to identify the behavioral and biological pathways through which family social environments impact youth with asthma.In version 1 we used RNA-sequencing and a new machine learning approach to identify transcriptional signatures of 19 variables including psychosocial factors, blood cell composition and asthma symptoms. Using longitudinal data collected from a subset of the participants we showed that transcriptional signatures track the longitudinal changes in measured phenotypes. Importantly, we found 169 genes associated with asthma that are regulated by psychosocial factors, and 344 significant gene-environment interactions for gene expression levels. These results demonstrate that immune gene expression mediates the link between negative psychosocial experiences and asthma risk.In version 2, we focused on pubertal development. We identified substantial gene expression changes associated with age and pubertal development. We showed that genetic effects on gene expression change dynamically during pubertal development. Gene expression changes during puberty are correlated with gene expression changes associated with asthma and may explain sex differences in prevalence. Our results show that molecular data used to study the genetics of early onset diseases should consider pubertal development as an important factor that modifies the transcriptome.Version 3 uses single cell RNA-seq to study the dynamics of the transcriptional response to glucocorticoids in activated Peripheral Blood Mononuclear Cells from 96 African American children. We employed novel statistical approaches to calculate a mean-independent measure of gene expression variability and a measure of transcriptional response pseudotime. Using these approaches, we demonstrated that glucocorticoids reverse the effects of immune stimulation on both gene expression mean and variability. Our novel measure of gene expression response dynamics, based on the diagonal linear discriminant analysis, separated individual cells by response status on the basis of their transcriptional profiles and allowed us to identify different dynamic patterns of gene expression along the response pseudotime. We identified genetic variants regulating gene expression mean and variability, including treatment-specific effects, and demonstrated widespread genetic regulation of the transcriptional dynamics of the gene expression response.Version 4 adds scATAC-seq in activated peripheral blood mononuclear cells (PBMC) from 16 children with asthma with phytohemagglutinin (PHA) or lipopolysaccharide (LPS), and treated with dexamethasone (DEX), an anti-inflammatory glucocorticoid. We analyzed changes in chromatin accessibility, measured transcription factor motif activity, and identified treatment and cell-type specific transcription factors that drive changes in both gene expression mean and variability. We observed strong positive linear dependence between motif response and their target gene expression changes, but negative in variability changes. This result suggests that an increase of transcription factor binding tightens the variability of gene expression around the mean. We then annotated genetic variants in chromatin accessibility peaks and response motifs followed by computational fine-mapping of eQTL signals from a pediatric asthma cohort. We found that eQTLs were 5-fold enriched in peaks with response motifs and refined the credible set for 410 asthma risk genes, with 191 having the causal variant in response motifs. These results enhance the understanding of molecular mechanisms for asthma risk variants mediated by gene expression.
Data Access NOTE: Please refer to the “Authorized Access” section below for information about how access to the data from this accession differs from many other dbGaP accessions.Related Studies: Other CARDIA data available include:phs000236.v2.p2 PAGE: CALiCo: Coronary Artery Risk Development in Young Adults(CARDIA) phs000309.v3.p2 The CARDIA-GENEVA Study phs000399.v1.p2 NHLBI GO-ESP: Heart Cohorts Exome Sequencing Project (CARDIA) phs000613.v1.p2 NHLBI CARDIA Candidate Gene Association Resource (CARe) phs003675.v1.p1 CARDIA Multi-omics Obesity & CVD Substudy – Year 20 Untargeted Metabolomics Data phs001612.v3.p2 NHLBI TOPMed: CARDIA (Coronary Artery Risk Development in Young Adults) phs003045.v1.p1 Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Coronary Artery Risk Development in Young Adults Study (CARDIA) phs000285.v4.p3 CARDIA_Cohort Available Data: Available data for request include exam data through Year 30 and outcomes and death data through 2022. Additionally, data from 28 ancillary studies are available for request.Objectives: The original objectives of CARDIA were to document levels of risk factors for coronary artery disease and potential determinants of these risk factors in young adults; to study the interrelationships of risk factors and lifestyles and to document behavioral and environmental changes during the transition from young adulthood to middle age; to compare cross-sectional and longitudinal data on age-related trends in cardiovascular disease risk factors; and to compare levels and evolution of risk factors between men and women, blacks and whites, and in groups of differing socioeconomic status. Goals of the study have evolved to emphasize understanding determinants of left ventricular mass, emerging obesity and hypertension, and sequelae of hypertension in pregnancy.Background: CARDIA is designed to increase understanding of contributors to changes in cardiovascular disease (CVD) risk factors during the critical years of transition from young adulthood to middle age. CARDIA was funded initially in 1983 for a five-year cycle that included two rounds of examinations. Contract renewals have allowed for subsequent re-examinations.Participants: Black and white men and women; ages 18-30 years at entry with a range of attained education; original sample size: 5,115.Design: CARDIA is a population-based observational study of 5,115 participants aged 18-30 years recruited in 1985-1986. The sample was designed to achieve approximately balanced subgroups of race, gender, education (high school or less and more than high school) and age (18-24 and 25-30). Forty percent of the cohort had no more than a high school education. A total of nine examinations have been completed in the cohort with examination cycles at year 2 of the project and years 5, 7, 10 , 15, 20, 25, and 30.In addition to standard measurements of blood pressure, anthropometry, blood lipids, smoking behavior, physical activity, diet, pulmonary function, and many psychological factors, CARDIA has other included measurements (in subsets or in the full cohort) to obtain unique information on other aspects of risk factor development and early morbidity. These have included: graded exercise treadmill testing; echocardiography, particularly for measurement of left ventricular mass; cardiovascular reactivity; serum cotinine; Lp(a), apoE phenotype, apolipoprotein A1 and B; homocysteine; skin reflectance; body composition by dual X-ray absorptiometry; glucose tolerance testing; vascular resistance and compliance; and plasma renin activity and sympathetic nervous system activity.
Assessment of dexamethasone response in human lung through scRNA-seq analysis of bronchoalveolar lavage (BAL) samples from patients with severe COVID-19.
We performed whole genome sequencing and whole exome sequencing of 31 lung adenocarcinoma (LUAD) samples from the Environment And Genetics in Lung cancer Etiology (EAGLE) study.
In this study, RNA sequencing was performed on astrocytes microdissected from the posterior cingulate cortex of Alzheimer's disease subjects and healthy elderly controls.
This dataset contains RNA-Seq data of acute lymphoblastic leukemia in LLAG-0707 study. Diagnostic samples of 188 patients were sequenced. Reads were aligned to hg19 genome reference using STAR. Aligned bam files are provided in this dataset.
Research study of rare genetic variants associated with familial cases of pulmonary fibrosis in patients and relatives from Canary Islands, Spain.
To understand the circulating immune cells in colorectal cancer patients, we performed single-cell RNA sequencing for 14 PBMC samples from 14 colorectal cancer patients.
Summary statistics and individual level data associated with PheWAS and phenotypic profiling of individuals with heterozygous loss of mutations in the gene ANGPTL8
Using RNA CaptureSeq we annotated non-coding RNAs transcribed from genome intervals surrounding breast cancer risk signals in a range of mammary-derived tissue and cell lines.
BLUEPRINT DNA methylation profiles of monocytes, T cells and B cells in type 1 diabetes-discordant monozygotic twins
We have genotyped 140 individuals from 5 populations in Pakistan, using the OmniExpressExome-8 array, which includes approximately one million autosomal SNP markers.
This retrospective tissue study explores the prognostic significance of tumour-infiltrating lymphocyte (TIL) populations in NPC, using a combination of RNA sequencing and immunohistochemistry.
To study the effects of rifaximin on human small intestinal (hSI) organoids, we performed RNAseq in PXR knockdown hSI compared to control hSI.
A comprehensive taxonomy of the predicted interactions between LEPR + stromal niches, HSPCs and adaptive/innate immune cells in the human NBM and AML
In this project we will use exome sequencing to identify somatic mutations in lesions from a patient with a germline mutation in the protection of telomeres 1 gene (POT1). This dataset contains all the data available for this study on 2017-04-27.
Insertion of processed pseudogenes is known to occur in the germline but has not previously been observed in somatic cells. Formation of pseudogenes could represent a new class of mutation in cancers and a new source of potential driver events.
Whole genome sequencing of 4 childhood T-ALL patients, which was further used in single-cell analysis in the paper "Single cell sequencing reveals the origin and the order of mutation acquisition in T-cell acute lymphoblastic leukemia".
In this work, using exome sequencing, we identified biallelic PNLPA6 mutations in patients with childhood blindness due to severe photoreceptor death and clinical features of Leber congenital amaurosis (LCA) and, interestingly, also of the rare Oliver McFarlane Syndrome
In this project we will use exome sequencing to identify somatic mutations in lesions from a patient with a germline mutation in the protection of telomeres 1 gene (POT1). This dataset contains all the data available for this study on 2016-04-20.
The aim of this project is to test whether HPV oncogenes and /or interferon induce the APOBEC mutational signature in vitro, and to test the role of APOBEC3A in this process. . This dataset contains all the data available for this study on 2019-04-11.
Objectives: The clinical trial assessed the safety and efficacy of three interventions. Specifically, it evaluated (1) the major health benefits and risks of estrogen plus progestin and estrogen alone, (2) the effects of a low-fat eating pattern on risk of colorectal cancer, and (3) the efficacy of calcium with vitamin D supplementation for preventing hip and other fractures. The objective of the memory study was to determine whether estrogen plus progestin therapy protects global cognitive function, and evaluate the therapy's effect on the incidence of dementia and mild cognitive impairment.The observational study is examining the relationship between lifestyle, socioeconomic, health, and other risk factors with cardiovascular, breast cancer, colorectal cancer and osteoporotic fracture outcomes. Secondary objectives include providing more reliable estimates of the extent to which known risk factors predict disease, more precise estimates of new occurrences of disease, and to provide a future resource for the identification of new or novel risk factors especially factors found in blood. Background: The Women's Health Initiative (WHI) is a long-term national health study that has focused on strategies for preventing the major causes of death, disability, and frailty in postmenopausal women, specifically heart disease, cancer, and osteoporotic fractures. The WHI is primarily composed of an observational study (OS), as well a clinical trial (CT) with three components: Hormone Replacement Therapy (HT), Dietary Modification, (DM) and Calcium/Vitamin D supplementation (CaD).Prior to the WHI, observational studies suggested that postmenopausal hormone therapy was associated with a decreased risk of coronary heart disease (CHD). Potential cardioprotection was based on generally supportive data on lipid levels in intermediate outcome clinical trials, trials in nonhuman primates, and a large body of observational studies suggesting a 40% to 50% reduction in risk among users of either estrogen alone or, less frequently, combined estrogen and progestin. Observational studies primarily examining unopposed estrogen preparations have suggested a 30% to 50% reduction in coronary events, and an 8% to 30% increase in breast cancer with extended use. Other research findings indicated that hormone therapy was also associated with a decreased risk of osteoporosis and increased bone density. The WHI HT trials were designed to test the effects of postmenopausal hormone therapy on risk for coronary heart disease and assess overall risks and benefits in predominantly healthy women. The Women's Health Initiative Memory Program (WHIMS) consists of a suite of studies which include cohorts of women who participated in the WHI HT trials. Postmenopausal women have a greater risk than men of developing Alzheimer's disease, but studies of the effects of estrogen therapy on Alzheimer's disease have been inconsistent. Additionally, observational studies have suggested that postmenopausal hormone treatment may improve cognitive function, but data from randomized clinical trials have been sparse and inconclusive. International comparisons and migration studies have suggested that countries with 50% lower fat intake than the US population had approximately one third the risk of colorectal cancer. Additionally, fairly consistent evidence existed for an effect of dietary fat, vegetables and fruits, and grains on colorectal cancer risk from within-country observational studies, although the protective effect of lower fat intake was no longer clear after adjusting for energy intake. The WHI DM trial was the first randomized trial to directly address the health effects of a low-fat eating pattern in predominantly healthy postmenopausal women from diverse racial/ethnic, geographic, and socioeconomic backgrounds. Osteoporosis is a major cause of injury, loss of independence, and death, and contributes to hip fractures. Observational evidence and data from previous randomized clinical trials suggest that calcium and/or vitamin D supplements may slow bone loss and reduce the risk of falls in postmenopausal and elderly women. However, evidence from trials, observational studies, and meta-analyses of calcium and vitamin D supplementation with respect to hip and other fractures was limited at the time the WHI was initiated. In two prior randomized trials, calcium plus vitamin D supplements did not reduce the risk of nonvertebral fractures among older women. When the WHI CaD trial was designed, guidelines recommended daily intakes of 800 to 1200 mg of calcium with 400 IU of vitamin D for the prevention of osteoporosis, which was not met by many American women. Therefore, the WHI CaD trial was designed to test the primary hypothesis that postmenopausal women randomly assigned to calcium plus vitamin D supplementation would have a lower risk of hip fracture and, secondarily, of all fractures than women assigned to placebo. Subjects: Postmenopausal women ages 50 to 79 were eligible to participate. A woman was considered postmenopausal if she had experienced no vaginal bleeding for 6 months (12 months for women under 55 years of age), had had a hysterectomy, or had ever used postmenopausal hormones. Recruitment was carried out in 40 US clinical centers in 1993-1998. The clinical trial components had additional specific inclusion or exclusion criteria.A total of 68,132 women were randomized into at least one component of the clinical trial. 27,347 women were enrolled in the hormone therapy component with 16,608 in the estrogen plus progestin trial and 10,739 in the unopposed estrogen trial, 48,835 women were enrolled in the diet modification component, and 36,282 women were enrolled in the calcium/vitamin D component. 7,479 women 65 years of age and older at baseline and that participated in the HT trial component were enrolled in the ancillary memory study. Women who were either ineligible or unwilling to participate in the clinical trial component were enrolled in the observational study. For example, many potential participants to the clinical trial component of the study were already undertaking a low fat diet or were using hormone replacement therapy. The effect of the selection process was that women enrolled in the observational study tended to have healthier lifestyles compared to those enrolled in the clinical trial. In total, 93,676 subjects were enrolled in WHI OS, with over 16% being members of a racial/ethnic minority group. The first WHI Extension Study enrolled 115,407 consenting participants from all components of the original WHI study for an additional five years of follow-up, from 2005 to 2010. In 2010, 93,567 women consented to continued follow-up. Design: The clinical trial component of the WHI included three randomized comparisons: hormone therapy, dietary modification, and calcium/vitamin D supplementation. Women could have been randomized into one, two or all three trials.The hormone therapy trial enrolled women to one of two double-blinded trials: estrogen (0.625 mg of conjugated equine estrogens daily) plus progestin (2.5 mg of medroxyprogesterone acetate daily) or estrogen alone. Women with a prior hysterectomy were eligible for the trial of unopposed estrogen. Women with an intact uterus at screening were initially also eligible for unopposed estrogen, but were reassigned to the trial of combined postmenopausal hormones beginning in 1995. Both trials randomized participants 1:1 to either hormone therapy or placebo. A 3-month washout period was required before baseline evaluation of women using postmenopausal hormones at initial screening. Study participants were contacted by telephone 6 weeks after randomization to assess symptoms and reinforce adherence. Follow-up contacts by telephone or clinic visit occurred every 6 months, with clinic visits required annually. The estrogen plus progestin trial was halted in July 2002 after a mean 5.2 years of follow-up because health risks, including increased risk of breast cancer and cardiovascular disease, exceeded benefits. The estrogen alone trial was stopped early in March 2004, because an increased risk of stroke was found with no benefit for coronary heart disease. The primary outcome was coronary heart disease (CHD) (nonfatal myocardial infarction and CHD death), with invasive breast cancer as the primary adverse outcome. The dietary modification trial evaluated the effect of a low-fat, high fruit, vegetable, and grain diet on preventing cardiovascular disease and cancer. Participants were randomly assigned to an intervention or a comparison group in the ratio of 2:3 for cost-efficiency. The intervention was an intensive behavioral modification program, using 18 group sessions in the first year and quarterly sessions thereafter, led by specially trained and certified nutritionists. The program was designed to promote dietary change with the goals of reducing total fat to 20% of energy intake, increasing vegetables and fruits to at least 5 servings daily and grains to at least 6 servings daily. The intervention did not include total energy reduction or weight loss goals. Comparison group participants received a copy of the US Department of Health and Human Services' Dietary Guidelines for Americans and other health-related materials but were not asked to make dietary changes. Dietary intake was monitored using the WHI food frequency questionnaire at 1 year and in a rotating one-third subsample every year thereafter. Women completed a medical update questionnaire every 6 months, and medical records were sought for all women reporting colorectal cancer. The primary outcome was invasive colorectal cancer incidence. Participants in the calcium/vitamin D trial were randomized 1:1 to either supplements or placebo. Active tablets contained 500 mg of elemental calcium (as calcium carbonate) and 200 IU of vitamin D3, to be taken twice daily with meals. The presence and severity of symptoms, safety concerns, and outcomes were ascertained at annual clinic visits and telephone or clinic visits at intervening six-month intervals. Risk factors for fracture were assessed by questionnaire, interview, and clinical examination. The primary outcome was incidence of hip fracture. Participants in the observational study attended a baseline examination and were re-examined three years later. Participants completed annual updates of exposures and clinical outcomes by mail. Final data were collected by mail during the close-out period in April 2004 to March 2005. The major clinical outcomes of interest were coronary heart disease, stroke, breast cancer, colorectal cancer, endometrial cancer, ovarian cancer, osteoporotic fractures, diabetes, and total mortality. Most outcomes were initially ascertained by self-report on an annual questionnaire and documented by hospital and related records. Charts with potential cardiovascular, cancer, and fracture outcomes were sent to the local physician adjudicator for evaluation and classification. Staff at the Clinical Coordinating Center coded and adjudicated all cancers of major interest in the study using standardized SEER guidelines. In 2005, WHI participants were invited to join the Extension Study for an additional five years of follow-up in order to collect long-term outcomes. Participants completed annual data collection forms primarily by mail, similar to the OS follow-up. Women reporting study outcomes were contacted by WHI field center staff to obtain additional details and medical records, which were evaluated by physician adjudicators. In 2010, the woman remaining were invited to join the next Extension Study. In the second extension, women were divided into two groups, one of which would have outcomes documented with medical records (the Medical Records Cohort, MRC), and the other would just be followed by self-report (the Self-Report Cohort, SRC). The MRC consists of women who were in the hormone therapy trials, and all African-American and Hispanic women. In 2012-2013, a subset of the MRC was identified for a potential in-home visit to collect blood and several objective measures of physical functioning. Conclusions: Overall health risks exceeded benefits from use of combined estrogen plus progestin after an average 5.2 year follow-up among healthy postmenopausal US women (Rossouw et al., 2002, PMID:12117397). Among postmenopausal women aged 65 years or older, estrogen plus progestin did not improve cognitive function when compared with placebo (Rapp et al., 2003, PMID: 12771113), increased the risk for probable dementia, and did not prevent mild cognitive impairment (Shumaker, et al., 2003, PMID: 12771112). The use of conjugated equine estrogen increased the risk of stroke, decreased the risk of hip fracture, and did not affect CHD incidence in postmenopausal women with prior hysterectomy after an average of 6.8 years of follow-up (Anderson et al., 2004, PMID: 15082697). Over approximately 8 years of follow-up, a low-fat dietary pattern did not reduce the risk of colorectal cancer (Beresford, et al., PMID: 16467233). Calcium with vitamin D supplementation resulted in a small but significant improvement in hip bone density; however, no significant difference was observed in hip fractures (Jackson, et al., 2006, PMID: 16481635). A recent review summarizes the conclusions from the WHI clinical trials with a focus on clinical practice (Manson, et al., 2024, PMID: 38691368).Description of ECG Imaging Data: Electric cardiograms (ECGs) were given to all clinical trial participants at baseline and in years 3, 6, and 9 of the original WHI study.EKG data consist of 12 lead 10 seconds ECGS sampled at 500Hz via GE ECG machines and process via GE MUSE system. The ECG waveform were directly exported from GE MUSE using MUSE export function in XML format, which include EKG waveform data as well as other ECG characteristics. Waveform data is in base64 encoded format, when it is decoded, it is a binary data that can be used to draw waveform graph. Many programming languages and data tools have built in functions to decode base64 strings. All the other necessary information is included in the LeadData section, total byte size, total sample size etc. (usually 1 sample is 2 bytes). See example below: encoded-data (base64 encoded string) JwAoAC0AKAAiACIAJAAkACQAIwAiACIAHgAcABwAGwAZABgAGAAYABcAEwAQABAAEAAL^/AAsADAAM... decoded-binary-data (1 sample is 2 bytes) 270028002D002800220022002400240024002300220022001E001C001C001B00 1900180018001800170013001000100010000B000B000C000C000D000D000D00 0A000A000A0009000600040004000700070005000500020... These binary values are integers (Y axis data of the graph), hence it is a straightforward process to draw the waveform graph. Acquisition dates have been redacted from this ECG data to comply with WHI policy. All acquisition dates within files and in file names have been set to January 1, 1900 (19000101) to comply with this policy.
The goal of this study was to identify genes and loci associated with pigmentation in African populations. It comprises 1,593 participants from Ethiopia, Tanzania, and Botswana. Sexes are evenly represented (768 males and 825 females) and self reported ages ranged from 18 to 103. Mean age was used for individuals without ages. For each participant, a DSM II color meter was used to measure underarm skin reflectance in triplicate. These measures averaged and converted to a melanin index (MI = 100 x log10(1/(x/255)). All individuals were included in a genome wide scan that accounted for age, sex, and genetic relatedness.