Whole-exome sequencing dataset for the Australian Ovarian Cancer Study (AOCS) and The Jikei University School of Medicine (JIKEI) ovarian clear cell carcinoma (OCCC) clinical outliers project, representing 10 donors. Although OCCC are commonly resistant to platinum based chemotherapy, good clinical outcomes are observed in a subset of patients. Exome sequencing was performed in 10 patients who had either unusually long survival (N=5) or had a very short time to progression (N=5).
Oncocytic thyroid carcinoma, also known as Hürthle cell thyroid carcinoma, accounts for only a small percentage of all thyroid cancers. However, this malignancy often presents at an advanced stage and poses unique challenges to patients and clinicians. Surgical resection of the tumor accompanied in some cases by radioactive iodine treatment, radiation and chemotherapy are the established modes of therapy. Knowledge of the perturbed oncogenic pathways can provide better understanding of the mechanism of disease and thus opportunities for more effective clinical management. Initially, two oncocytic thyroid carcinomas and their matched normal tissues were profiled using whole genome sequencing. Subsequently, 72 oncocytic thyroid carcinomas, one cell line and 5 Hürthle cell adenomas were examined by targeted sequencing for the presence of mutations in multiple endocrine neoplasia I (MEN1) gene. We report the identification of MEN1 loss-of-function mutations in approximately 7% of patients diagnosed with oncocytic thyroid carcinoma. Whole genome sequence data also revealed large regions of copy number variation encompassing nearly the entire genomes of these tumours. Menin, a ubiquitously expressed nuclear protein, is a well-characterized tumor suppressor whose loss is the cause of multiple endocrine neoplasia type 1 syndrome. Menin is involved in several major cellular pathways such as regulation of transcription, control of cell cycle, apoptosis and DNA damage repair pathways. Mutations of this gene in a subset of Hürthle cell tumors point to a potential role for this protein and its associated pathways in thyroid tumorigenesis.
COGA is a family study of alcoholism, in which the subjects have been drawn from the Collaborative Study on the Genetics of Alcoholism (COGA), a large, ongoing family-based study that includes subjects from seven sites around the US. COGA has gathered detailed, standardized data on study participants, including diagnostic and neurophysiological assessments. This project has already proved successful in identifying several genes that influence the risk for alcoholism and neurophysiological endophenotypes, which have been independently replicated. COGA data were included as part of two Genetic Analysis Workshops, and the phenotypes are familiar to the genetics community. Alcoholic probands were recruited from treatment facilities, assessed by personal interview, and after securing permission, other family members were also assessed. A set of comparison families was drawn from the same communities as the families recruited through an alcoholic proband. Assessment involved a detailed personal interview developed for this project, the Semi-Structured Assessment for the Genetics of Alcoholism (SSAGA), which gathers detailed information on alcoholism related symptoms along with other drugs and psychiatric symptoms. Many participants also came to the laboratories for electroencephalographic studies. Neurophysiological features that have been shown to be useful endophenotypes for which we have linkage and in some cases association results, are included for a subset of the case-control sample: the beta power of the resting electroencephalogram (EEG), the P3(00) amplitude of the visual event-related potential (ERP), and the theta and delta event-related oscillations (EROs) underlying the P3. As part of COGA, a set of informative families was selected to have Genome-Wide Association data obtained within families. Genotyping was performed using the Illumina Human OmniExpress array 12.VI to genotype 2,282 subjects selected from 118 densely affected families. Genotyping was performed at the Genome Technology Access Center at Washington University School of Medicine in St. Louis. In addition, we also included genotypes for subjects (n=275 subjects) from these 118 families who were genotyped in a previous case-control GWAS using the Illumina 1M array. For quality control purposes, 51 of the 275 subjects were genotyped again on the Illumina Human OmniExpress array at the Washington University School of Medicine core facility.In addition, exome sequencing data on a subset of individuals with GWAS were added in version 2 (v2). For v2, a subset had 30X Whole Genome Sequencing (WGS) as part of the NIDA Sequencing Initiative. The subset contained two distinct sets: Sibling pairs where one sibling had at least two dependence diagnoses in the set (alcohol, cannabis, cocaine, and opioid), and the other had none, and non-related Case-Control pairs matched for age and ethnicity where the cases had alcohol and at least 2 other dependence diagnoses and controls had none. After sequencing, some sibling pairs are re-classified as half siblings. Three VCF files (small variants, structural variants, and copy number variations) are provided. Additional substance use variables are made available in v2. We note that the full sample data are deposited in four dbGaP submissions and the sequenced samples are split across all four: CIDR: Collaborative Study on the Genetics of Alcoholism Case Control Study [phs000125]. GWAS data on cases (primarily probands) and controls drawn from the families. Families with highest density of alcohol dependence and/or extreme event-related oscillation data [phs000763]. GWAS data on 119 extended families of European descent are available here, along with extensive documentation. Study on the Genetics of Alcoholism (COGA): African American Family GWAS [phs000976]. GWAS data on all available COGA families of African descent are available. COGA: Smokescreen GWAS [phs001208]. GWAS data on all remaining COGA DNA samples, primarily of other racial background, were genotyped on the Smoke Screen array. A listing of all sequenced pairs is provided in the documentation to facilitate the merging of these samples.
Little is known about the transcriptomic profile of individuals who are exposed to SARS-CoV-2 yet resist becoming PCR positive. To investigate this, longitudinal whole-blood samples were taken (0, 7, 14, and 28 days after enrolment) from PCR positive and PCR negative SARS-CoV-2-naïve household contacts who were recently exposed to a COVID-19 index. Samples were also taken from pre- and post-pandemic unexposed controls. Total RNA was extracted from PAXgene tubes before undergoing poly(A) selection followed by globin and rRNA depletion. DNA libraries were constructed using the NEBNext® Ultra™ II Directional RNA Library Prep Kit for Illumina. All samples were then sequenced across 2 flowcells of an Illumina HiSeq 4000.
This is a family-based genome-wide association study of congenital left-sided cardiac lesions (LSLs). Cases with LSLs and their parents were recruited from the Cardiac Center at The Children's Hospital of Philadelphia between 1997 and 2010. The discovery dataset included 405 LSL case-parent triads. DNA was extracted from blood or saliva samples from these subjects and genotyping was conducted using the Illumina Infinium II HumanHap550K array (V3). This study was conducted with approval from the Institutional Review Board for the Protection of Human Subjects of The Children's Hospital of Philadelphia and all research subjects provided either written informed consent or, when appropriate, assent.
Heparin-induced thrombocytopenia (HIT) is an unpredictable, potentially catastrophic adverse effect resulting from an immune response to platelet factor 4 (PF4)/heparin complexes. We performed a genome-wide association study (GWAS) with positive functional assay as the outcome. Genetic associations were investigated in a cohort of functional assay-confirmed HIT cases, antibody-positive (function assay-negative) controls, and antibody negative controls. Both PF4-specific enzyme-linked immunosorbent assay (ELISA) and serotonin release assay (SRA) were positive in every HIT case. GWAS genotyping was performed at the Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences (IMS). All samples in the discovery and replication cohorts were genotyped using the lllumina® Infinium HumanOmniExpressExome BeadChip, which contains 958,497 markers, including 273,000 functional exonic markers. Quality control was accomplished using sample and marker call rate ≥98%, sex mismatch, duplicate concordance, and removal of related samples through identity by descent. Genomic imputation was performed for samples using Minimac4 and the phase3 v5 reference panel data from the 1000 genomes project. The primary GWAS analysis used multivariable logistic regression assuming an additive genetic model with functional assay-confirmed case status as the outcome. Both PF4/heparin antibody negative and antibody positive controls were considered controls in the primary analysis. Logistic regressions were used to generate odds ratios and 95% confidence intervals (95%CIs) after adjustment for age, sex, and first three principal components.
We describe a male proband with normal B lymphocyte and natural killer (NK) cell numbers, but who had T lymphopenia beginning at 1 week of age. His peripheral blood contained very few T cell receptor excision circles (TREC), suggesting impaired thymic T cell development. He received an allogeneic hematopoietic stem cell marrow transplant, but despite engraftment, it did not fully correct the T lymphopenia. Sequencing excluded known immunodeficiency genes; however, whole exome sequencing revealed X-linked inheritance by the male proband of a missense mutation in MED14 (MED14V763A), a component of the mediator complex. Functional screening of the MED14V763A variant in zebrafish suggested that MED14 is essential for normal development, since morpholino (MO)-mediated loss of MED14 function attenuated T cell development, but it was rescued by ectopic expression of the wild type human MED14 but not by the MED14V763A variant. This defect in T cell repopulation is consistent with the incomplete T cell reconstitution observed upon transplantation of the proband with bone marrow from his healthy male sibling, who was found to have the same MED14V763A variant. However, the brother was normal, indicating that the altered MED14 could not by itself explain the disease. WES also revealed a mutation in the L1 cell adhesion molecule (L1CAMQ498H); however, introducing that mutation in vivo in mice did not disrupt T cell development. Consequently, the immunodeficiency in the patient is not fully elucidated. It is possible that MED14 contributes, but it is also possible that another gene is fully responsible for the disease.
Endobronchial ultrasound-guided transbronchial needle aspiration (EBUS-TBNA) is often the only source of tumor tissue from patients with advanced, inoperable lung cancer. EBUS-TBNA aspirates are used for the diagnosis, staging, and genomic testing to inform therapy options. Here we extracted DNA and RNA from 220 EBUS-TBNA aspirates to evaluate their suitability for whole genome (WGS), whole exome (WES), and comprehensive panel sequencing. For a subset of 40 cases, the same nucleic acid extraction was sequenced using WGS, WES, and the TruSight Oncology 500 assay. Genomic features were compared between sequencing platforms and compared with those reported by clinical testing. A total of 204 aspirates (92.7%) had sufficient DNA (100 ng) for comprehensive panel sequencing, and 109 aspirates (49.5%) had sufficient material for WGS. Comprehensive sequencing platforms detected all seven clinically reported tier 1 actionable mutations, an additional three (7%) tier 1 mutations, six (15%) tier 2–3 mutations, and biomarkers of potential immunotherapy benefit (tumor mutation burden and microsatellite instability). As expected, WGS was more suited for the detection and discovery of emerging novel biomarkers of treatment response. WGS could be performed in half of all EBUS-TBNA aspirates, which points to the enormous potential of EBUS-TBNA as source material for large, well-curated discovery-based studies for novel and more effective predictors of treatment response. Comprehensive panel sequencing is possible in the vast majority of fresh EBUS-TBNA aspirates and enhances the detection of actionable mutations over current clinical testing.
Objectives: The clinical trial assessed the safety and efficacy of three interventions. Specifically, it evaluated (1) the major health benefits and risks of estrogen plus progestin and estrogen alone, (2) the effects of a low-fat eating pattern on risk of colorectal cancer, and (3) the efficacy of calcium with vitamin D supplementation for preventing hip and other fractures. The objective of the memory study was to determine whether estrogen plus progestin therapy protects global cognitive function, and evaluate the therapy's effect on the incidence of dementia and mild cognitive impairment.The observational study is examining the relationship between lifestyle, socioeconomic, health, and other risk factors with cardiovascular, breast cancer, colorectal cancer and osteoporotic fracture outcomes. Secondary objectives include providing more reliable estimates of the extent to which known risk factors predict disease, more precise estimates of new occurrences of disease, and to provide a future resource for the identification of new or novel risk factors especially factors found in blood. Background: The Women's Health Initiative (WHI) is a long-term national health study that has focused on strategies for preventing the major causes of death, disability, and frailty in postmenopausal women, specifically heart disease, cancer, and osteoporotic fractures. The WHI is primarily composed of an observational study (OS), as well a clinical trial (CT) with three components: Hormone Replacement Therapy (HT), Dietary Modification, (DM) and Calcium/Vitamin D supplementation (CaD).Prior to the WHI, observational studies suggested that postmenopausal hormone therapy was associated with a decreased risk of coronary heart disease (CHD). Potential cardioprotection was based on generally supportive data on lipid levels in intermediate outcome clinical trials, trials in nonhuman primates, and a large body of observational studies suggesting a 40% to 50% reduction in risk among users of either estrogen alone or, less frequently, combined estrogen and progestin. Observational studies primarily examining unopposed estrogen preparations have suggested a 30% to 50% reduction in coronary events, and an 8% to 30% increase in breast cancer with extended use. Other research findings indicated that hormone therapy was also associated with a decreased risk of osteoporosis and increased bone density. The WHI HT trials were designed to test the effects of postmenopausal hormone therapy on risk for coronary heart disease and assess overall risks and benefits in predominantly healthy women. The Women's Health Initiative Memory Program (WHIMS) consists of a suite of studies which include cohorts of women who participated in the WHI HT trials. Postmenopausal women have a greater risk than men of developing Alzheimer's disease, but studies of the effects of estrogen therapy on Alzheimer's disease have been inconsistent. Additionally, observational studies have suggested that postmenopausal hormone treatment may improve cognitive function, but data from randomized clinical trials have been sparse and inconclusive. International comparisons and migration studies have suggested that countries with 50% lower fat intake than the US population had approximately one third the risk of colorectal cancer. Additionally, fairly consistent evidence existed for an effect of dietary fat, vegetables and fruits, and grains on colorectal cancer risk from within-country observational studies, although the protective effect of lower fat intake was no longer clear after adjusting for energy intake. The WHI DM trial was the first randomized trial to directly address the health effects of a low-fat eating pattern in predominantly healthy postmenopausal women from diverse racial/ethnic, geographic, and socioeconomic backgrounds. Osteoporosis is a major cause of injury, loss of independence, and death, and contributes to hip fractures. Observational evidence and data from previous randomized clinical trials suggest that calcium and/or vitamin D supplements may slow bone loss and reduce the risk of falls in postmenopausal and elderly women. However, evidence from trials, observational studies, and meta-analyses of calcium and vitamin D supplementation with respect to hip and other fractures was limited at the time the WHI was initiated. In two prior randomized trials, calcium plus vitamin D supplements did not reduce the risk of nonvertebral fractures among older women. When the WHI CaD trial was designed, guidelines recommended daily intakes of 800 to 1200 mg of calcium with 400 IU of vitamin D for the prevention of osteoporosis, which was not met by many American women. Therefore, the WHI CaD trial was designed to test the primary hypothesis that postmenopausal women randomly assigned to calcium plus vitamin D supplementation would have a lower risk of hip fracture and, secondarily, of all fractures than women assigned to placebo. Subjects: Postmenopausal women ages 50 to 79 were eligible to participate. A woman was considered postmenopausal if she had experienced no vaginal bleeding for 6 months (12 months for women under 55 years of age), had had a hysterectomy, or had ever used postmenopausal hormones. Recruitment was carried out in 40 US clinical centers in 1993-1998. The clinical trial components had additional specific inclusion or exclusion criteria.A total of 68,132 women were randomized into at least one component of the clinical trial. 27,347 women were enrolled in the hormone therapy component with 16,608 in the estrogen plus progestin trial and 10,739 in the unopposed estrogen trial, 48,835 women were enrolled in the diet modification component, and 36,282 women were enrolled in the calcium/vitamin D component. 7,479 women 65 years of age and older at baseline and that participated in the HT trial component were enrolled in the ancillary memory study. Women who were either ineligible or unwilling to participate in the clinical trial component were enrolled in the observational study. For example, many potential participants to the clinical trial component of the study were already undertaking a low fat diet or were using hormone replacement therapy. The effect of the selection process was that women enrolled in the observational study tended to have healthier lifestyles compared to those enrolled in the clinical trial. In total, 93,676 subjects were enrolled in WHI OS, with over 16% being members of a racial/ethnic minority group. The first WHI Extension Study enrolled 115,407 consenting participants from all components of the original WHI study for an additional five years of follow-up, from 2005 to 2010. In 2010, 93,567 women consented to continued follow-up. Design: The clinical trial component of the WHI included three randomized comparisons: hormone therapy, dietary modification, and calcium/vitamin D supplementation. Women could have been randomized into one, two or all three trials.The hormone therapy trial enrolled women to one of two double-blinded trials: estrogen (0.625 mg of conjugated equine estrogens daily) plus progestin (2.5 mg of medroxyprogesterone acetate daily) or estrogen alone. Women with a prior hysterectomy were eligible for the trial of unopposed estrogen. Women with an intact uterus at screening were initially also eligible for unopposed estrogen, but were reassigned to the trial of combined postmenopausal hormones beginning in 1995. Both trials randomized participants 1:1 to either hormone therapy or placebo. A 3-month washout period was required before baseline evaluation of women using postmenopausal hormones at initial screening. Study participants were contacted by telephone 6 weeks after randomization to assess symptoms and reinforce adherence. Follow-up contacts by telephone or clinic visit occurred every 6 months, with clinic visits required annually. The estrogen plus progestin trial was halted in July 2002 after a mean 5.2 years of follow-up because health risks, including increased risk of breast cancer and cardiovascular disease, exceeded benefits. The estrogen alone trial was stopped early in March 2004, because an increased risk of stroke was found with no benefit for coronary heart disease. The primary outcome was coronary heart disease (CHD) (nonfatal myocardial infarction and CHD death), with invasive breast cancer as the primary adverse outcome. The dietary modification trial evaluated the effect of a low-fat, high fruit, vegetable, and grain diet on preventing cardiovascular disease and cancer. Participants were randomly assigned to an intervention or a comparison group in the ratio of 2:3 for cost-efficiency. The intervention was an intensive behavioral modification program, using 18 group sessions in the first year and quarterly sessions thereafter, led by specially trained and certified nutritionists. The program was designed to promote dietary change with the goals of reducing total fat to 20% of energy intake, increasing vegetables and fruits to at least 5 servings daily and grains to at least 6 servings daily. The intervention did not include total energy reduction or weight loss goals. Comparison group participants received a copy of the US Department of Health and Human Services' Dietary Guidelines for Americans and other health-related materials but were not asked to make dietary changes. Dietary intake was monitored using the WHI food frequency questionnaire at 1 year and in a rotating one-third subsample every year thereafter. Women completed a medical update questionnaire every 6 months, and medical records were sought for all women reporting colorectal cancer. The primary outcome was invasive colorectal cancer incidence. Participants in the calcium/vitamin D trial were randomized 1:1 to either supplements or placebo. Active tablets contained 500 mg of elemental calcium (as calcium carbonate) and 200 IU of vitamin D3, to be taken twice daily with meals. The presence and severity of symptoms, safety concerns, and outcomes were ascertained at annual clinic visits and telephone or clinic visits at intervening six-month intervals. Risk factors for fracture were assessed by questionnaire, interview, and clinical examination. The primary outcome was incidence of hip fracture. Participants in the observational study attended a baseline examination and were re-examined three years later. Participants completed annual updates of exposures and clinical outcomes by mail. Final data were collected by mail during the close-out period in April 2004 to March 2005. The major clinical outcomes of interest were coronary heart disease, stroke, breast cancer, colorectal cancer, endometrial cancer, ovarian cancer, osteoporotic fractures, diabetes, and total mortality. Most outcomes were initially ascertained by self-report on an annual questionnaire and documented by hospital and related records. Charts with potential cardiovascular, cancer, and fracture outcomes were sent to the local physician adjudicator for evaluation and classification. Staff at the Clinical Coordinating Center coded and adjudicated all cancers of major interest in the study using standardized SEER guidelines. In 2005, WHI participants were invited to join the Extension Study for an additional five years of follow-up in order to collect long-term outcomes. Participants completed annual data collection forms primarily by mail, similar to the OS follow-up. Women reporting study outcomes were contacted by WHI field center staff to obtain additional details and medical records, which were evaluated by physician adjudicators. In 2010, the woman remaining were invited to join the next Extension Study. In the second extension, women were divided into two groups, one of which would have outcomes documented with medical records (the Medical Records Cohort, MRC), and the other would just be followed by self-report (the Self-Report Cohort, SRC). The MRC consists of women who were in the hormone therapy trials, and all African-American and Hispanic women. In 2012-2013, a subset of the MRC was identified for a potential in-home visit to collect blood and several objective measures of physical functioning. Conclusions: Overall health risks exceeded benefits from use of combined estrogen plus progestin after an average 5.2 year follow-up among healthy postmenopausal US women (Rossouw et al., 2002, PMID:12117397). Among postmenopausal women aged 65 years or older, estrogen plus progestin did not improve cognitive function when compared with placebo (Rapp et al., 2003, PMID: 12771113), increased the risk for probable dementia, and did not prevent mild cognitive impairment (Shumaker, et al., 2003, PMID: 12771112). The use of conjugated equine estrogen increased the risk of stroke, decreased the risk of hip fracture, and did not affect CHD incidence in postmenopausal women with prior hysterectomy after an average of 6.8 years of follow-up (Anderson et al., 2004, PMID: 15082697). Over approximately 8 years of follow-up, a low-fat dietary pattern did not reduce the risk of colorectal cancer (Beresford, et al., PMID: 16467233). Calcium with vitamin D supplementation resulted in a small but significant improvement in hip bone density; however, no significant difference was observed in hip fractures (Jackson, et al., 2006, PMID: 16481635). A recent review summarizes the conclusions from the WHI clinical trials with a focus on clinical practice (Manson, et al., 2024, PMID: 38691368).Description of ECG Imaging Data: Electric cardiograms (ECGs) were given to all clinical trial participants at baseline and in years 3, 6, and 9 of the original WHI study.EKG data consist of 12 lead 10 seconds ECGS sampled at 500Hz via GE ECG machines and process via GE MUSE system. The ECG waveform were directly exported from GE MUSE using MUSE export function in XML format, which include EKG waveform data as well as other ECG characteristics. Waveform data is in base64 encoded format, when it is decoded, it is a binary data that can be used to draw waveform graph. Many programming languages and data tools have built in functions to decode base64 strings. All the other necessary information is included in the LeadData section, total byte size, total sample size etc. (usually 1 sample is 2 bytes). See example below: encoded-data (base64 encoded string) JwAoAC0AKAAiACIAJAAkACQAIwAiACIAHgAcABwAGwAZABgAGAAYABcAEwAQABAAEAAL^/AAsADAAM... decoded-binary-data (1 sample is 2 bytes) 270028002D002800220022002400240024002300220022001E001C001C001B00 1900180018001800170013001000100010000B000B000C000C000D000D000D00 0A000A000A0009000600040004000700070005000500020... These binary values are integers (Y axis data of the graph), hence it is a straightforward process to draw the waveform graph. Acquisition dates have been redacted from this ECG data to comply with WHI policy. All acquisition dates within files and in file names have been set to January 1, 1900 (19000101) to comply with this policy.
The Osteoarthritis Initiative (OAI) is a publicly and privately funded prospective longitudinal cohort with a primary objective of identifying risk factors for incidence and progression of tibiofemoral knee OA. The OAI utilized a focused population-based recruitment to enroll 4,674 men and women between the ages of 45-79 years who either had radiographic symptomatic knee OA or who were without radiographic symptomatic OA in both knees but were considered high risk for OA because they had two or more known risk factors for knee OA. Subjects were recruited into the baseline phase of the OAI at multiple sites throughout the US between 2004 and 2006. All subjects were invited back for follow-up examinations to assess incidence or progression of OA annually, for up to 5 years. Phenotype data from the baseline and follow-up examinations are available for public access from the Osteoarthritis Initiative (OAI) database at http://www.oai.ucsf.edu/. The Genetic Components of Knee Osteoarthritis (GeCKO) Study was initiated in 2009 as a genetic ancillary study to perform a genome-wide association study to identify genetic variants associated with radiographic osteoarthritis. This study included 4,482 individuals participating in the parent OAI study. Following sample cleaning (e.g., removal of duplicates, sex mismatches, poor sample quality, etc.), the final sample set included 4,129 individuals.