RNASeq files for paper titled "Fusion oncoproteins and cooperating mutations define disease phenotypes in NUP98-rearranged leukemia" PMID: 39974131, PMCID: PMC11838931, DOI: 10.1101/2025.01.21.25320683
scRNASeq files for paper titled "Fusion oncoproteins and cooperating mutations define disease phenotypes in NUP98-rearranged leukemia" PMID: 39974131, PMCID: PMC11838931, DOI: 10.1101/2025.01.21.25320683
The Familial Study of Essential Tremor (FASET) was designed to identify susceptibility genes for Essential Tremor. ET is among the most common neurological diseases with a prevalence (age > 40 years) estimated to be 4.0% and prevalence in advanced age (> 90 years) exceeding 20%. ET, often referred to as "familial tremor", is generally regarded as a highly genetic disorder with families, with affected members over multiple generations, and twin studies show high concordance among monozygotic twins. Probands (affected with ET) and relatives were enrolled in a family study of ET at Columbia University, New York between 2011 - 2014. The study was approved by the Institutional Review Board at Columbia University and written informed consent was obtained from all enrollees. The criteria for enrollment were: 1) the proband had early-onset ET with age at onset < 50 years, 2) the proband had a diagnosis of definite or probable ET, 3) in addition to the proband, there were at least two affected relatives in the family, 4) additional affected and unaffected family members were willing to participate in the study, and 5) the families contained more than two affected individuals in different generations. Blood samples were also collected for genetic research. For the genetic analyses, we excluded enrollees that we or others had diagnosed with Parkinson's disease (PD) or dystonia. The final sample includes 52 families (52 probands [affected with ET]) and 155 relatives). The number of affected individuals enrolled per family ranged from 3 - 7 (mean = 4.1). Genetic samples from FASET were analyzed with whole genome SNP genotyping (for linkage analyses) and whole exome sequencing. It is hoped that this resource will better help researchers to understand the genetic causes of ET and underlying disease pathogenesis.
MESA The Multi-Ethnic Study of Atherosclerosis (MESA) is a study of the characteristics of subclinical cardiovascular disease (disease detected non-invasively before it has produced clinical signs and symptoms) and the risk factors that predict progression to clinically overt cardiovascular disease or progression of the subclinical disease. MESA researchers study a diverse, population-based sample of 6,814 asymptomatic men and women aged 45-84. Thirty-eight percent of the recruited participants are white, 28 percent African-American, 22 percent Hispanic, and 12 percent Asian, predominantly of Chinese descent. Participants were recruited from six field centers across the United States: Wake Forest University, Columbia University, Johns Hopkins University, University of Minnesota, Northwestern University and University of California - Los Angeles. Each participant received an extensive physical exam to determine coronary calcification, ventricular mass and function, flow-mediated endothelial vasodilation, carotid intimal-medial wall thickness and presence of echogenic lucencies in the carotid artery, lower extremity vascular insufficiency, arterial wave forms, electrocardiographic (ECG) measures, standard coronary risk factors, sociodemographic factors, lifestyle factors, and psychosocial factors. Selected repetition of subclinical disease measures and risk factors at follow-up visits allows study of the progression of disease. Blood samples have been assayed for putative biochemical risk factors and stored for case-control studies. DNA has been extracted and lymphocytes cryopreserved (for possible immortalization) for study of candidate genes and possibly, genome-wide scanning, expression, and other genetic techniques. Participants are being followed for identification and characterization of cardiovascular disease events, including acute myocardial infarction and other forms of coronary heart disease (CHD), stroke, and congestive heart failure; for cardiovascular disease interventions; and for mortality. In addition to the six Field Centers, MESA involves a Coordinating Center, a Central Laboratory, and Central Reading Centers for Computed Tomography (CT), Magnetic Resonance Imaging (MRI), Ultrasound, and Electrocardiography (ECG). Protocol development, staff training, and pilot testing were performed in the first 18 months of the study. The first examination took place over two years, from July 2000 - July 2002. It was followed by four examination periods that were 17-20 months in length. Participants have been contacted every 9 to 12 months throughout the study to assess clinical morbidity and mortality. MESA Family The general goal of the MESA Family Study, an ancillary study to MESA funded by a grant from NHLBI, is to apply modern genetic analysis and genotyping methodologies to delineate the genetic determinants of early atherosclerosis. This is being accomplished by utilizing all the current organizational structures of the Multi-Ethnic Study of Atherosclerosis (MESA) and Genetic Centers at Cedars-Sinai Medical Center and University of Virginia. In the MESA Family Study, the goal is to locate and identify genes contributing to the genetic risk for cardiovascular disease (CVD), by looking at the early changes of atherosclerosis within families (mainly siblings). 2128 individuals from 594 families, yielding 3,026 sibpairs divided between African Americans and Hispanic-Americans, were recruited by utilizing the existing framework of MESA. MESA Family studied siblings of index subjects from the MESA study and from new sibpair families (with the same demographic characteristics) and is determining the extent of genetic contribution to the variation in coronary calcium (obtained via CT Scan) and carotid artery wall thickness (B-mode ultrasound) in the two largest non-majority U.S. populations. In a small proportion of subjects, parents of MESA index subjects participating in MESA Family were studied but only to have blood drawn for genotyping. The MESA Family cohort was recruited from the six MESA Field Centers. MESA Family participants underwent the same examination as MESA participants during May 2004 - May 2007. DNA was extracted and lymphocytes immortalized for study of candidate genes, genome-wide linkage scanning, and analyzed for linkage with these subclinical cardiovascular traits. While linkage analysis is the primary approach being used, an additional aspect of the MESA Family Study takes advantage of the existing MESA study population for testing a variety of candidate genes for association with the same subclinical traits. Genotyping and data analysis will occur throughout the study. MESA Air The general goal of the Multi-Ethnic Study of Atherosclerosis and Air Pollution ('MESA Air') is to prospectively examine the relation between an individual level assessment of long-term ambient air pollution exposures (including PM2.5 and the progression of subclinical cardiovascular disease in a multi-city, multi-ethnic cohort. MESA Air will also prospectively examine the relationship between an individual level assessment of long-term ambient air pollution exposures and the incidence of cardiovascular disease, including myocardial infarction and cardiovascular death. MESA AIR is funded by a grant from the United States Environmental Protection Agency to the University of Washington and subcontracts from the UW to other participating institutions. MESA Air will assess if ambient air pollution is associated with changes over time in subclinical measures of atherosclerosis and plasma markers of inflammation, oxidative damage, and endothelial activation in a longitudinal data model, adjusting for age, race/ethnicity, socioeconomic status, and specific cardiovascular risk factors (such as diabetes, hypertension, smoking, and diet). The study will similarly assess if the incidence of cardiovascular events is associated with long-term exposure to ambient air pollution, using a proportional hazards model. The study includes refinement of statistical tools, and explores joint/independent effects of acute and long-term pollutant exposure in the occurrence of cardiovascular disease. The MESA Air study is built on the foundation of the ongoing MESA study. The parent MESA Study cohort is located in six geographic areas ('Field Centers') that capture tremendous exposure heterogeneity, comparable to or greater than the variability in locations of prior U.S. cohort studies. In addition to the six Field Centers, the study involves a Coordinating Center, a Central Laboratory, and Reading Centers for Computed Tomography (CT), ultrasound and air pollution data. The cohort for the MESA Air study currently includes 6226 subjects: 5479 enrolled in the parent MESA study; 257 recruited specifically for this study, and 490 recruited from the MESA Family study. The entire MESA Air cohort will be followed over a 10-year project period for the occurrence of cardiovascular disease events. On two occasions over the ten-year study period, 3600 subjects from the MESA Air cohort, residing in nine locales, will undergo computed tomography scanning to assess presence and extent of coronary artery calcification (CAC), and ultrasound of the carotid artery to determine intima-media thickness (IMT). We will also repeatedly assess plasma markers of inflammation, oxidative damage, and endothelial function in 720 subjects. MESA Air adds state-of-the-art air pollution exposure assessment information to the MESA cohort study, and introduces new subjects and outcome measures to achieve our aims. The study will assess long-term individual-level exposure to ambient air pollutants for each subject using community-scale monitoring, outdoor spatial variation, subject proximity to pollution sources, pollutants' infiltration efficiency, and personal time-activity information. The exposure models will be validated using detailed monitoring in a subset of subjects. The MESA Cohort is utilized in the following dbGaP substudies. To view genotypes, analysis, expression data, other molecular data, and derived variables collected in these substudies, please click on the following substudies below or in the "Substudies" box located on the right hand side of this top-level study page phs000209 MESA Cohort. phs000420 MESA SHARe phs000283 MESA CARe phs000403 MESA ESP Heart-GO
In the context of the study 'Exploring Adaptive Phenotypes for the Human Calcium-Sensing Receptor Polymorphism R990G,' this repository contains 11 VCF files. Each file includes genetic data from 22 autosomes, representing distinct subpopulations. The subpopulations are as follows: Malaysian: Aeta (15 samples) Agta (11 samples) Batak (10 samples) Mamanwa (6 samples) Filipino: Jehai (13 samples) Kintaq (6 samples) Temiar (5 samples) Papua New Guinea: Highlands (2 samples) Koinambe (10 samples) Kosipe (10 samples) Sepik (2 samples) The numbers in parentheses indicate the sample size for each respective subpopulation.
Lung carcinoma is the leading cause of cancer death in the United States and world-wide; lung adenocarcinoma is the most common cause of lung cancer. Pilot studies of lung adenocarcinoma with hybrid-capture based whole exome sequencing will enable us to identify new targets for therapy and improve diagnosis. We will analyze a blend of whole exome and whole genome sequencing data as well as copy number and somatic mutation calls for 200 tumor and matched normal controls. When completed this study will represent the most comprehensive lung adenocarcinoma genome dataset to date.
The Nex-StoCT and ACMG recommend that validation of an NGS-based diagnostic test include performance test characteristics for assay accuracy, analytical sensitivity and specificity, reproducibility and repeatability. To measure these parameters for the GEDi capture and sequencing test, 4 samples (three randomly selected patient samples and the NA12878 HapMap sample) were prepared and sequenced in triplicate on each of three separate days. We also performed WES and SNP array genotyping analyses of these 4 samples using Agilent V4+UTR whole exome enrichment kit and Illumina Omni 2.5 SNP arrays, respectively.
T cells were sorted from metastatic tumors (colon, rectal, breast, anal, melanoma primary histologies) following tumor dissociation and single-cell sequencing was performed (RNAseq and TCRseq). For samples from tumors that had been previously studied, cells were analyzed comparing known neoantigen-reactive TCR-expressing cells to other T cells. For samples without previously identified neoantigen reactivity, TCRs were predicted on the basis of T cell transcriptomic states. To determine candidate neoantigen screening, exome and RNA sequencing were performed on fragments of matched tumors from which T cells were sequenced
The focus of this cross-sectional study was to characterize the vaginal microbiota of asymptomatic, sexually-active, and reproductive-age women representing four self-reported ethnic groups (White, Black, Asian, and Hispanic). Each woman self-collected two mid-vaginal swabs. The first swab was used to prepare a vaginal smear for Nugent gram stain analysis. The second swab was used for DNA extraction, 16S rRNA gene amplicon sequencing, and detection of Candida albicans. In addition, participants measured vaginal pH and completed a detailed questionnaire on sexual health and behaviors.
This study of three metastatic melanoma patients utilized exome sequencing and RNA sequencing to help identify putative neoantigen peptides that could be further validated by standard immunological assays before design of a personalized vaccine. For each set of neoantigens identified and verified by standard assays, we designed GMP peptides that correspond to the top neoantigens, then used these to condition dendritic cell isolates from each patient. The conditioned dendritic cells were used as the vaccine in three rounds of vaccination, followed by monitoring for T cell memory response to each of the neoantigens.