This dataset consists of RNA sequencing data from an ETS2 overexpression experiment in M0 macrophages. Controlled overexpression of ETS2 mRNA or control mRNA (an equivalent amount of mRNA encoding the reverse complement of ETS2 – thereby controlling for the quantity, length and purine/pyrimidine composition of the transfected RNA but with a transcript that would not be translated) was induced in resting, non-activated (M0) macrophages by transfecting predefined amounts of in vitro transcribed mRNA. To minimise non-specific activation due to the transfected RNA, in vitro transcription was performed using co-transcriptional capping (to minimise uncapped products), and incorporating modified, minimally immunogenic nucleotides (replacing uridine with N1-methyl-pseudouridine and cytidine with methylcytidine). After 18 hours, transfected cells were were activated with low dose LPS and harvested for RNA-sequencing 6 hours later. Sequencing libraries were prepared from 10ng RNA using the SMARTer Stranded Total RNA-Seq Kit v2 - Pico Input Mammalian (Takara) following the manufacturer’s instructions. The quality and molarity of all libraries was assessed using a BioAnalyzer 2100 and the libraries were sequenced on a NovaSeq6000. Raw data are provided as 100 bp paired-end Illumina reads from n = 8 donors.
The Cleveland Family Study is the largest family-based study of sleep apnea world-wide, consisting of 2284 individuals (46% African American) from 361 families studied on up to 4 occasions over a period of 16 years. The study was begun in 1990 with the initial aims of quantifying the familial aggregation of sleep apnea. NIH renewals provided expansion of the original cohort (including increased minority recruitment) and longitudinal follow-up, with the last exam occurring in February 2006. Index probands (n=275) were recruited from 3 area hospital sleep labs if they had a confirmed diagnosis of sleep apnea and at least 2 first-degree relatives available to be studied. In the first 5 years of the study, neighborhood control probands (n=87) with at least 2 living relatives available for study were selected at random from a list provided by the index family and also studied. All available first degree relatives and spouses of the case and control probands also were recruited. Second-degree relatives, including half-sibs, aunts, uncles and grandparents, were also included if they lived near the first degree relatives (cases or controls), or if the family had been found to have two or more relatives with sleep apnea. Blood was sampled and DNA isolated for participants seen in the last two exam cycles (n=1447). The sample, which is enriched with individuals with sleep apnea, also contains a high prevalence of individuals with sleep apnea-related traits, including: obesity, impaired glucose tolerance, and HTN. Phenotyping data have been collected over 4 exam cycles, each occurring ~every 4 years. The last three exams targeted all subjects who had been studied at earlier exams, as well as new minority families and family members of previously studied probands who had been unavailable at prior exams. Data from one, two, three and four visits are available for 412, 630, 329 and 67, participants, respectively. In the first 3 exams, participants underwent overnight in-home sleep studies, allowing determination of the number and duration of hypopneas and apneas, sleep period, heart rate, and oxygen saturation levels; anthropometry (weight, height, and waist, hip, and neck circumferences); resting blood pressure; spirometry; standardized questionnaire evaluation of symptoms, medications, sleep patterns, quality of life, daytime sleepiness measures and health history; venipuncture and measurement of total and HDL cholesterol. The 4th exam (2001-2006) was designed to collect more detailed measurements of sleep, metabolic and CVD phenotypes and included measurement of state-of-the-art polysomnography, with both collection of blood and measurement of blood pressure before and after sleep, and anthropometry, upper airway assessments, spirometry, exhaled nitric oxide, and ECG performed the morning after the sleep study. Data have been collected by trained research assistants or GCRC nurses following written Manuals of Procedures who were certified following standard approaches for each study procedure. Ongoing data quality, with assessment of within or between individual drift, has been monitored on an ongoing basis, using statistical techniques as well as regular re-certification procedures. Between and within scorer reliabilities for key sleep apnea indices have been excellent, with intra-class correlation coefficients (ICCs) exceeding 0.92 for the apnea-hypopnea index (AHI). Sleep staging, assessed with epoch specific comparisons, also demonstrate excellent reliability for stage identification (kappas>0.82). There has been no evidence of significant time trends-between or within scorers- for the AHI variables. We also have evaluated the night-to-night variability of the AHI and other sleep variables in 91 subjects, with each measurement made 1-3 months apart. There is high night to night consistency for the AHI (ICC: 0.80), the arousal index (0.76), and the % sleep time in slow-wave sleep (0.73). We have demonstrated the comparability of the apnea estimates (AHI) determined from limited channel studies obtained at in-home settings with in full in-laboratory polysomnography. In addition to our published validation study, we more recently compared the AHI in 169 Cleveland Family Study participants undergoing both assessments (in-home and in-laboratory) within one week apart. These showed excellent levels of agreement (ICC=0.83), demonstrating the feasibility of examining data from either in-home or in-laboratory studies for apnea phenotyping. Data collected in the GCRC were obtained, when possible, with comparable, if not identical techniques, as were the same measures collected at prior exams performed in the participants' homes. To address the comparability of data collected over different exams, we calculated the crude age-adjusted correlations ~3 year within individual correlations between measures made in the most recent GCRC exam with measures made in a prior exam and demonstrated excellent levels of agreement for BMI (r=.91); waist circumference (0.91); FVC (0.88); and FEV1 (0.86). As expected due to higher biological and measurement variability, 149 somewhat lower 3-year correlations were demonstrated for SBP (0.56); Diastolic BP (0.48); AHI (0.62); and nocturnal oxygen desaturation (0.60). NHLBI Candidate-gene Association Resource. The NHLBI initiated the Candidate gene Association Resource (CARe) to create a shared genotype/phenotype resource for analyses of the association of genotypes with phenotypes relevant to the mission of the NHLBI. The resource comprises nine cohort studies funded by the NHLBI: Atherosclerosis Risk in Communities (ARIC), Cardiovascular Health Study (CHS), Cleveland Family Study (CFS), Coronary Artery Risk Development in Young Adults (CARDIA), Cooperative Study of Sickle Cell Disease (CSSCD), Framingham Heart Study (FHS), Jackson Heart Study (JHS), Multi-Ethnic Study of Atherosclerosis (MESA), and the Sleep Heart Health Study (SHHS). A database of genotype and phenotype data will be created that includes records for approximately 50,000 study participants with approximately 50,000 SNPs from more than 1,200 selected candidate genes. In addition, a genome wide association study using a 1,000K SNP Chip will be conducted on approximately 9,500 African American participants drawn from the 50,000 participants in the nine cohorts. Some relevant CARe publications CARe Study: PMID 20400780 CVD Chip Design: PMID 18974833
Data Access Committee to manage external requests. Project: RNA-seq data from soft tissue biopsies harvested from dental implant sites with peri-implantitis
Genomics to select patients with metastatic breast cancer for targeted therapy (microarray_oncoscan)
This dataset contains whole exome sequence of six HCC patients from Qidong China who are very likely exposed to aflatoxin.
The aim is to find rare variants of intermediate penetrance in those at risk of Crohn's disease
This dataset consists on 22 samples linked to 22 bam files from whole genome and whole exome sequencing of Esthioneuroblastomas.
Rmarkdown code, PDF, and Rdata file to recapitulate the paper's primary figures and machine learning model development.
This study is a "first-in-human" phase 1 trial of recurrent, IDH wild-type glioblastoma (rGBM) patients treated with CAN-3110 (aka rQNestin34.5v.2), a genetically modified oncolytic herpes simplex virus 1 (HSV-1). The study was a 3+3 dose-escalation study starting at a single intra-tumoral injection of 106 PFU CAN-3110 going up to 1010 PFU in half-log increments (cohorts 1-9). An expansion cohort (cohort 10) was treated with 109 PFU injected into up to 5 sites within a single tumor. Principal findings of the study conclude that:CAN-3110 was safe at all tested doses (no dose-limiting toxicities reached)HSV-1 seropositive patients survived significantly longer than HSV-1 seronegative patients in this study - longer than expected based on historical rGBM controlsHSV-1 seropositive patients were significantly more likely to clear CAN-3110 (HSV-1) antigen from their tumors based on immunohistochemical stainingCAN-3110 treatment-induced CD8+/CD4+ T-cell recruitment to injected tumor sites altered TCR beta diversity in both tumor infiltrating leukocytes (TILs) and PBMCs, and initiated changes in specific TCR beta clonotypes and immune gene signatures which were associated with post-treatment survival.Data provided in dbGaP for this study include:Bulk RNA-Seq data for 13 pre/post-treatment pairs of rGBM tumors corresponding to 12 unique patients (patient 28 was treated with CAN-3110 at two different timepoints). This data is provided as raw sequencing reads (.fastq), kallisto pseudoalignment transcript-level read counts (abundance.tsv files), gene-level abundance matrix (Gene_Level_Expression_Matrix.tsv), and immune gene signature scores matrix (Immune_Gene_Signatures_Matrix.tsv). Sequencing was performed as paired 150bp sequencing on NovaSeq6000 S4 Illumina flow cells. Library preparation varied by batch as described in the included RNAseq_Metadata.xlsx table.TCR beta DNA sequencing performed by Adaptive Biotechnologies for 21 pre/post-treatment pairs of rGBM tumors and PBMCs representing 20 unique patients (patient 28 was treated with CAN-3110 at two different timepoints). Note that all tumor pairs with TCR beta sequencing data also have available bulk RNA-Seq data. The TCR beta sequencing data is provided as an excel spreadsheet (TCRbeta_Sample_Overview.xslx), which provides a broad level summary of the sequencing on a per-sample basis (# of TCRbeta templates, T cell fraction, TCR beta diversity metrics, etc.) and as a large tab separated value file (Full_TCRbeta_Rearrangement_Details.tsv), which provides information per sample at the level of each unique TCR beta sequence (amino/nucleic acid sequence, VDJ gene usage, etc.).An excel table of de-identified metadata for the patients involved in this study.
DNA samples were obtained from participants of the Geisinger MyCode biobank. Phenotype data to determine case or control status for abdominal aortic aneurysm or extreme obesity were derived from Geisinger electronic medical record data through an enterprise data warehouse. Samples were genotyped using the Illumina Human CoreExome-12 v1.0 Array. Genotype data were filtered using standard quality control measures.