The overall goal for this study was to determine the genetic factors associated with extreme phenotypes of subclinical atherosclerosis (protective and deleterious). Study participants were selected from the Northern Manhattan Study (NOMAS), a population based cohort study investigating stroke and stroke risk factors in Northern Manhattan. Cases and controls were individuals at the extreme ends of the distribution for carotid intima-media thickness and carotid plaque, with cases defined as individuals with subclinical atherosclerosis that could not be explained by traditional vascular risk factors (USAth) and controls defined as individuals with unexplained protection against atherosclerosis (UPAth).
We aim to use whole-genome medical sequencing (WGMS) to discover causative molecular lesions for a set of rare, severe phenotypes hypothesized to be caused by either somatic mutations, germline de nova heterozygous mutations, germline inherited recessive, or germline inherited dominant mutations in currently unknown or uncharacterized genes. The goal of this research is threefold: to identify causative sequence variants for disorders whose molecular etiology was previously unknown, to apply this insight to both the rare disorders under study and more common phenotypes, and to enhance the study of mutation on a genome-wide level.
Benign prostatic hyperplasia (BPH) entails growth in the central regions of the prostate gland and is common among older men. BPH obstructs urinary outflow, resulting in voiding symptoms for which current treatments that target prostate physiology are only partially effective. A better understanding of BPH may suggest new treatment strategies that target its pathophysiology. The overall goal of the study is to apply next-generation sequencing-based approaches to investigate BPH, to discover new insight into BPH disease processes and new targets for precision therapy, and to determine whether the hyperplasia reflects underlying clonal expansions of prostatic cells.
This study evaluates the use of rectal mucus as a minimally invasive biospecimen for colorectal cancer (CRC) detection through whole-genome shotgun metagenomic sequencing. 408 rectal mucus samples were collected from patients suspected to have colorectal cancer and sequenced. These samples were analyzed to characterize microbial community composition and its association with CRC stage and anatomical site. These data provide insights into tumor-proximal microbiome signatures and demonstrate the potential of rectal mucus sampling for early and accurate CRC diagnosis. This is 1 of 4 sequencing experiments on the same sample type.
The data contains single-cell gene sequencing data (10x Genomics) from FACS-purified CD8 T lymphocytes from two Austrian patients. The cells were stimulated with one MHC class I peptides obtained from a common (wild type) variant and an emerging mutant variant of the SARS-Cov-2 virus. Then the samples were multiplexed using hashtag oligos. We provide the raw and aligned sequence data for: i. The single-cell experiments ii. The PCR-amplified samples for enrichment of the hashtag oligo multiplexing barcodes iii. The PCR-amplified samples for enrichment of the T Cell Receptor (TCR) VDJ region for immuno-profiling. The samples and libraries were processed and obtained in collaboration between St. Anna Children's Cancer Research Institute (CCRI), CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, and the Medical University of Vienna. The cell barcodes and processed data has been submitted to the GEO database with GEO accession GSE166651.
We analyzed 34 AGCTs (19 primary and 15 recurrent) and the KGN cell line by RNA-Seq. Our cohort comprised of 3 AGCTs WT for FOXL2, 28 heterozygous and 3 homo/hemizygous for the pathogenic variant. Fresh-frozen AGCTs were selected from OVCARE’s Gynecological Tissue Bank in Vancouver, Canada for bulk RNA-seq. RNA was extracted from frozen tissue and sections adjacent to the scrolls submitted for RNA-seq were stained with hematoxylin and eosin (H&E) to evaluate tumour cell purity. Cases with >80% tumour cell purity were selected for sequencing with the majority of cases (29 of 34 patients) containing >90% tumour cells. Ribodepleted RNA libraries were constructed and paired-end sequencing (125 base pair reads) was performed.
Chronic lymphocytic leukemia (CLL) is characterized by substantial clinical heterogeneity, despite relatively few genetic alterations. To provide a basis for studying epigenome deregulation in CLL, we established genome-wide chromatin accessibility maps for 88 CLL samples from 55 patients using the ATAC-seq assay, and we also performed ChIPmentation and RNA-seq profiling for ten representative samples. Based on the resulting dataset, we devised and applied a bioinformatic method that links chromatin profiles to clinical annotations. Our analysis identified sample-specific variation on top of a shared core of CLL regulatory regions. IGHV mutation status – which distinguishes the two major subtypes of CLL – was accurately predicted by the chromatin profiles, and gene regulatory networks inferred for IGHV-mutated vs. IGHV-unmutated samples identified characteristic differences between these two disease subtypes. In summary, we discovered widespread heterogeneity in the chromatin landscape of CLL, established a community resource for studying epigenome deregulation in leukemia, and demonstrated the feasibility of chromatin accessibility mapping in cancer cohorts and clinical research.
This dataset represents two combined study populations. Serrated Colorectal Cancer: An Emerging Disease Subtype (called the Advanced Colorectal Cancer of Serrated Subtype Study or ACCESS Study) was a grant awarded to investigate a newly-recognized, biologically-distinct subtype of colorectal cancer (CRC) called “serrated CRC.” The objective of this project was to characterize factors related to the genetic predisposition, clinical presentation, and prognosis of serrated CRC. The study recruited incident invasive CRC cases diagnosed between April 2016 and December 2018, aged 20-74 years at diagnosis. Cases were identified through the Surveillance, Epidemiology and End Results (SEER) cancer registry serving 13 counties in western Washington State. Eligibility for all individuals was limited to those who were English-speaking and could consent. Participation included completing a baseline epidemiologic questionnaire shortly after diagnosis, optional donation of a saliva sample for genetic analysis, and optional consent to release of medical records and tissue specimens related to their diagnosis. Tumor specimens were tested for serrated CRC-defining molecular characteristics. Further, we have vital status on all participants and cause of death on those that have died since enrollment. Hormones and Colon Cancer: Epigenetic Subtypes, Risks, and Survival (called the Post-Menopausal Hormones Study or PMH Study) was a grant awarded to investigate the impact of post-menopausal hormone use on colon cancer risk, tumor molecular characteristics, and outcomes. Eligible cases were females, newly diagnosed with invasive colorectal adenocarcinoma between October 1998 and February 2002, aged 50 to 74 years. Cases were residents of 10 out of the 13 counties in western Washington State served by the Surveillance, Epidemiology and End Results (SEER) cancer registry. Eligibility for all individuals was limited to those who were English-speaking with available telephone numbers, in which they could be contacted. Unrelated population-based controls were randomly selected according to age distribution (in 5-year age intervals) of the eligible cases by using lists of licensed drivers from the Washington State Department of Licensing (for individuals aged 50 to 64 years) and rosters from the Health Care Financing Administration (now the Centers for Medicare and Medicaid, for individuals older than 64 years). Participation included completing a baseline epidemiologic questionnaire, optional donation of a saliva sample for genetic analysis, and (for cases only) optional consent to release of medical records and tissue specimens related to their diagnosis. Tumor specimens were tested for epigenetic and other molecular characteristics. The ACCESS study was supported by funding from the National Cancer Institute of the National Institutes of Health (NCI/NIH) (R01CA196337, PI: Newcomb, PA), as was the PMH Study (R01CA076366, PI: Newcomb, PA). Additional support for the PMH Study came from the Seattle site of the Colon Cancer Family Registry (SCCFR) (U01CA167551, PI: Jenkins, M, and U01/U24CA074794, PI: Newcomb, PA). Additional support for case ascertainment was provided by the Cancer Surveillance System of the Fred Hutchinson Cancer Center, which is funded by Contract Number HHSN261201300012I; NCI Control Number: N01 PC-2013-00012; Contract Number HHSN261201800004I; and NCI Control Number: N01 PC-2018-00004 from the Surveillance, Epidemiology and End Results (SEER) Program of the National Cancer Institute with additional support from the Fred Hutchinson Cancer Center and the State of Washington. This research was also supported by the Genomics and Bioinformatics, Comparative Medicine, Specialized Pathology, Collaborative Data Services, and Experimental Histopathology Shared Resources of the Fred Hutch/University of Washington Cancer Consortium (P30 CA015704).Tumor marker testing was performed using formalin-fixed paraffin-embedded diagnostic tumor tissue specimens, and DNA extracted from those specimens. Testing for microsatellite instability (MSI) was based on either a 10-gene panel (BAT25, BAT26, BAT40, MYCL, D5S346, D17S250, ACTC, D18S55, D10S197, BAT34C4) or a 4-marker immunohistochemistry panel of DNA mismatch repair proteins (MLH1, MSH2, MSH6, PMS2). CpG island methylator phenotype (CIMP) testing was based on a validated quantitative DNA methylation assay using a five-gene panel (CACNA1G, IGF2, NEUROG1, RUNX3, SOCS1) or eight-gene panel (CACNA1G, IGF2, NEUROG1, RUNX3, SOCS1, MLH1, CRABP1, CDKN2A). Somatic p.V600E BRAF mutation status was tested for using a fluorescent allele-specific PCR assay. KRAS mutations in codons 12 and 13 were also assessed through forward and reverse sequencing of amplified tumor DNA. DNA was extracted from blood/saliva samples using conventional methods. The genotyping panel completed was the Build37 OncoArray500K-C, including 1%-6% blinded duplicates to monitor the quality of the genotyping. Quality control procedures were performed to 1) make sure that there were no patterns of missing data by batch, study, or plate, 2) check for gender discrepancies and kinship, 3) complete Principal Component Analysis, and 4) test for Hardy-Weinberg equilibrium (HWE). Samples were excluded based on call rate, heterozygosity, unexpected duplicates, gender discrepancy, and unexpectedly high identity-by-descent or unexpected genotypic concordance (>65%) with another individual. In addition, variants were excluded based on call rate (98%), lack of HWE in controls (P
This is the DAC for the study "Pyjacker identifies enhancer hijacking events in acute myeloid leukemia including MNX1 activation via deletion 7q" of Christoph Plass (c.plass@dkfz.de).
This dataset contains 241 paired FASTQ files for 61 early-onset diabetes patient and 174 controls sequenced with MGI Tech DNBSEQ-T10 and DNBSEQ-T7.