Cell-free DNA cleavages analysis (human)
TBD
Datasets Galaxy 929/938 describe the amplified single chromosome sequencing data.
COGA is a family study of alcoholism, in which the subjects have been drawn from the Collaborative Study on the Genetics of Alcoholism (COGA), a large, ongoing family-based study that includes subjects from seven sites around the US. COGA has gathered detailed, standardized data on study participants, including diagnostic and neurophysiological assessments. This project has already proved successful in identifying several genes that influence the risk for alcoholism and neurophysiological endophenotypes, which have been independently replicated. COGA data were included as part of two Genetic Analysis Workshops, and the phenotypes are familiar to the genetics community. Alcoholic probands were recruited from treatment facilities, assessed by personal interview, and after securing permission, other family members were also assessed. A set of comparison families was drawn from the same communities as the families recruited through an alcoholic proband. Assessment involved a detailed personal interview developed for this project, the Semi-Structured Assessment for the Genetics of Alcoholism (SSAGA), which gathers detailed information on alcoholism related symptoms along with other drugs and psychiatric symptoms. Many participants also came to the laboratories for electroencephalographic studies. Neurophysiological features that have been shown to be useful endophenotypes for which we have linkage and in some cases association results, are included for a subset of the case-control sample: the beta power of the resting electroencephalogram (EEG), the P3(00) amplitude of the visual event-related potential (ERP), and the theta and delta event-related oscillations (EROs) underlying the P3. As part of COGA, a set of informative families was selected to have Genome-Wide Association data obtained within families. Genotyping was performed using the Illumina Human OmniExpress array 12.VI to genotype 2,282 subjects selected from 118 densely affected families. Genotyping was performed at the Genome Technology Access Center at Washington University School of Medicine in St. Louis. In addition, we also included genotypes for subjects (n=275 subjects) from these 118 families who were genotyped in a previous case-control GWAS using the Illumina 1M array. For quality control purposes, 51 of the 275 subjects were genotyped again on the Illumina Human OmniExpress array at the Washington University School of Medicine core facility.In addition, exome sequencing data on a subset of individuals with GWAS were added in version 2 (v2). For v2, a subset had 30X Whole Genome Sequencing (WGS) as part of the NIDA Sequencing Initiative. The subset contained two distinct sets: Sibling pairs where one sibling had at least two dependence diagnoses in the set (alcohol, cannabis, cocaine, and opioid), and the other had none, and non-related Case-Control pairs matched for age and ethnicity where the cases had alcohol and at least 2 other dependence diagnoses and controls had none. After sequencing, some sibling pairs are re-classified as half siblings. Three VCF files (small variants, structural variants, and copy number variations) are provided. Additional substance use variables are made available in v2. We note that the full sample data are deposited in four dbGaP submissions and the sequenced samples are split across all four: CIDR: Collaborative Study on the Genetics of Alcoholism Case Control Study [phs000125]. GWAS data on cases (primarily probands) and controls drawn from the families. Families with highest density of alcohol dependence and/or extreme event-related oscillation data [phs000763]. GWAS data on 119 extended families of European descent are available here, along with extensive documentation. Study on the Genetics of Alcoholism (COGA): African American Family GWAS [phs000976]. GWAS data on all available COGA families of African descent are available. COGA: Smokescreen GWAS [phs001208]. GWAS data on all remaining COGA DNA samples, primarily of other racial background, were genotyped on the Smoke Screen array. A listing of all sequenced pairs is provided in the documentation to facilitate the merging of these samples.
This dataset represents two combined study populations. Serrated Colorectal Cancer: An Emerging Disease Subtype (called the Advanced Colorectal Cancer of Serrated Subtype Study or ACCESS Study) was a grant awarded to investigate a newly-recognized, biologically-distinct subtype of colorectal cancer (CRC) called “serrated CRC.” The objective of this project was to characterize factors related to the genetic predisposition, clinical presentation, and prognosis of serrated CRC. The study recruited incident invasive CRC cases diagnosed between April 2016 and December 2018, aged 20-74 years at diagnosis. Cases were identified through the Surveillance, Epidemiology and End Results (SEER) cancer registry serving 13 counties in western Washington State. Eligibility for all individuals was limited to those who were English-speaking and could consent. Participation included completing a baseline epidemiologic questionnaire shortly after diagnosis, optional donation of a saliva sample for genetic analysis, and optional consent to release of medical records and tissue specimens related to their diagnosis. Tumor specimens were tested for serrated CRC-defining molecular characteristics. Further, we have vital status on all participants and cause of death on those that have died since enrollment. Hormones and Colon Cancer: Epigenetic Subtypes, Risks, and Survival (called the Post-Menopausal Hormones Study or PMH Study) was a grant awarded to investigate the impact of post-menopausal hormone use on colon cancer risk, tumor molecular characteristics, and outcomes. Eligible cases were females, newly diagnosed with invasive colorectal adenocarcinoma between October 1998 and February 2002, aged 50 to 74 years. Cases were residents of 10 out of the 13 counties in western Washington State served by the Surveillance, Epidemiology and End Results (SEER) cancer registry. Eligibility for all individuals was limited to those who were English-speaking with available telephone numbers, in which they could be contacted. Unrelated population-based controls were randomly selected according to age distribution (in 5-year age intervals) of the eligible cases by using lists of licensed drivers from the Washington State Department of Licensing (for individuals aged 50 to 64 years) and rosters from the Health Care Financing Administration (now the Centers for Medicare and Medicaid, for individuals older than 64 years). Participation included completing a baseline epidemiologic questionnaire, optional donation of a saliva sample for genetic analysis, and (for cases only) optional consent to release of medical records and tissue specimens related to their diagnosis. Tumor specimens were tested for epigenetic and other molecular characteristics. The ACCESS study was supported by funding from the National Cancer Institute of the National Institutes of Health (NCI/NIH) (R01CA196337, PI: Newcomb, PA), as was the PMH Study (R01CA076366, PI: Newcomb, PA). Additional support for the PMH Study came from the Seattle site of the Colon Cancer Family Registry (SCCFR) (U01CA167551, PI: Jenkins, M, and U01/U24CA074794, PI: Newcomb, PA). Additional support for case ascertainment was provided by the Cancer Surveillance System of the Fred Hutchinson Cancer Center, which is funded by Contract Number HHSN261201300012I; NCI Control Number: N01 PC-2013-00012; Contract Number HHSN261201800004I; and NCI Control Number: N01 PC-2018-00004 from the Surveillance, Epidemiology and End Results (SEER) Program of the National Cancer Institute with additional support from the Fred Hutchinson Cancer Center and the State of Washington. This research was also supported by the Genomics and Bioinformatics, Comparative Medicine, Specialized Pathology, Collaborative Data Services, and Experimental Histopathology Shared Resources of the Fred Hutch/University of Washington Cancer Consortium (P30 CA015704).Tumor marker testing was performed using formalin-fixed paraffin-embedded diagnostic tumor tissue specimens, and DNA extracted from those specimens. Testing for microsatellite instability (MSI) was based on either a 10-gene panel (BAT25, BAT26, BAT40, MYCL, D5S346, D17S250, ACTC, D18S55, D10S197, BAT34C4) or a 4-marker immunohistochemistry panel of DNA mismatch repair proteins (MLH1, MSH2, MSH6, PMS2). CpG island methylator phenotype (CIMP) testing was based on a validated quantitative DNA methylation assay using a five-gene panel (CACNA1G, IGF2, NEUROG1, RUNX3, SOCS1) or eight-gene panel (CACNA1G, IGF2, NEUROG1, RUNX3, SOCS1, MLH1, CRABP1, CDKN2A). Somatic p.V600E BRAF mutation status was tested for using a fluorescent allele-specific PCR assay. KRAS mutations in codons 12 and 13 were also assessed through forward and reverse sequencing of amplified tumor DNA. DNA was extracted from blood/saliva samples using conventional methods. The genotyping panel completed was the Build37 OncoArray500K-C, including 1%-6% blinded duplicates to monitor the quality of the genotyping. Quality control procedures were performed to 1) make sure that there were no patterns of missing data by batch, study, or plate, 2) check for gender discrepancies and kinship, 3) complete Principal Component Analysis, and 4) test for Hardy-Weinberg equilibrium (HWE). Samples were excluded based on call rate, heterozygosity, unexpected duplicates, gender discrepancy, and unexpectedly high identity-by-descent or unexpected genotypic concordance (>65%) with another individual. In addition, variants were excluded based on call rate (98%), lack of HWE in controls (P
This study includes an open-label, phase 2 study to determine the activity of the anti-VEGF receptor tyrosine-kinase inhibitor, pazopanib, combined with the anti-PD-L1 immune checkpoint inhibitor, durvalumab, in unselected advanced sarcomas. We conducted whole exome and transcriptomic sequencing with pre-treatment tissue biopsy to correlate clinical outcomes with molecular and genomic biomarkers to identify patients who would most likely benefit from the combination treatment.
The Centers for Mendelian Genomics project uses next-generation sequencing and computational approaches to discover the genes and variants that underlie Mendelian conditions. By discovering genes that cause Mendelian conditions, we will expand our understanding of their biology to facilitate diagnosis and new treatments.
The impact of MSC(UC) on peripheral B cells from Systemic Lupus Erythematosus (SLE) patients was studied by 10X scRNAseq. This scRNAseq study encompassed 3 SLE patients at 3 time points: before or after (1 month, and 3 months) MSC injection in order to analyze B cell subsets and their DEG. The aim of this study was to observe the potential changes of B cell subsets after MSC(UC) injection in SLE patients.
The aim was to molecularly characterize the human ventral striatum in CUD. For this, we generated epigenomic, transcriptomic, and proteomic datasets at the bulk-level, performed integrative multi-omics analyses and further analyzed CUD-associated transcriptomic changes at a cell type-specific level using snRNA-seq.