WGS dataset of Hepatoblastoma tumoroid biobank with WGS of tumor and matched tumoroids
Although cancer-associated fibroblast (CAF) heterogeneity is well-established, the impact of chemotherapy on CAF populations remains poorly understood. We address this question in high-grade serous ovarian cancer (HGSOC), in which we previously identified 4 CAF populations. While the global content in stroma increases in HGSOC after chemotherapy, the proportion of FAP+ CAF (also called CAF-S1) decreases. Still, maintenance of high residual CAF-S1 content after chemotherapy is associated with reduced CD8+ T lymphocyte density and poor patient prognosis, emphasizing the importance of CAF-S1 reduction upon treatment. Single cell analysis, spatial transcriptomics and immunohistochemistry reveal that the content in the ECM-producing ANTXR1+ CAF-S1 cluster (ECM-myCAF) is the most affected by chemotherapy. Moreover, functional assays demonstrate that ECM-myCAF isolated from HGSOC reduce CD8+ T-cell cytotoxicity through a Yes Associated Protein 1 (YAP1)-dependent mechanism. Thus, efficient inhibition after treatment of YAP1-signaling pathway in the ECM-myCAF cluster is required to enhance CD8+ T-cell cytotoxicity. Altogether, these data pave the way for novel therapy targeting YAP1 in ECM-myCAF in HGSOC.
Epithelial ovarian cancer is a highly fatal malignancy with known genetic etiology; however, it is estimated that a substantial portion of narrow sense heritability remains to be discovered. The goal of this research was to use genetic information from distantly related ovarian cancer cases to identify regions of the genome that may harbor rare risk variants for epithelial ovarian cancer. The study was conducted in the setting of the Utah Population Database, a statewide data resource in which extensive ancestry data are linked to Utah Cancer Registry records. Focusing on families with a statistically significant excess risk of epithelial ovarian cancer, we used the Infinium Global Screening Array v.3.0 to generate germline genotyping data for distantly related cases. We then applied Shared Genomic Segment Analysis to each set of related cases to identify genomic regions that may harbor ovarian cancer risk variants. Eleven regions were identified. Genotype data are available through dbGaP for one case from each pedigree. Genotype data from other ovarian cancer cases within the high-risk pedigrees as well as the pedigrees' corresponding structures can be requested and made available with approval from the Utah Resource for Genetic and Epidemiologic Research (RGE).
Single cell transcriptomics study of tissue from human kidney transplant nephrectomies Tissue is acquired from kidney transplant nephrectomies, homogenised and disaggregated to a single cell suspension prior to droplet encapsulation single cell RNAseq (10X protocol) or plate based single cell RNAseq (SS2 protocol). Human tissues, including the kidney are seeded with a tissue resident immune cell network throughout life. These cells may self renew in situ or be replaced by circulating precursors which enter the tissue in response to tissue specific niche and inflammatory signals. We are using single cell RNA sequencing of kidney transplant nephrectomy tissue to 1) understand the immune cell populations which are resident within the human kidney 2) identify which of these populations is donor or recipient derived. 3) Uncover the temporal dynamics of donor/recipient chimerism in the the human kidney following transplantation. 4) Understand transcriptional differences between organs removed for a variety of reasons (infection, chronic rejection, vascular emergencies) This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Most known genetic variation in human genomes has been called from comparison of short reads to the reference genome, an approach biased against finding complex variation. We sequenced 150 individuals from 50 parent-offspring trios with multiple insert-size libraries to very high coverage. We show that each genome could be independently de novo assembled into a small number of high-quality scaffolds (median N50 > 21 Mb), each of quality comparable to long read assemblies while being very cost-effective. We show that our variant call set from comparing de novo assemblies is far more complete in terms of complex variation than previous studies. Importantly, even the complex 4-5 Mb extended MHC region was assembled and resolved into haplotypes, revealing >700kb novel sequence in this important region of the genome, and major parts of the Y chromosome including some palindromes were assembled with high accuracy. Finally, we show that our variant call-set allows for the genotyping of many more complex variants when used as a reference-panel for imputation into SNP-chip data or into previously resequenced genomes.
To elucidate the timing and mechanism of the clonal expansion of somatic mutations in cancer-associated genes in the normal endometrium, we conducted target sequencing of 112 genes for 1,298 endometrial glands and matched blood samples from 36 women. By collecting endometrial glands from different parts of the endometrium, we showed that multiple glands with the same somatic mutations occupied substantial areas of the endometrium. The 112 genes are as follows: ABCC1, ACRC, ANK3, ARHGAP35, ARID1A, ARID5B, ATCAY, ATM, ATR, BARD1, BCOR, BRCA1, BRCA2, BRD4, BRIP1, CAMTA1, CDC23, CDYL, CFAP54, CHD4, CHEK1, CHEK2, CTCF, CTNNB1, CUX1, DGKA, DISP2, DYNC2H1, EMSY, FAAP24, FAM135B, FAM175A, FAM65C, FANCA, FANCB, FANCC, FANCD2, FANCE, FANCF, FANCG, FANCI, FANCL, FANCM, FAT1, FAT3, FBN2, FBXW7, FGFR2, FRG1, GPR50, HEATR1, HIST1H4B, HNRNPCL1, HOOK3, KIAA1109, KIF26A, KMT2B, KMT2C, KRAS, LAMA2, LRP1B, MLH1, MON2, MRE11A, MSH2, MSH6, MTOR, NBN, PALB2, PHEX, PIK3CA, PIK3R1, PLXNB2, PLXND1, PMS2, POLE, POLR3B, PPP2R1A, PTEN, PTPN13, RAD50, RAD51, RAD51B, RAD51C, RAD51D, RAD52, RAD54B, RAD54L, RICTOR, SACS, SIGLEC9, SLC19A1, SLX4, SPEG, STT3A, TAF1, TAF2, TAS2R31, TFAP2C, TNC, TONSL, TP53, TTC6, UBA7, VNN1, WT1, XIRP2, ZBED6, ZC3H13, ZFHX3, ZFHX4, ZMYM4.
This dataset consists of 116 tumor and normal samples analyzed with whole exome sequencing on the HiSeq2500 instruments with 100bp paired-end reads as well as 760 tumor and normal samples analyzed with the PGDx elio tissue complete assay. The PGDx elio tissue complete assay is a hybrid capture approach targeting 500+ genes with sequencing on the NextSeq instruments with 150bp paired-end reads. The bam files provided have been adapter masked and contain duplicate reads.
Ionizing radiation is an effective therapeutic agent for cancer treatment as well as a potent carcinogen. Sensitivity to the cell-killing effects of radiation can vary across human population with a subset of individuals displaying extreme hypersensitivity. It is usually attributable to inherited defects in DNA damage response pathways. The present study was designed to elucidate the genetic basis of variation in hypersensitivity to radiation exposure through exome sequencing of radiosensitive individuals, with the ultimate goal of identifying genes with the most significant effects on cellular DNA damage responses. The study participants included subjects referred for clinical testing for Ataxia-telangiectasia (A-T), Nijmegen Breakage Syndrome (NBS) or Ligase IV Syndrome. These are rare, recessive genetic disorders and hypersensitivity to radiation exposure is a common phenotype among individuals affected by all the three disorders. The study participants exhibited phenotypic characteristics similar to individuals with A-T, NBS or Ligase IV Syndrome, but lacked the causative mutations in ATM (GeneID:472) or NBN (GeneID:4683) genes. For further validation of the radiation sensitivity phenotype among the enrolled subjects, B-lymphoblastoid cells lines were established for each subject from peripheral blood lymphocytes. Each cell line was evaluated for displaying impaired survival rates relative to normal controls after exposure to ionizing radiation. 53 subjects with validated phenotype were finally included in the study and DNA extracted from their B-lymphoblastoid cell lines was used for exome sequencing. This sequencing data for radiation sensitive subjects is being made available in the dbGaP. It is hoped that this resource will be beneficial for researchers who wish to further investigate components of human cellular DNA damage response pathways and/or genetic architecture underlying radiation hypersensitivity. This data may also aid in the rational design of new radiosensitizing or radioprotective agents.
DM1 is characterized by dysregulation of alternative mRNA splicing, to an quantitative as well as qualitative extend that is heterogenous between clinically similar patients. Hundreds of transcripts have been identified that can be misspliced in DM1, and for many of these it is still unclear whether they generate stable proteins in DM1 patients. We generated long-read transcriptome data of DM1 patients whole blood after hemoglobin depletion. Each sample is a mix of 5 patients of varying age.
The data come from 9 studies participating in the Endometrial Cancer Association Consortium (ECAC). ECAC data included in this dbGaP submission include OncoArray data from 4486 endometrial cancer cases genotyped at the Center for Inherited Disease Research (CIDR). Samples were genotyped using the Illumina OncoArray beadchip 500K SNP custom array. Details of the genotyping process and sample selection are included in O'Mara et al, Identification of nine new susceptibility loci for endometrial cancer. Nature Communications (PMID:30093612).