The dataset includes FASTq files and normalised gene expression counts from bulk RNA-sequencing of pre-treatment tumour (n=39) and adjacent non-tumour tissue samples (n=13) from advanced hepatocellular carcinoma (HCC) patients treated with atezolizumab plus bevacizumab. Specifically, RNA was extracted from the FFPE tumour tissue using RNeasy Plus Mini Kit (Qiagen), according to the manufacturer’s instructions. RNA quality and quantity were evaluated using Agilent RNA 600 Pico Chips (Agilent Technologies). Sequencing libraries were prepared using the SMARTer Stranded Total RNA-seq kit v3 (Takara Bio). Sequencing was performed on an Illumina NovaSeq 6000 system. After obtaining the Fastq files, read 2 UMIs and adapters were trimmed using seqtk (v1.2), reads were aligned using STAR (v2.7.7a) to the human reference genome (GRCh38) and quantified with HTSeq (v2.0.2).
Males and females show dramatic differences in their vulnerability to the same diseases. For example, compared to men, lupus is six times more prevalent, thyroid cancer is three times more prevalent, and unipolar depression is twice as prevalent in women. Diseases with a strong male bias include autism (5:1), dilated cardiomyopathy (3:1), and ankylosing spondylitis (5:1). Historically, such differences have been attributed solely to extrinsic factors such as circulating sex hormones or environmental influences. We hypothesized that intrinsic factors - genetic differences between XX and XY cells - have unappreciated biological consequences throughout the body and contribute to sex differences in disease incidence and severity. This hypothesis stems from our long-term effort to sequence the sex chromosomes of diverse mammalian species, which has identified a set of homologous genes on the X and Y chromosomes that are dosage-sensitive, expressed throughout the body, and encode regulators of chromatin modification, transcription, translation, and protein stability. These X- and Y-encoded genes differ in sequence and expression pattern, which likely manifests in genome-wide differences in gene regulation between XX and XY cells and influences all aspects of human biology, including sex differences in disease susceptibility. These hard-wired molecular sex differences have been largely overlooked and understudied, representing a significant gap in our knowledge of human biology.The gene expression study of individuals with sex chromosome aneuploidies takes advantage of natural human variation in sex chromosome number, i.e. sex chromosome aneuploidy, to investigate alterations in genome-wide gene expression that correlate with changes in X- and Y-chromosome dosage. We analyzed samples from 114 individuals with a variety of sex chromosome aneuploidies, including 45,X; 47,XXY; 47,XYY; 47,XXX; 48,XXYY; and 49,XXXXY. We generated lymphoblastoid cell lines (LCLs) from blood samples and, in some cases, fibroblast cultures from skin biopsies. We supplemented our collection with previously-derived cell lines. To evaluate gene expression, we performed deep profiling of the transcriptome (RNA-seq) from these LCLs and fibroblasts. We performed parallel analyses on samples collected from 62 control 46,XX and 46,XY individuals, 6 individuals with trisomy 21, and 14 individuals with structural variations of the X and Y chromosomes. In addition, we performed CRISPRi knockdowns on 3 of the 46,XX and 3 of the 46,XY fibroblast samples for the homologous transcription factors ZFX and ZFY, encoded on the X and Y chromosomes, respectively.In the April 2024 update, we added RNA-seq datasets derived from isolated CD4+ T cells and monocytes from 76 and 72 adults, respectively, with the following sex chromosome constitutions: 45,X; 46,XX; 46,XY; 47,XXX; 47,XXY; 47,XYY. These individuals are largely a subset of the same cohort described above. In addition, we performed RNA-seq on in-vitro stimulated CD4+ T cells with the following sex chromosome constitutions: 45,X; 46,XX; 46,XY; 47,XXY.In the August 2024 update, we added RNA-seq datasets generated from the following: 1) LCLs derived from individuals with AZFa deletions of the Y chromosome, 2) DDX3X and DDX3Y knockdown (via CRISPRi) in XY fibroblasts, and 3) 5-ethyl uridine (5-EU) treatment in XY and XYYYY LCLs.
To study the one of the mechanisms through which obesity could affect placental cells, we used a novel microfluidic organs-on-a-chip device fabricated for co-culturing adipose spheroids and trophoblast organoids. The organoids were derived from the term placenta of a over-weight women. The adipose spheorids were derived from the white adipose tissue of a woman. And we sequenced the RNAs from the trophoblast organoids. The RNA extracted were subjective to Prime-seq library preparation protocol and being sequenced on Illumina NovaSeq 6000 at Novogene.
In other analysis in the current manuscript, we find a similar gene signature (to dissociation based artifacts in mouse and human tissue) is present in post-mortem microglia and astrocytes, across all snRNA-seq datasets analyzed, although it is highly variable between subjects. ​ Using acutely-resected neurosurgical tissue, we performed single-nucleus RNA-seq and reveal that a similar signature can be detected in microglia following prolonged exposure to room temperature. Tissue handling and methods details, as well as sequencing and analysis details) can be found in the methods section of related manuscript (Marsh et al., 2022). Together, these results suggest that the presence of this signature in post-mortem brain samples may be the result of a combination of acute pre-mortem (agonal state, cause of death, comorbidities, etc.) and post-mortem (post-mortem interval (PMI), storage time, RNA quality, etc.) variables and may not represent normally present cell state.
3' mRNA-Seq obtained from distinct isolated cell types (epithelia cells,immune cells, fibroblasts) of endoscopically obtained esophageal adenocarcinoma tissue as well as normal esophageal mucosa. Libraries for RNA-sequencing were prepared using the QuantSeq 3' mRNA-Seq Library Prep Kit FWD for Illumina according to the low input protocol. Libraries were sequenced on a HiSeq 4000 (Illumina) by 1x 50 bases.
MeD-seq data (fastq files) from gynecological cancers and associated healthy tissues. In total 292 fastq files generated by MeD-seq are deposited consisting of: healthy tissues (vulva n=11, cervix n=15, endometrium n=13, fallopian tube n=18 and ovary n=13), precursor lesions of cancer (vulva n=23 and cervix n=46) and cancer (vulva n=21, cervix n=45, endometrium n=26, fallopian tube n=8 and ovary n=33)
This dataset contains comprises enzymatic methylation sequencing (EM-Seq) data generated from 208 rectal mucus samples from patients suspected to have colorectal cancer and healthy controls. Targeted CpG sites were enriched using a panel of genes implicated in colorectal cancer, and libraries were prepared with NEBNext EM-Seq prior to Illumina sequencing. Sequencing was performed on the Illumina NovaSeq6000 platform, producing paired-end FASTQ files. These data support analyses of DNA methylation patterns in CRC detection and progression