SDH deficient renal cell carcinomas are a rare and recently defined subtype of kidney cancer, often associated with an inherited mutation in one of the SDH gene subunits. This dataset sought to understand the genomic events that underpin tumour formation, from putative cell of origin, characterisation of the tumour microenvironment, to the genomic evolution of these rare tumours. We performed whole genome and RNA sequencing of 4 patients with SDH deficient renal cell carcinomas, including one patient who had an additional paraganglioma. An addition patient in this cohort had the initial diagnosis revised to a clear cell renal cell carcinoma.
The IYDP dataset includes BAM files of 126 Y chromosomes extracted from whole genome sequences. These are from individuals from a broad range of Indonesian islands - communities close to mainland Asia through to New Guinea. The original whole genome sequencing libraries were prepared using TruSeq DNA PCR-Free and TruSeq Nano DNA HT kits depending on DNA quantity. 150 bp paired-end sequencing was performed on the Illumina HiSeq X sequencer. Individuals were sequenced to expected mean depth of 30x, with an achieved median depth of raw reads across samples of 43x.
This dataset included 19 paired diagnostic and remission samples with high hyperdiploid acute lymphoblastic leukemia (ALL) that were collected from four different cohorts: the Division of Clinical Genetics, Lund University, Sweden. All samples were subjected to whole genome sequencing using the Illumina HiSeqX platform. Paired-end sequencing (2x150bp) was done to ~60x coverage for diagnostic samples and ~30x coverage for remission samples. The paired-end reads were aligned to the human reference genome GRCh37 (ftp://ftp.ncbi.nlm.nih.gov/genomes/refseq/vertebrate_mammalian/Homo_sapiens/all_assembly_versions/GCF_000001405.25_GRCh37.p13/GCF_000001405.25_GRCh37.p13_genomic.fna.gz) by the Burrows-Wheeler Aligner tool (version 0.7.17). Duplicate reads marking and local realignment were performed by GATK (version 4.0.11.0).
This work aims to identify homogeneous and robust molecular subtypes in HCC based on a large, homogenous and well-annotated cohort. We performed whole genome/exome sequencing (WGS/WES) on 529 HCC from 461 patients collected mainly in France. Based on the genomic data, we identified 9 robust HCC subtypes primarily driven by key mutations. We further characterized these subtypes using genomic and clinicopathological features. Among the 9 subtypes, 5 belonged to chromosome instable tumors, while 3 belonged to chromosome stable tumors. Our subtypes were associated with prognosis and showed distinct distributions across features like etiology and gender. This study offers a comprehensive molecular classification of HCC, enhancing our understanding of hepatocarcinogenesis and supporting the development of genome-based precision medicine for liver cancer.
Knowledge about abnormal organ development is important to understand pathology and to develop novel treatment approaches for individuals with congenital and acquired disease. Most of our current understanding is based on examination of tissues from the embryo and early foetus, collected from women undergoing termination of pregnancy in the first trimester (third) of pregnancy. There is very little known about normal and abnormal organ development from a developmental perspective during the crucial last two-thirds of pregnancy when much remodelling of foetal tissues occurs. This study will generate a single-cell atlas of late-foetal lungs, blood, heart, bone and immune organs. . This dataset contains all the data available for this study on 2025-10-14.
This dataset contains Visium spatial transcriptomics data for the PCA Atlas study EGAS00001008332. It includes: - Raw Visium FASTQ runs (EGAR00004172272–EGAR00004172283) for all libraries. - spaceranger alignment BAM analyses for those libraries where BAM output was generated. - visium.csv: a slide-level mapping table linking Visium slide samples to EGA sample accessions, sequencing runs/experiments and spaceranger BAM analyses. - visium_tma.csv: a TMA mapping table describing how mini-TMA slides relate slide-level Visium libraries to tissue-level samples/cores. Together these objects provide the raw and aligned Visium data and the technical/biological mappings needed to understand and reuse the spatial transcriptomics component of the PCA Atlas. High resolution histology images are available from the following Zenodo repository, along with additional spaceranger outs for each slide: - https://zenodo.org/records/17411292
Sequence reads were obtained from a set of 920 unrelated individuals from the CIRdb using Illumina paired-end reads, at the Institute of Technology and Renewable Energy (ITER). Briefly, we used bcl2fastq v2.18 to perform sample demultiplexing and BWA-MEM v0.7.15 (https://github.com/lh3/bwa) to align reads to GRCh37/hg19 reference. Resulting BAM files were assessed with SAMtools v1.3 (http://www.htslib.org) and Picard v2.10.10 (https://broadinstitute.github.io/picard/) for quality control steps. Small insertions/deletions (< 50 bp) and single nucleotide variants (SNVs) were identified using an in-house bioinformatics pipeline based on GATK HaplotypeCaller v3.8 (https://gatk.broadinstitute.org/hc/en-us/articles/360037225632-HaplotypeCaller). This pipeline follows the Best Practices recommendations for germline variant calling and its description is publicly available (https://github.com/genomicsITER/benchmarking/tree/master/WES).
The cohort used in this study includes 36 African American (17 female, 19 male) and 36 White American (19 female, 17 male) individuals. The total cohort has a median age of 32 years, with a range of 20-64 years. The African American cohort has a median age of 34, with a range of 21-52 years. The White American cohort has a median age of 31, with a range of 20-64 years. Primary dermal fibroblast lines were derived from skin biopsies obtained from adult individuals at the NIEHS under institutional review board approved protocol human subjects 10-E-0063, "Sample Collection Registry for Quality Control of Biological and Environmental Specimens and Assay Development and Testing protocol" (ClinicalTrials.gov #NCT01087307). All participants gave written informed consent for tissue donation. Donor sex, age, and ancestry were voluntarily self-reported. Fibroblast cells were reprogrammed via lentiviral transduction using six transcription factors contained in three plasmids (ADDGENE/PSIN4-EF2-N2L, ADDGENE/PSIN4-EF2-O2S and ADDGENE/PSIN4-CMV-K2M). Reprogramming efficiency was determined by alkaline phosphatase staining of triplicate 10cm reprogrammed dishes containing colonies, which were then scanned. The saved images were analyzed with ImageJ 1.51h (Wayne Rasband, National Institutes of Health, USA) to count colonies, using color threshold adjusted and binary converted images of each dish. Triplicate plates were averaged and reported as colony counts or percent reprogramming efficiency ((# of colonies/250,000) x100). The African American cohort had reprogramming efficiencies ranging from 0.06-1.37%, with a median of 0.655%. The White American cohort had reprogramming efficiencies ranging from 0.02-1.13%, with a median of 0.455%. Our goal was to define transcriptomic heterogeneity that could be contributing to differences in reprogramming efficiency between individuals and between groups. Total RNA was obtained from dermal fibroblasts and matched iPSCs. For each sample, 500 ng total RNA was used as input for preparation of whole transcriptome rRNA depleted libraries. An adapter-ligated library was prepared with the KAPA HyperPrep Kit (KAPA Biosystems, Wilmington, MA) using Bioo Scientific NEXTflex™ DNA Barcoded Adapters (Bioo Scientific, Austin, TX, USA) according to KAPA-provided protocol. Sequencing was performed using an Illumina HiSeq 2500 following Illumina-provided protocols for 2x150 bp paired-end sequencing. Each transcriptome was sequenced to a target depth of 125 million reads. The following mean raw reads were obtained: African American dermal fibroblasts= 129,571,450; White American dermal fibroblasts= 131,939,505; African American iPSCs= 132,501,335; White American iPSCs= 134,394,164. Raw reads were aligned to hg19 using the STAR alignment tool (https://github.com/alexdobin/STAR). The following mean aligned reads were obtained: African American dermal fibroblasts= 123,315,343; White American dermal fibroblasts= 125,178,035; African American iPSCs= 123,148,312; White American iPSCs= 123,886,992. Reprinted from L. C. Mackey et al., Epigenetic Enzymes, Age, and Ancestry Regulate the Efficiency of Human iPSC Reprogramming. Stem Cells 36, 1697-1708 (2018), with permission from Wiley. Reprinted from L. S. Bisogno et al., Ancestry-dependent gene expression correlates with reprogramming to pluripotency and multiple dynamic biological processes. Science Advances 6 (47) (2020) (PMID: 33219026), with permission from AAAS.