Chronic obstructive pulmonary disease (COPD) is a major respiratory disease characterized by small airway inflammation, emphysema and severe breathing difficulties. Low-grade systemic inflammation is an established hallmark of severe disease, however, the molecular changes in peripheral immune cells remain far from understood. We combined multi-color flow cytometry with single-cell RNA sequencing and showed that blood neutrophil numbers are significantly increased in COPD and they are a heterogeneous population. A transcriptomic state that expressed interferon response genes correlated with alveolar damage and acute exacerbations. Furthermore, bronchoalveolar neutrophils expressed gene signatures corresponding to certain blood neutrophil states. Last, our data in a murine model of cigarette smoke exposure demonstrated that bone marrow neutrophil progenitors are expanded in smoke-treated animals and display signs of immune activation. Our study provides evidence that COPD systemic inflammation may derive from an activated haematopoietic precursor compartment.
Malignant Peripheral Nerve Sheath Tumors (MPNST) are soft-tissue sarcomas with poor prognosis and a lack of treatment options. About half of them arise in people with Neurofibromatosis Type 1 (NF1). In this study, we set out to explore the malignant progression and tumor heterogeneity by exploring the different tumors along the MPNST progression at the single cell level. This work is part of the project "NF1-Associated Peripheral Nerve Sheath Tumors at Single-Cell Resolution: Heterogeneity, Tumor Growth, and Malignant Progression".
Data Quality Control High-throughput sequencing techniques have become the leading method to study, decode and discover the genomic origins of biological phenomenons. EGA provides a secure archival of such identifiable genomics data with the purpose of data-upcycling, i.e. to re-use these data for research. High-quality data standards are essential to ensure the quality and credibility of the research. Moreover, a quality check report can assure a researcher beforehand about the data that they will request access, therefore saving time and effort. The EGA has developed a File Quality Control Report (QC Report) to provide generic quality control reports for Fastq, SAM/BAM/CRAM, and VCF files deposited at EGA. This QC Report will allow users to get information regarding the files submitted within a specific dataset. The data requesters will obtain information such as the quality of reads, mapped reads, number of variants, and other features before starting the requesting process, which will save the efforts and time.Accessing file quality control reportsIn each dataset page, the user can explore the files that it contains by clicking the "files" tab. The Quality Control report of a file has two sections. The first one, contains general information about the file, such as the inferred assembly, total reads, the dataset or study where it comes from, etc. The second section contains plots that summarise interesting information about the file, for example, the base coverage distribution, base quality or mapped reads. The description of each plot is accessible by clicking the "i" button at the top-right corner of each plot box. Technical Description For analysing the fastq, SAM/BAM/CRAM and VCF files, the EGA applies a set of tools widely used in the bioinformatics community. FASTQ: FastQC, recognised as the gold standard tool by the community.Per base sequence quality, per sequence quality scores, per base sequence content, per sequence GC content, sequence duplication levels, etc.SAM/BAM/CRAM: samtools, also the gold standard, generates results plots useful to get an overall idea of the quality of the file.base coverage distribution, base quality, % of mapped reads, % of both mates mapped, singletons, duplicates, etc.VCF: vcftools and bcftools, combined with a custom script to infer the genome assembly.site frequency distribution, Ts/Tv, base changes, indel distribution, etc.
Solitary fibrous tumor/Hemangiopericytoma (SFT/HPC) is a rare subtype of soft tissue sarcoma associated with NAB2-STAT6 gene fusions. In this study, a novel SFT/HPC was characterized using whole genome sequencing.
This dataset contains 207 samples sequenced using rapid-multiplexed barcoded library preparation on Nanopore PromethION R10 flow cells on P2 Solo and P24 systems. One unaligned BAM file per sample is provided.
The dataset contains amplicon sequencing data from 48 samples from 35 different patients with ovarian cancers. Cell free DNA was collected from plasma. Sequencing was performed on an Ion Torrent platform and the sequencing data is provided in bam format.
The purpose is to perform miRNA analysis on FACS sorted target immune cell populations to uncover novel potential miRNA diagnostic markers biomarkers predictive of therapy response but also possibly identify perturbed signaling pathways or miRNA targets of intervention with the use of agomirs/antagomirs.
Bulk RNA sequencing was performed on mCRC organoids subjected to cetuximab treatment, comparing ATOH1 knockout and control conditions. The dataset is intended to elucidate the contribution of ATOH1-regulated secretory cell population to cetuximab persistence mechanisms.
Single-cell RNA sequencing of 18 peripheral blood samples from six melanoma patients. The raw data is available as fastq files.
Human placenta samples from 52: 5 first trimester , 7 second trimester, and 40 term placenta. Data is uploaded as BAM files.