response efforts On 11 March 2020 the World Health Organization declared the novel coronavirus outbreak a global pandemic. Four months later, the European Genome-phenome Archive (EGA) released its first COVID-19 dataset. This dataset – single cell RNA and VDJ sequencing of B cells from 60 COVID-19 patients – showed that neutralizing antibodies could be identified by high-throughput sequencing in response to SARS-CoV-2 infection. That was one year ago. Today, the EGA provides access to fifteen COVID-19 datasets from researchers across seven countries in Asia, Europe, and North America. These studies represent almost 17,000 individuals and have resulted in at least sixteen publications and preprints. Researchers deposit controlled access COVID-19 data at EGA The global research community has come together rapidly to investigate the SARS-CoV-2 coronavirus and better understand the related disease, COVID-19. These research efforts generate valuable genetic and phenotypic data from patients and research participants that can be shared with approved researchers. The EGA enables sharing of this research data by providing a service for archiving and controlled distribution of sensitive data. Over the past year, the EGA has worked with researchers to archive and distribute COVID-19 data from high-throughput sequencing experiments, genotyping studies, and phenotypic information. These datasets investigate the immune system, blood, and cells and tissues of the lung, which are relevant for studying a contagious respiratory illness caused by a viral infection. *Study Spotlight. In January 2021, Ancestry.com demonstrated the utility of deep phenotyping based on self-reported outcomes from a large population of mild and asymptomatic COVID-19 cases. They identified genetic associations with eight COVID-19 phenotypes and showed distinct patterns of association, most notably related to the chr3/SLC6A20/LZTFL1 and chr9/ABO regions. The supporting data is available at the EGA to approved researchers and includes both genotype and phenotype data for 15,000 individuals. EGA collaborates with global COVID-19 community Since the coronavirus outbreak, the EGA has collaborated with other life science resources to support discovery and access to COVID-19 datasets. COVID-19 Host Genetics Initiative. With the NHGRI’s Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL) platform, the EGA enables sharing of individual-level genetic and phenotypic data from the COVID-19 Host Genetics Initiative (HGI). This initiative aims to generate, share, and analyze data from COVID-19 host genetics research projects to better understand the genetic determinants of COVID-19 susceptibility, severity, and outcomes. In response to COVID-19, the EGA actively supports COVID-19 data submissions and integration of data access and flow into the COVID-19-HGI analysis platform. *Study Spotlight. The COVID-19 HGI has combined individual-level data for 13,868 COVID-19 positive patients (N=7,167 hospitalized) from 17 cohorts in nine countries. The data were used to assess the association of the major common COVID-19 genetic risk factor (chromosome 3 locus tagged by rs10490770) with mortality, COVID-19-related complications, and laboratory values. The genotype and phenotype data for 10 of these cohorts is available at the EGA to approved researchers under accession EGAS00001005304. Fig 1: EGA COVID-19 studies are searchable in the European COVID-19 Data Portal alongside other COVID-19 and SARS-CoV-2 public datasets. European COVID-19 Data Portal. EGA-archived COVID-19 data are discoverable via the European COVID-19 Data Portal (Fig. 1), which brings together public and controlled access data to accelerate coronavirus research for the international community. By indexing all COVID-19-related data in one place, researchers can more easily discover relevant datasets of interest, thus increasing the “FAIRness” (Findability, Accessibility, Interoperability, Reusability) of this valuable data. Fig 2: SARS-CoV-2 viral sequences are imported from ENA and analysed in Galaxy to detect variants. Results are accessible to researchers through the COVID-19 Viral Beacon. COVID-19 Viral Beacon. The COVID-19 Viral Beacon tool was developed in collaboration with the European Nucleotide Archive and Galaxy to enable near real-time browsing of SARS-CoV-2 variability at genomic, amino acid, and motif levels (Fig. 2). The COVID-19 Viral Beacon allows researchers to (i) perform detailed searches about genomic variants, (ii) filter queries and find unique cases, (iii) filter data based on strain-specific variants, and (iv) explore associated metadata. It uses the Global Alliance for Genomics and Health (GA4GH) Beacon standard including new Beacon v2 features. With this tool, researchers can study intra-host mutations on genomic regions of interest, or trace any variant frequency over time using raw read data. More than 200,000 SARS-CoV-2 analysed genomic data files are now available to researchers for further exploration. Ongoing COVID-19 efforts at EGA Addressing the COVID-19 pandemic is a global effort. Federated resources are necessary to support transnational deposition, access, and analysis of sensitive COVID-19 host genetics and other related data. At the same time, many countries now have emerging personalized medicine programmes which are generating data from national or regional healthcare initiatives. These data are subject to more stringent information governance than research data and often must comply with national data protection legislation. To address this need, the Federated EGA was established to serve as the primary global resource for discovery and access of sensitive human omics and associated data consented for secondary use. The Federated EGA will comprise a network of national human data repositories and will implement community standards and common interfaces. Launching Federated EGA promises to accelerate not only global research efforts to understand, diagnose, and treat COVID-19, but also to foster data reuse, enable reproducibility, and accelerate biomedical and disease research to ultimately improve human health.
In this study, we compared various multiplexing reagents, including MULTI-Seq, Hashtag antibody, and CellPlex, in patient-derived xenografts (PDXs). We found that all multiplexing reagents worked well in cell types robust to ex vivo manipulation but suffered from signal-to-noise issues in more delicate sample types. We also compared the performance of fixed scRNA-Seq kits and CRISPR-based destruction of non-informative genes.
Background: Cis-regulatory elements such as enhancers and promoters are crucial for directing gene expression in the human heart. Dysregulation of these elements can result in many cardiovascular diseases that are major leading causes of morbidity and mortality worldwide. In addition, genetic variants associated with cardiovascular disease risk are enriched within cis-regulatory elements. However, the location and activity of these cis-regulatory elements in individual cardiac cell types remains to be fully defined. Methods: We performed single nucleus ATAC-seq and single nucleus RNA-seq to define a comprehensive catalogue of candidate cis-regulatory elements (cCREs) and gene expression patterns for the distinct cell types comprising each chamber of four non-failing human hearts. We used this catalogue to computationally deconvolute dynamic enhancers in failing hearts and to assign cardiovascular disease risk variants to cCREs in individual cardiac cell types. Finally, we applied reporter assays, genome editing, and electrophysiogical measurements in in vitro differentiated human cardiomyocytes to validate the molecular mechanisms of cardiovascular disease risk variants. Results: We defined >287,000 candidate cis-regulatory elements (cCREs) in human hearts at single-cell resolution, which notably revealed gene regulatory programs controlling specific cell types in a cardiac region/structure-dependent manner and during heart failure. We further report enrichment of cardiovascular disease risk variants in cCREs of distinct cardiac cell types, including a strong enrichment of atrial fibrillation variants in cardiomyocyte cCREs, and reveal 38 candidate causal atrial fibrillation variants localized to cardiomyocyte cCREs. Two such risk variants residing within a cardiomyocyte-specific cCRE at the KCNH2/HERG locus resulted in reduced enhancer activity compared to the non-risk allele. Finally, we found that deletion of the cCRE containing these variants decreased KCNH2 expression and prolonged action potential repolarization in an enhancer dosage-dependent manner. Conclusion: This comprehensive atlas of human cardiac cCREs provides the foundation for not only illuminating cell type-specific gene regulatory programs controlling human hearts during health and disease, but also interpreting genetic risk loci for a wide spectrum of cardiovascular diseases.
Introduction: Tumour Mutation Burden (TMB) is a potential biomarker for immune therapies. Identifying factors that may affect TMB prediction is key to its utility as biomarker for treatment options. Here we investigated parameters that might affect TMB using cytology smears obtained from endobronchial ultrasound transbronchial needle aspiration (EBUS TBNA)-sampled malignant lymph nodes, which are often the only diagnostic samples collected in patients with advanced lung cancer. Methods: Individual Diff-Quik cytology smears were prepared for each needle pass from the same enlarged malignant mediastinal lymph nodes. DNA extracted from smears of each pass underwent sequencing using a large gene panel sequencing (TSO500, Illumina). TMB was estimated for each individual needle pass using the TSO500 Local App v. 2.0 (Illumina). Results: Twenty patients had two or more Diff-Quik smears (total 45 smears) which passed sequencing quality control. Average TMB for all smears was 8.7 ± 5.0 mutations per megabase (Mb). Sixteen of the 20 patients had paired samples with minimal differences in TMB score (average difference 1.3 ± 0.85). Paired samples from 13 patients had concordant TMB in terms of scores being below or above a threshold of 10 mutations/Mb. Samples from seven patients had discrepant TMB, with four cases in particular exhibiting an average difference of 11.3 ± 2.7 mutations/Mb. Factors affecting TMB calling involved tumour content of the samples, the amount of DNA used in sequencing as well as bone fide intra tumour heterogeneity between paired samples. Conclusions: TMB assessment is feasible from EBUS-TBNA derived Diff-Quik slides from a single needle pass. Repeated samples of one or more lymph nodes have minimal variation in TMB in most cases (80%), suggesting accuracy of the test overall. To enhance accuracy of the result, care should be taken in relation to the tumour content of the cytology smear. Further, clinicians should be aware that even slight differences in lymph node site selection could give rise to variable genomics and discrepant TMB due to tumour heterogeneity. Therefore, it would be reasonable for multi-lymph node site analysis of TMB to be performed using EBUS TBNA aspirates from different easily accessible lymph nodes. Further investigations need to determine whether these samples should be combined as one sample or tested separately.
Single-cell ATAC-seq data for 5 CLL samples (2 controls, 3 tumor) of the CancerEpiSys-PRECiSe project.
Circle-seq data for 21 primary neuroblastoma samples supporting Koche et al. Extrachromosomal circular DNA drives oncogenic genome remodeling in neuroblastoma (2020).
This dataset contains the raw sequencing data (Runs) for all 10x Genomics single-cell ATAC-seq Experiment.
release_2: ICGC PedBrain: ChIP-Seq
miRNA seq data of 43 cases out of dataset EGAD00001000650 (MMML)
ATAC-seq (Illumina TDE1 Transposase) to profile accessible chromatin regions of cohesin-mutated (STAG2 or RAD21 mutations) and -wildtype adult AMLs.