This collection contains all of NHLBI's authorized individual-level genomic datasets currently in dbGaP that are approved for General Research Use (GRU) and have no further limitations beyond those outlined in the model Data Use Certification Agreement. The public posting of Genomic Summary Results are not allowed. Access to this study collection will include additional authorized individual-level GRU datasets that become available. Renewal of this study collection is required annually. To request access to this study collection, select phs003132 in the dbGaP Authorized Access System.
This collection contains all of NHLBI's authorized individual-level genomic datasets currently in dbGaP that are approved for Health/Medical/Biomedical (HMB) Research Use and have no further limitations beyond those outlined in the model Data Use Certification Agreement. The public posting of Genomic Summary Results are allowed. Access to this study collection will include additional authorized individual-level HMB datasets that become available. Renewal of this study collection is required annually. To request access to this study collection, select phs003134 in the dbGaP Authorized Access System.
This collection contains all of NHLBI's authorized individual-level genomic datasets currently in dbGaP that are approved for Health/Medical/Biomedical (HMB) and have no further limitations beyond those outlined in the model Data Use Certification Agreement. The public posting of Genomic Summary Results are not allowed. Access to this study collection will include additional authorized individual-level HMB datasets that become available. Renewal of this study collection is required annually. To request access to this study collection, select phs003133 in the dbGaP Authorized Access System.
CTNNB1 mutation is a major driver of HCC, particularly in metabolic-associated subtype. This study utilized patient-derived tumour organoids and cell lines harbouring various endogenous CTNNB1 mutations to comprehensively map the transcriptional repertoire regulated by mutant β-catenin through ChIP-seq under normal and fatty conditions. Genome-wide binding patterns of β-catenin were integrated with ChIP-seq data for histone modifications H3K27ac and H3K27me3 to assess active and repressive chromatin states. Oleic acid treatment was used to model fatty conditions.
Toxoplasmosis is a zoonotic disease caused by a ubiquitous protozoan parasite called Toxoplasma gondii, which can infect all mammal and bird species throughout the world. seroprevalence varies widely between countries. Studies have estimated that between 7-34% of people in the UK have been infected with T. gondii. The vast majority of these people will not have noticed any symptoms, however about 10% of people develop a mild to moderate self limiting flu-like illness. Following the acute active stage of the infection the parasite persists in the body in the form of cysts, particularly in heart and skeletal muscle and nervous system tissues, for many years, and usually for life. In immunocompetent persons these cysts do not pose a health risk. We will use RNA-seq to quantify the transcriptional response of macrophages to T gondii infection. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
EGA Webin EGA Webin serves as a platform for registering metadata for array based submissions, large scale sequence submission as well as for legacy EGA submission accounts (ega-box-XXXX). For large scale submitters of sequence data you have also the option to submit metadata via XMLS programmatic submission, while new submitters are advised to utilise the Submitter Portal for their submissions. You can request a legacy EGA submission account (ega-box-XXXX) by populating this form. Please, allow two business days for our Helpdesk team to contact you after populating this form. WEBIN actions: Register metadata for a sequence submission Register study, samples, experiments, runs, DAC, policy and dataset/s after file upload. Register components for your array-based metadata submission Register study, samples, DAC or policy before uploading files. Edit existing submission metadata Change or update previously submitted metadata. Register metadata for a sequence submission Ensure that all sequence files have been encrypted before uploading them to your submission account using the EgaCryptor. Go to the EGA Webin and log in using your submission account name (ega-box-XXX) and password. Register components of your metadata submission Study Samples Data Access Committee (DAC) Data access policy Dataset For array-base submissions: Study, Samples, Data Access Committee (DAC) and Data access policy may all be registered BEFORE file upload and dataset registration through the array-base template. Register your Study Go to the “Studies (Projects)” box Click on “Register Study” and fill in the information related to your study. Click on “Submit”, this will save the information and generate an EGA study ID. To use the study accession number in a publication, the study has to be previously released on the EGA website, we suggest the following format: "Sequence data has been deposited at the European Genome-phenome Archive (EGA), which is hosted by the EBI and the CRG, under accession number EGASXXXXXXXXXXX.Further information about EGA can be found on https://ega-archive.org and “The European Genome-phenome Archive in 202 "(10.1093/nar/gkab1059)" Register your Samples Go to the “Samples” box Click on “Register Samples” Select “Download spreadsheet to register samples” and customise your template, there is a default EGA template (EGA default checklist) but more attributes can be added if required. For the EGA default checklist, there are mandatory,recommended and optional attributes. As well custom fields which can be added if required. Mandatory attributes Field Name Description tax_id Taxonomy ID of the organism as in the NCBI Taxonomy database. Entries in the NCBI Taxonomy database have integer taxon IDs. See our tips for sample taxonomy here scientific_name Scientific name of the organism as in the NCBI Taxonomy database. Scientific names typically follow the binomial nomenclature. For example, the scientific name for humans is Homo sapiens. sample_alias Unique name of the sample. If not selected system will auto generate an unique alias sample_title Title of the sample sample_description Description of the sample phenotype *** Where possible, please use the Experimental Factor Ontology (EFO) to describe your phenotypes. Recommended attributes Field Name Description subject_id Identifier for the subject where the sample has been derived from gender * Sex Optional attributes Field Name Description sex sex of the organism from which the sample was obtained disease_site Affected organ sample type Affected organ donor_id ** Identifier of the donor where the sample has been derived from *Gender should be described as 'male', 'female' or 'unknown'. If 'unknown' due to a known sex chromosome aneuploidy, please create a user defined attribute called 'Sex chromosome karyotype' and add the appropriate value, for example, 'XXY'. **Donor id (Subject id) should be a de-identified subject handle. If unknown, please add 'unknown' to the field. ***Phenotypes should, where possible, be an Experimental Factor Ontology accession. If a term cannot be found to describe your phenotype please use free text. All sample phenotypes considered important for further analysis of the data should be provided (for example, tumour type), additional phenotype attributes can be created by defining your own attributes; use the notion 'phenotype2', 'phenotype3', etc. After you have customised the fields for the sample submission, download the template and fill in the information. Example of the sample template: Finally upload the sample template to get the EGA accession IDs for the samples. Register your Data Access Committee (DAC) Further information on the role of your DAC Go to the “Data Access” box Click on “Register Dacs” Input the information about the DAC and register at least one main DAC contact. Register your Data Access Policy Your Data Access Policy provides the terms and conditions of data use, this is also referred to as the Data Access Agreement (DAA). Completion of a DAA by the applicant/s should form part of the application process to the Data Access Committee (DAC). Go to the “Data Access” box Click on “Register Policies” Select the DAC to which this policy will be linked to and fill in the policy information. Submitting your Runs and Analyses This section is only for sequence data submission, for array-based submission it can be skipped. Please refer to our Submitting array based metadata Runs Registration Go to the “Raw Reads (Experiments and Runs)” box Click on “Submit Reads” Select “Download spreadsheet template for Read submission” Select the template corresponding to your submission type For the templates you have the option to customise the optional fields. To check their description click on “Show Description “ Download the template and fill in the required information. Example of the runs template: We recommend that Fastq, BAM, and CRAM read files are submitted using Webin-CLI When using this interface instead of Webin-CLI, raw sequences must be uploaded in one of the supported data formats before they can be submitted. The files can be uploaded using FTP or Aspera. The study and the sequenced samples must be pre-registered before the raw reads are submitted. Please note that each individual study and sample should be registered only once. You will be asked to provide information about the sequencing libraries and instruments. Submitting your Dataset This section is only for sequence data submissions, for array based submissions it can be skipped. Please refer to our Submitting array based metadata The dataset describes the data files, defined by the run (EGARXXXXXXXXXXX) and analysis (EGAZXXXXXXXXXXX) accessions that make up the dataset and links the collection of data files to a specified Data Access Committee and Data Access Policy. As a result, you must have registered your Reads and experiments, Data Access Committee (DAC) and Data access policy before submitting your Dataset. Please consider the number of datasets that your submission consists of, for example, a case control study is likely to consist of at least two datasets. In addition, we suggest that multiple datasets should be described for studies using the same samples but different sequence technologies. Please contact EGA Helpdesk for further assistance. Go to the “Data Access” box Click on “Register Dataset” Select the Data Access Committee (DAC) and Data access policy Register your dataset After submitting your dataset you should contact the EGA Helpdesk to provide a release date for your dataset. Datasets are automatically held (i.e. not released) unless they are affiliated to a study that has already been released. Edit/update existing submission metadata Go to the “Report” section of the object you would like to edit. Locate the object and click on the arrow under action. An option menu will be displayed. Objects can be edited through their XML or with the WEBIN menu. After an object has been edited, changes would be available on the website until the submission is released again. Please contact the EGA Helpdesk if you require further assistance.
South African breast cancer GWAS genotype data for 2823 female African breast cancer cases. The data was generated using the H3Africa Custom microarray and genotyped on the Illumina HiScan instrument. The dataset is in VCF format.
This Data Access Committee is responsible for reviewing and managing data access requests related to genomic variant files from whole exome sequencing of patients diagnosed with Meniere disease. This includes data from both familial and sporadic forms of the disease.
This dataset contains whole sequencing data from 30 patients with pediatric AML. For each patient, there is one dataset derived from tumor cells and one dataset derived from a matched non-malignant control.
Whole genome sequncing data of original/SHANK2 modified/SHANK2 knockout. Note that the SHANK2 knockout sample is a different sample from 1_0441_003. Please refer to other paper for the data.