Controlled Access Sequence Datasets
We're happy to announce that we are now accepting Affiliates in the Federated EGA Network! Federated EGA is a global network of repositories enabling secure discovery and access to sensitive human data. Currently, Federated EGA consists of Central EGA (CEGA), which serves the global community, and nine national Nodes that serve respective jurisdictions. Did you know that the Federated EGA (FEGA) has always envisioned a third tier within the network? This brings us to FEGA Affiliates, recently defined in the newly approved "Federated EGA Structure and Organisation" document. This new tier broadens the range of institutions or organisations that can become part of FEGA. Thus begins a new era for the network. What is a FEGA Affiliate? A FEGA Affiliate archives and serves controlled-access data within its own governance scope, while still participating in federated discovery through the EGA catalogue. Affiliates can be individual organisations or institutions, healthcare genomics initiatives, consortia of institutions, as well as project-specific consortia (national or international). In practical terms, a FEGA Affiliate: Hosts and safeguards the data it archives. Submits non-personal metadata to CEGA so studies are discoverable and can receive EGA accessions. Manages permissions and data access workflows for its datasets. Provides data access and/or distribution, along with support, to authorised users. Key takeaways about FEGA Affiliates FEGA is now organised into three tiers: CEGA, FEGA Nodes, and FEGA Affiliates. Affiliates archive data from their own organisation and/or selected partner institutions (for example, an international consortium). Affiliates make studies discoverable through the EGA catalogue by submitting non-personal metadata for accessioning. Access decisions and conditions are managed locally by the Affiliate (e.g. distribution or access in a trusted research environment). Are you considering whether your institution or project might be a good fit as a FEGA Affiliate? Would you like to learn more about this tier? We have published a blog post that addresses potential common questions, providing answers on this topic. You can read it here.
This sample represents a collection of cases across a range of sites. All of these samples were ascertained for ADHD with most meeting criteria for combined type ADHD. The collection sites span Europe and America. Further details on the source and inclusion and exclusion information can be found in Neale et al. "Case-Control Genome-Wide Association of Attention-Deficit / Hyperactivity Disorder" J Am Acad Child Adolesc Psychiatry. 2010 September; 49(9): 906-920 PMID20732627. For online access to this manuscript see: PMC2928577.
The INCLUDE (INvestigation of Co-occurring conditions across the Lifespan to Understand Down syndromE) Project is an NIH-wide collaboration that seeks to improve health and quality-of-life for people with Down syndrome. The INCLUDE Project Data Coordinating Center and partners created the INCLUDE Data Hub, a centralized data resource that allows access to large-scale clinical and multi-omics datasets specific to Down syndrome and supports collaborative, cloud-based analysis to accelerate scientific discoveries related to Down syndrome and its co-occurring conditions.
Multiomics analysis of rare cancers to identify genetic alterations related with pathogenesis and prognosis. To identification of genetics alterations of Dedifferentiated liposarcoma, DNAs and RNAs were extracted from dedifferentiated liposarcoma and paired non-cancer (normal) tissues. NGS libaraies were prepared by using Agilent SureSelect XT Human All Exon V5 + IncRNA kit for whole exome sequencing (WES) and TruSeq Stranded mRNA Prep kit or TruSeq RNA Access Library Prep kit (RNA seq). Sequencing was perfomred by Illumina HiSeq 2500.
Data access committee for U1236 SEP datasets
Data access comittee for ICARUS LUNG 01
Data access committee for ADARIO / Targeting ADAR1 in Immuno-Oncology
Submitter Portal API The metadata submission process can be difficult and time-consuming. For this reason, the EGA has developed the Submitter Portal, a tool that was created to offer a simplified and user-friendly method of registering metadata. Our portal provides features that are intended to make the input of metadata easier, ensuring that your data is registered correctly and effectively. Our page is divided into logical sections and includes a helpful video instruction to further assist you as you complete the submission process. With the Submitter Portal, you can rest assured that the submission of your data is in good hands. For those that want a more flexible and automated approach, we also provide a programmatic approach using the Submitter Portal AP in addition to the user interface of the Submitter Portal. With the help of our API, you can quickly include the submission of metadata into your own workflow for a more effective and individualised experience. Previous steps Create your EGA account To submit data you first need to create your EGA user. Then, once your account has been verified, you will have to request a submitter role and sign the EGA Data Processing Agreement (DPA). When the DPA is signed, you must send it to EGA Helpdesk for further validation. Please, note that if you already have an EGA account, or you have an ega-box (submission account), you can skip this step. Upload your files Please note that all your files must be encrypted using the Crypt4GH tool before upload them.As soon as you are assigned with a submitter role, you will be able to connect to the EGA inbox and upload your files. Understand the EGA metadata schema It's crucial to comprehend the EGA metadata schema, a set of rules that specify how data is organised, described, and shared inside the EGA, in order to get the most out of this resource. Learn all about EGA metadata schema! Register your DAC and policy There are two objects in the EGA metadata schema that are registered in a separate portal. All DACs and policy objects are registered using the DAC Portal, a tool developed by EGA to help data controllers manage their data stored at the EGA. You can find the relevant information in the DAC Portal Guide. Programmatic submission All the calls to the API need to be Authenticated. We use the OpenID Connect protocol. API Usage Flow: Click here to check the API usage flow schema Obtain Access and Refresh Token: The first step to using the API is to obtain an access and refresh token. These tokens are required for authentication and authorisation of API requests. To obtain these tokens, you need to log in using your EGA credentials. Example: curl "https://idp.ega-archive.org/realms/EGA/protocol/openid-connect/token" \ -d "grant_type=password" \ -d "client_id=sp-api" \ -d "username=your-username" \ --data-urlencode "password=your-password" Use Access Token in API Calls: Once you have obtained the access token, you must use it in all the calls to the API. The access token is valid for a limited time period. When it expires, you will need to use the refresh token to obtain a new access token. Create Submission Object: After authentication, you can start creating a submission object. This object will be used to store all the metadata objects and files related to the submission. Example: curl "https://submission.ega-archive.org/api/submissions" \ -H "Authorization: Bearer your-access-token" \ -H "Content-Type: application/json" \ -d "{ \"title\":\"My submission title\", \"description\":\"My submission description\" }" Create EGA Metadata Objects: Within the submission object, you can create other metadata objects required for the submission. These objects may include information about the submitter, study, sample, experiment, and run. Example of study creation: curl "https://submission.ega-archive.org/api/submissions/{submisison-id}/studies" \ -H "Authorization: Bearer your-access-token" \ -H "Content-Type: application/json" \ -d "{ \"title\":\"My study title\", \"description\":\"My study description\", \"study_type\":\"Metagenomics\" }" If you would like to reuse already registered objects in new submissions, bear in mind that they must have an accession which you must use to reference them. You cannot reuse objects that have not been finalised and accepted yet.For example, when creating a new run which is reusing a previously submitted experiment, instead of using experiment_provisional_id, you must use experiment_accession_id. Enums There are some fields that only accept specific values, i.e. study_type. The API provides several enumerations to list these values. You can list all the enumerations, or a specific one, for example, study_types. Obtain IDs of files You can list the files available in your inbox and filter by a prefix by adding the prefix param, i.e. &prefix=/my_folder/abc Example: curl "https://submission.ega-archive.org/api/files?status=inbox&prefix=/my_folder/abc" \ -H "Authorization: Bearer your-access-token" \ -H "Content-Type: application/json" To improve performance and manageability, we have limited the number of files that the Submitter Portal API can return in a response. If a given call exceeds this limit, user will be asked to narrow down the search by using the query parameters already available in this endpoint. Finalise Submission: Once all the metadata objects are created and linked to the submission, you can finalise the submission. Finalising the submission sends it to the Helpdesk team for review. Please, note that all files linked to the submission must be ingested, and all objects created in a submission must be linked before you can finalise the submission. API Reference: For a full API reference, please check our specification documentation.