Need Help?

BAM files of Roma and non-Roma Romanians

BAM files (Illumina HiSeq 2000) with whole genome sequencing data of 49 individuals of European/Romanian descent, and 50 individuals of Roma (Romani/Rroma) ethnic background from Romania.

Request Access

Data sharing policy for the Roma/Romanian data. No special requirements.

DATA ACCESS AGREEMENT These terms and conditions govern access to the managed access datasets (details of which are set out in Appendix I) to which the User Institution has requested access. The User Institution agrees to be bound by these terms and conditions. Definitions Authorised Personnel: The individuals at the User Institution to whom DAC for WGS data Romanians and Roma/Rroma (Romania) grants access to the Data. This includes the User, the individuals listed in Appendix II and any other individuals for whom the User Institution subsequently requests access to the Data. Details of the initial Authorised Personnel are set out in Appendix II. Data: The managed access datasets to which the User Institution has requested access. Data Producers: DAC for WGS data Romanians and Roma/Rroma (Romania) and the collaborators listed in Appendix I responsible for the development, organisation, and oversight of these Data. External Collaborator: A collaborator of the User, working for an institution other than the User Institution. Project: The project for which the User Institution has requested access to these Data. A description of the Project is set out in Appendix II. Publications: Includes, without limitation, articles published in print journals, electronic journals, reviews, books, posters and other written and verbal presentations of research. Research Participant: An individual whose data form part of these Data. Research Purposes: Shall mean research that is seeking to advance the understanding of genetics and genomics, including the treatment of disorders, and work on statistical methods that may be applied to such research. User: The principal investigator for the Project. User Institution(s): The Institution that has requested access to the Data. DAC for WGS data Romanians and Roma/Rroma (Romania): Contact Person: Hafid Laayouni Email: hafid.laayouni@upf.edu Telephone: (+34) 93-316-0845. 1. The User Institution agrees to only use these Data for the purpose of the Project (described in Appendix II) and only for Research Purposes. The User Institution further agrees that it will only use these Data for Research Purposes which are within the limitations (if any) set out in Appendix I. 2. The User Institution agrees to preserve, at all times, the confidentiality of these Data. In particular, it undertakes not to use, or attempt to use these Data to compromise or otherwise infringe the confidentiality of information on Research Participants. Without prejudice to the generality of the foregoing, the User Institution agrees to use at least the measures set out in Appendix I to protect these Data. 3. The User Institution agrees to protect the confidentiality of Research Participants in any research papers or publications that they prepare by taking all reasonable care to limit the possibility of identification. 4. The User Institution agrees not to link or combine these Data to other information or archived data available in a way that could re-identify the Research Participants, even if access to that data has been formally granted to the User Institution or is freely available without restriction. 5. The User Institution agrees only to transfer or disclose these Data, in whole or part, or any material derived from these Data, to the Authorised Personnel. Should the User Institution wish to share these Data with an External Collaborator, the External Collaborator must complete a separate application for access to these Data. 6. The User Institution agrees that the Data Producers, and all other parties involved in the creation, funding or protection of these Data: a) make no warranty or representation, express or implied as to the accuracy, quality or comprehensiveness of these Data; b) exclude to the fullest extent permitted by law all liability for actions, claims, proceedings, demands, losses (including but not limited to loss of profit), costs, awards damages and payments made by the Recipient that may arise (whether directly or indirectly) in any way whatsoever from the Recipient’s use of these Data or from the unavailability of, or break in access to, these Data for whatever reason and; c) bear no responsibility for the further analysis or interpretation of these Data. 7. The User Institution agrees to follow the Fort Lauderdale Guidelines (http://www.wellcome.ac.uk/stellent/groups/corporatesite/@policy_communications/documents/web_document/wtd003207.pdf) and the Toronto Statement (http://www.nature.com/nature/journal/v461/n7261/full/461168a.html). This includes but is not limited to recognising the contribution of the Data Producers and including a proper acknowledgment in all reports or publications resulting from the use of these Data. 8. The User Institution agrees to follow the Publication Policy in Appendix III. This includes respecting the moratorium period for the Data Producers to publish the first peer-reviewed report describing and analysing these Data. 9. The User Institution agrees not to make intellectual property claims on these Data and not to use intellectual property protection in ways that would prevent or block access to, or use of, any element of these Data, or conclusion drawn directly from these Data. 10. The User Institution can elect to perform further research that would add intellectual and resource capital to these data and decide to obtain intellectual property rights on these downstream discoveries. In this case, the User Institution agrees to implement licensing policies that will not obstruct further research and to follow the U.S. National Institutes of Health Best Practices for the Licensing of Genomic Inventions (2005) (https://www.icgc.org/files/daco/NIH_BestPracticesLicensingGenomicInventions_2005_en.pdf) in conformity with the Organisation for Economic Co-operation and Development Guidelines for the Licensing of the Genetic Inventions (2006) (http://www.oecd.org/science/biotech/36198812.pdf ). 11. The User Institution agrees to destroy/discard the Data held, once it is no longer used for the Project, unless obliged to retain the data for archival purposes in conformity with audit or legal requirements. 12. The User Institution will notify DAC for WGS data Romanians and Roma/Rroma (Romania) within 30 days of any changes or departures of Authorised Personnel. 13. The User Institution will notify DAC for WGS data Romanians and Roma/Rroma (Romania) prior to any significant changes to the protocol for the Project. 14. The User Institution will notify DAC for WGS data Romanians and Roma/Rroma (Romania) as soon as it becomes aware of a breach of the terms or conditions of this agreement. 15. DAC for WGS data Romanians and Roma/Rroma (Romania) may terminate this agreement by written notice to the User Institution. If this agreement terminates for any reason, the User Institution will be required to destroy any Data held, including copies and backup copies. This clause does not prevent the User Institution from retaining these data for archival purpose in conformity with audit or legal requirements. 16. The User Institution accepts that it may be necessary for the Data Producers to alter the terms of this agreement from time to time. As an example, this may include specific provisions relating to the Data required by Data Producers other than DAC for WGS data Romanians and Roma/Rroma (Romania). In the event that changes are required, the Data Producers or their appointed agent will contact the User Institution to inform it of the changes and the User Institution may elect to accept the changes or terminate the agreement. 17. If requested, the User Institution will allow data security and management documentation to be inspected to verify that it is complying with the terms of this agreement. 18. The User Institution agrees to distribute a copy of these terms to the Authorised Personnel. The User Institution will procure that the Authorised Personnel comply with the terms of this agreement. 19. This agreement (and any dispute, controversy, proceedings or claim of whatever nature arising out of this agreement or its formation) shall be construed, interpreted and governed by the laws of Spain and shall be subject to the exclusive jurisdiction of the Spanish courts. Agreed for User Institution Signature: Name: Title: Date: Principal Investigator I confirm that I have read and understood this Agreement. Signature: Name: Title: Date: Agreed for DAC for WGS data Romanians and Roma/Rroma (Romania) Signature: Name: Title: Date: APPENDIX I – DATASET DETAILS APPENDIX II ––PROJECT DETAILS APPENDIX III –– PUBLICATION POLICY APPENDIX I – DATASET DETAILS Dataset reference (EGA Study ID and Dataset Details) EGA Study ID: EGAS00001003624 Dataset Description: The dataset consists of 99 whole genome sequencing files (BAM, Illumina HiSeq 2000) of 49 individuals of European/Romanian descent, and 50 individuals of Roma (Romani/Rroma) ethnic background from Dolj County (Romania). The data is also included in VCF format. Name of project that created the dataset The shaping of immunological response through natural selection after migration: the case of the Roma. Names of other data producers/collaborators Begoña Dobon Berenguer Jaume Bertranpetit Busquets Hafid Laayouni Mihai Netea Rob ter Horst Specific limitations on areas of research There are no limitations on the areas of research. Minimum protection measures required File access: Data can be held in unencrypted files on an institutional compute system, with Unix user group read/write access for one or more appropriate groups but not Unix world read/write access behind a secure firewall. Laptops holding these data should have password protected logins and screenlocks (set to lock after 5 min of inactivity). If held on USB keys or other portable hard drives, the data must be encrypted. APPENDIX II – PROJECT DETAILS (to be completed by the Requestor) Details of dataset requested i.e., EGA Study and Dataset Accession Number Brief abstract of the Project in which the Data will be used (500 words max) All Individuals who the User Institution to be named as registered users Name of Registered User Email Job Title Supervisor* All Individuals that should have an account created at the EGA Name of Registered User Email Job Title APPENDIX III – PUBLICATION POLICY DAC for WGS data Romanians and Roma/Rroma (Romania) intend to publish the results of their analysis of this dataset and do not consider its deposition into public databases to be the equivalent of such publications. DAC for WGS data Romanians and Roma/Rroma (Romania) anticipate that the dataset could be useful to other qualified researchers for a variety of purposes. However, some areas of work are subject to a publication moratorium. The publication moratorium covers any publications (including oral communications) that describe the use of the dataset. For research papers, submission for publication should not occur until 12 months after these data were first made available on the relevant hosting database, unless DAC for WGS data Romanians and Roma/Rroma (Romania) has provided written consent to earlier submission. In any publications based on these data, please describe how the data can be accessed, including the name of the hosting database (e.g., The European Genome-phenome Archive at the European Bioinformatics Institute) and its accession numbers (e.g., EGAS00001003624), and acknowledge its use by citing the related initial publication in a form agreed by the User Institution with DAC for WGS data Romanians and Roma/Rroma (Romania).

Studies are experimental investigations of a particular phenomenon, e.g., case-control studies on a particular trait or cancer research projects reporting matching cancer normal genomes from patients.

Study ID Study Title Study Type
EGAS00001003624 Other

This table displays only public information pertaining to the files in the dataset. If you wish to access this dataset, please submit a request. If you already have access to these data files, please consult the download documentation.

ID File Type Size Located in
EGAF00002452261 bam 76.9 GB
EGAF00002452262 bam 77.9 GB
EGAF00002452263 bam 78.6 GB
EGAF00002452264 bam 89.4 GB
EGAF00002452265 bam 77.6 GB
EGAF00002452266 bam 82.5 GB
EGAF00002452267 bam 86.3 GB
EGAF00002452268 bam 81.0 GB
EGAF00002452269 bam 101.0 GB
EGAF00002452270 bam 82.7 GB
EGAF00002452271 bam 100.2 GB
EGAF00002452272 bam 79.1 GB
EGAF00002452273 bam 80.9 GB
EGAF00002452274 bam 75.4 GB
EGAF00002452275 bam 74.8 GB
EGAF00002452276 bam 74.6 GB
EGAF00002452277 bam 113.4 GB
EGAF00002452278 bam 82.6 GB
EGAF00002452279 bam 74.2 GB
EGAF00002452280 bam 68.0 GB
EGAF00002452281 bam 76.3 GB
EGAF00002452282 bam 86.5 GB
EGAF00002452283 bam 74.8 GB
EGAF00002452284 bam 72.4 GB
EGAF00002452285 bam 76.3 GB
EGAF00002452286 bam 75.7 GB
EGAF00002452287 bam 72.0 GB
EGAF00002452288 bam 70.5 GB
EGAF00002452289 bam 74.8 GB
EGAF00002452290 bam 74.0 GB
EGAF00002452291 bam 76.7 GB
EGAF00002452292 bam 77.4 GB
EGAF00002452293 bam 78.4 GB
EGAF00002452294 bam 73.5 GB
EGAF00002452295 bam 74.5 GB
EGAF00002452296 bam 80.1 GB
EGAF00002452297 bam 72.2 GB
EGAF00002452298 bam 73.1 GB
EGAF00002452299 bam 86.6 GB
EGAF00002452300 bam 76.0 GB
EGAF00002452301 bam 72.2 GB
EGAF00002452302 bam 74.0 GB
EGAF00002452303 bam 69.6 GB
EGAF00002452304 bam 71.6 GB
EGAF00002452305 bam 83.5 GB
EGAF00002452306 bam 79.1 GB
EGAF00002452307 bam 78.3 GB
EGAF00002452308 bam 79.7 GB
EGAF00002452309 bam 81.2 GB
EGAF00002452310 bam 82.1 GB
EGAF00002452311 bam 75.3 GB
EGAF00002452312 bam 71.1 GB
EGAF00002452313 bam 79.7 GB
EGAF00002452314 bam 78.7 GB
EGAF00002452315 bam 79.0 GB
EGAF00002452316 bam 80.7 GB
EGAF00002452317 bam 83.4 GB
EGAF00002452318 bam 90.5 GB
EGAF00002452319 bam 82.8 GB
EGAF00002452320 bam 100.4 GB
EGAF00002452321 bam 95.1 GB
EGAF00002452322 bam 89.7 GB
EGAF00002452323 bam 82.5 GB
EGAF00002452324 bam 88.3 GB
EGAF00002452325 bam 89.8 GB
EGAF00002452326 bam 74.3 GB
EGAF00002452327 bam 85.3 GB
EGAF00002452328 bam 94.9 GB
EGAF00002452329 bam 83.8 GB
EGAF00002452330 bam 80.7 GB
EGAF00002452331 bam 89.0 GB
EGAF00002452332 bam 78.3 GB
EGAF00002452333 bam 76.2 GB
EGAF00002452334 bam 73.6 GB
EGAF00002452335 bam 78.5 GB
EGAF00002452336 bam 79.0 GB
EGAF00002452337 bam 76.2 GB
EGAF00002452338 bam 77.1 GB
EGAF00002452339 bam 76.4 GB
EGAF00002452340 bam 80.8 GB
EGAF00002452341 bam 83.1 GB
EGAF00002452342 bam 80.8 GB
EGAF00002452343 bam 83.7 GB
EGAF00002452344 bam 80.5 GB
EGAF00002452345 bam 75.9 GB
EGAF00002452346 bam 79.6 GB
EGAF00002452347 bam 97.6 GB
EGAF00002452348 bam 78.3 GB
EGAF00002452349 bam 96.4 GB
EGAF00002452350 bam 81.1 GB
EGAF00002452351 bam 84.0 GB
EGAF00002452352 bam 92.1 GB
EGAF00002452353 bam 88.5 GB
EGAF00002452354 bam 84.0 GB
EGAF00002452355 bam 77.8 GB
EGAF00002452356 bam 84.6 GB
EGAF00002452357 bam 78.4 GB
EGAF00002452358 bam 85.2 GB
EGAF00002452359 bam 78.4 GB
99 Files (8.0 TB)