Whole genome sequencing of circulating cell-free DNA on Illumina and Ultima platforms
The dataset comprises sequencing of circulating cell-free DNA on two platforms: Illumina (n=1) and Ultima (n=92). Sixteen samples were generated through standard whole genome sequencing (Illumina, n=1; Ultima, n=15), and 77 through a duplex unique molecular identifier (UMI) whole genome sequencing workflow (Ultima, n=77). The dataset includes both cancer samples and cancer-free control samples.
- 19/03/2025
- 93 samples
- DAC: EGAC00001001587
- Technologies: HiSeq X Ten, unspecified
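The platform and workflow counts given in the description can be cross-checked against the stated total of 93 samples. The snippet below is only a sketch of that arithmetic; the dictionary layout and the workflow labels are illustrative choices, not identifiers from the dataset.

```python
# Counts as stated in the dataset description.
# The (platform, workflow) key layout and the label strings are illustrative.
samples = {
    ("Illumina", "standard WGS"): 1,
    ("Ultima", "standard WGS"): 15,
    ("Ultima", "duplex UMI WGS"): 77,
}


def count_by(index):
    """Sum sample counts over one element of the (platform, workflow) key."""
    totals = {}
    for key, n in samples.items():
        totals[key[index]] = totals.get(key[index], 0) + n
    return totals


by_platform = count_by(0)
by_workflow = count_by(1)
total = sum(samples.values())

print(by_platform)
print(by_workflow)
print(total)
```

The per-platform totals (1 Illumina, 92 Ultima) and the overall total (93) agree with the figures quoted in the description and the sample count in the list above.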
Cornell DAA General
To access this dataset, your institution needs to sign a data access agreement with Cornell University. The DAC coordinator will email a copy of the agreement to you. Here is the text of the DAA:

DATA ACCESS AGREEMENT

These terms and conditions govern access to the managed access datasets (details of which are set out in Appendix I) to which User Institution has requested access. User Institution agrees to be bound by these terms and conditions.

Definitions

Authorized Personnel: The individuals at User Institution to whom Cornell University grants access to the Data. This includes User, the individuals listed in Appendix II and any other individuals for whom User Institution subsequently requests access to the Data. Details of the initial Authorized Personnel are set out in Appendix II.

Data: The managed access datasets to which User Institution has requested access. “Data” includes any dataset or database into which the Data, in whole or in part, has been aggregated, merged, or combined.

Data Producers: Cornell University and the collaborators listed in Appendix I were responsible for the development, organization, and oversight of these Data.

External Collaborator: A collaborator of User, working for an institution other than User Institution.

Project: The project for which User Institution has requested access to these Data. A description of the Project is set out in Appendix II.

Publications: Includes, without limitation, articles published in print journals, electronic journals, reviews, books, posters and other written and verbal presentations of research.

Research Participant: An individual whose data form part of these Data.

Research Purposes: Shall mean research that is seeking to advance the understanding of genetics and genomics, including the treatment of disorders, and work on statistical methods that may be applied to such research. “Research Purposes” specifically excludes: (a) licensing or sale of Data; (b) offering a service or other product using Data; (c) commercializing a machine learning model trained on the Data; and (d) using Data in regulatory filings.

User: The principal investigator for the Project.

User Institution(s): The institution that has requested access to the Data.

Cornell: Cornell University, through its Center for Technology Licensing, with offices at 395 Pine Tree Road, Suite 310, Ithaca, NY 14850 and at 1155 York Avenue, New York, NY 10065. Permanent email: mta-ctl@cornell.edu.

Agreement

1. User Institution will use Data only for the Project as described in Appendix II, only for Research Purposes, and only within the limitations set out in Appendix I.

2. User Institution will destroy/discard Data held once it is no longer used for the Project, unless obliged to retain a copy of Data for archival purposes in conformity with audit or legal requirements.

3. User Institution will preserve, at all times, the confidentiality of Data. Without prejudice to the generality of the foregoing, User Institution will use at least the measures set out in Appendix I to protect Data. In particular, it will not use, or attempt to use, Data to compromise or otherwise infringe the confidentiality of information about Research Participants, and will limit the possibility of identification and in any research papers or publications. User Institution will not link or combine Data to other information or archived data available in a way that could re-identify the Research Participants, even if access to that data has been formally granted to User Institution or is freely available without restriction.

4. User Institution will only transfer or disclose Data, in whole or part, to the Authorized Personnel. User Institution will distribute a copy of these terms to the Authorized Personnel. User Institution will require that the Authorized Personnel comply with the terms of this agreement. Should User Institution wish to share Data with an External Collaborator, the External Collaborator must complete a separate application for access to Data.

5. User Institution will notify Cornell as soon as it becomes aware of a breach of the terms or conditions of this agreement.

6. The Data Producers, and all other parties involved in the creation, funding or protection of Data: a) make no warranty or representation, express or implied as to the accuracy, quality or comprehensiveness of Data; b) shall have no liability for actions, claims, proceedings, demands, losses (including but not limited to loss of profit), costs, awards, damages and payments made by User Institution that may arise (whether directly or indirectly) in any way whatsoever from User Institution’s use of Data or from the unavailability of, or break in access to, Data for whatever reason; and c) bear no responsibility for the further analysis or interpretation of Data.

7. User Institution will acknowledge the Data Producers and the EGA in all reports or publications resulting from the use of Data as described in the Publication Policy in Appendix III and will follow the Fort Lauderdale Guidelines (https://www.genome.gov/Pages/Research/WellcomeReport0303.pdf) and the Toronto Statement (http://www.nature.com/nature/journal/v461/n7261/full/461168a.html).

8. (a) User Institution will not make intellectual property claims on Data and will not use intellectual property protection in ways that would prevent or block access to, or use of, any element of Data, or conclusion drawn directly from Data. (b) Should User Institution elect to obtain intellectual property rights on downstream discoveries arising from its use of Data that comply with Paragraph 8a, User Institution will implement licensing policies that will not obstruct further research and will follow the U.S. National Institutes of Health Best Practices for the Licensing of Genomic Inventions (2005) (https://www.ott.nih.gov/sites/default/files/documents/pdfs/70fr18413.pdf) and the Organization for Economic Co-operation and Development Guidelines for the Licensing of the Genetic Inventions (2006) (http://www.oecd.org/science/biotech/36198812.pdf).

9. Cornell may terminate this agreement by written notice to User Institution. If this agreement terminates for any reason, User Institution will destroy any Data in its possession or control other than one copy of Data retained solely for archival purpose in conformity with audit or legal requirements.

10. User Institution accepts that it may be necessary for the Data Producers to alter the terms of this agreement from time to time. In the event that changes are required, Cornell will contact User Institution to inform it of the changes and User Institution may elect to accept the changes or terminate the agreement.

11. If requested, User Institution will allow data security and management documentation to be inspected to verify that it is complying with the terms of this agreement, including the “Minimum protection measures required” and “File access” sections of Appendix I.

12. This agreement (and any dispute, controversy, proceedings or claim of whatever nature arising out of this agreement or its formation) shall be construed, interpreted, and governed by the laws of the United States and shall be subject to the exclusive jurisdiction of the United States courts.

Agreed for User Institution
Signature:
Name:
Title:
Institution name:
Date:

Principal Investigator
I confirm that I have read and understood this Agreement.
Signature:
Name:
Title:
Date:

Agreed for Cornell
Signature:
Name: Brian Kelly
Title: Director, Technology Licensing Center for Technology Licensing at Cornell University
Date:

APPENDIX I – DATASET DETAILS
APPENDIX II – PROJECT DETAILS
APPENDIX III – PUBLICATION POLICY
Studies are experimental investigations of a particular phenomenon, e.g., case-control studies on a particular trait or cancer research projects reporting matching cancer normal genomes from patients.
Study ID | Study Title | Study Type |
---|---|---|
EGAS50000000844 | | Whole Genome Sequencing |
This table displays only public information pertaining to the files in the dataset. If you wish to access this dataset, please submit a request. If you already have access to these data files, please consult the download documentation.
ID | File Type | Size | Located in | |
---|---|---|---|---|
EGAF50000311237 | bam | 221.3 GB | ||
EGAF50000311238 | bam | 423.4 GB | ||
EGAF50000311239 | bam | 250.4 GB | ||
EGAF50000311240 | bam | 73.7 GB | ||
EGAF50000311241 | bam | 252.9 GB | ||
EGAF50000311242 | bam | 312.9 GB | ||
EGAF50000311243 | bam | 211.3 GB | ||
EGAF50000311244 | bam | 401.0 GB | ||
EGAF50000311245 | bam | 413.3 GB | ||
EGAF50000311246 | bam | 256.5 GB | ||
EGAF50000311247 | bam | 370.4 GB | ||
EGAF50000311248 | bam | 323.8 GB | ||
EGAF50000311249 | bam | 222.8 GB | ||
EGAF50000311250 | bam | 422.5 GB | ||
EGAF50000311251 | bam | 354.4 GB | ||
EGAF50000311252 | bam | 286.2 GB | ||
EGAF50000311253 | bam | 288.9 GB | ||
EGAF50000311254 | bam | 205.0 GB | ||
EGAF50000311255 | bam | 303.5 GB | ||
EGAF50000311256 | bam | 104.5 GB | ||
EGAF50000311257 | bam | 303.2 GB | ||
EGAF50000311258 | bam | 364.0 GB | ||
EGAF50000311259 | bam | 270.4 GB | ||
EGAF50000311260 | bam | 392.2 GB | ||
EGAF50000311261 | bam | 1.3 TB | ||
EGAF50000311262 | bam | 294.5 GB | ||
EGAF50000311263 | bam | 282.4 GB | ||
EGAF50000311264 | bam | 332.0 GB | ||
EGAF50000311265 | bam | 403.7 GB | ||
EGAF50000311266 | bam | 381.3 GB | ||
EGAF50000311267 | bam | 373.2 GB | ||
EGAF50000311268 | bam | 96.6 GB | ||
EGAF50000311269 | bam | 237.8 GB | ||
EGAF50000311270 | bam | 345.4 GB | ||
EGAF50000311271 | bam | 96.8 GB | ||
EGAF50000311272 | bam | 343.4 GB | ||
EGAF50000311273 | bam | 110.3 GB | ||
EGAF50000311274 | bam | 285.9 GB | ||
EGAF50000311275 | bam | 397.3 GB | ||
EGAF50000311276 | bam | 385.0 GB | ||
EGAF50000311277 | bam | 294.6 GB | ||
EGAF50000311278 | bam | 366.7 GB | ||
EGAF50000311279 | bam | 632.4 GB | ||
EGAF50000311280 | bam | 226.2 GB | ||
EGAF50000311281 | bam | 281.1 GB | ||
EGAF50000311282 | bam | 2.3 TB | ||
EGAF50000311283 | bam | 429.0 GB | ||
EGAF50000311284 | bam | 387.6 GB | ||
EGAF50000311285 | bam | 718.4 GB | ||
EGAF50000311286 | bam | 1.5 TB | ||
EGAF50000311287 | bam | 257.4 GB | ||
EGAF50000311288 | bam | 412.6 GB | ||
EGAF50000311289 | bam | 368.0 GB | ||
EGAF50000311290 | bam | 374.0 GB | ||
EGAF50000311291 | bam | 518.7 GB | ||
EGAF50000311292 | bam | 259.7 GB | ||
EGAF50000311293 | bam | 392.3 GB | ||
EGAF50000311294 | bam | 380.6 GB | ||
EGAF50000311295 | bam | 104.6 GB | ||
EGAF50000311296 | bam | 106.2 GB | ||
EGAF50000311297 | bam | 448.9 GB | ||
EGAF50000311298 | bam | 411.3 GB | ||
EGAF50000311299 | bam | 363.9 GB | ||
EGAF50000311300 | bam | 130.1 GB | ||
EGAF50000311301 | bam | 405.8 GB | ||
EGAF50000311302 | bam | 404.7 GB | ||
EGAF50000311303 | bam | 375.2 GB | ||
EGAF50000311304 | bam | 398.6 GB | ||
EGAF50000311305 | bam | 651.0 GB | ||
EGAF50000311306 | bam | 409.0 GB | ||
EGAF50000311307 | bam | 420.3 GB | ||
EGAF50000311308 | bam | 107.3 GB | ||
EGAF50000311309 | bam | 214.1 GB | ||
EGAF50000311310 | bam | 1.6 TB | ||
EGAF50000311311 | bam | 363.8 GB | ||
EGAF50000311312 | bam | 390.1 GB | ||
EGAF50000311313 | bam | 91.6 GB | ||
EGAF50000311314 | bam | 126.7 GB | ||
EGAF50000311315 | bam | 381.0 GB | ||
EGAF50000311316 | bam | 397.4 GB | ||
EGAF50000311317 | bam | 82.4 GB | ||
EGAF50000311318 | bam | 329.2 GB | ||
EGAF50000311319 | bam | 110.8 GB | ||
EGAF50000311320 | bam | 319.7 GB | ||
EGAF50000311321 | bam | 428.4 GB | ||
EGAF50000311322 | bam | 121.3 GB | ||
EGAF50000311323 | bam | 222.7 GB | ||
EGAF50000311324 | bam | 256.8 GB | ||
EGAF50000311325 | bam | 680.3 GB | ||
EGAF50000311326 | bam | 105.6 GB | ||
EGAF50000311327 | bam | 279.2 GB | ||
EGAF50000311328 | bam | 415.5 GB | ||
EGAF50000311329 | bam | 380.5 GB | ||
93 Files (34.8 TB)
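Because individual files range from tens of gigabytes to over 2 TB, it can help to estimate storage needs before requesting a download. The following is a minimal sketch that parses rows in the layout of the file table above and sums their sizes. The function names are hypothetical, the three sample rows are copied from the table, and the decimal unit conversion (1 TB = 1000 GB) is an assumption, since the listing does not state which convention it uses.

```python
import re

# Assumption: sizes use decimal units (1 TB = 1000 GB); the listing does not say.
UNIT_TO_GB = {"MB": 0.001, "GB": 1.0, "TB": 1000.0}

# Matches rows such as "EGAF50000311237 | bam | 221.3 GB | ||".
ROW_PATTERN = re.compile(
    r"^(EGAF\d+)\s*\|\s*(\w+)\s*\|\s*([\d.]+)\s*(MB|GB|TB)\b"
)


def parse_file_rows(text):
    """Return (file_id, file_type, size_gb) tuples for each file row in text."""
    rows = []
    for line in text.splitlines():
        match = ROW_PATTERN.match(line.strip())
        if match is None:
            continue  # skip header, separator, and summary lines
        file_id, file_type, value, unit = match.groups()
        rows.append((file_id, file_type, float(value) * UNIT_TO_GB[unit]))
    return rows


def total_size_tb(rows):
    """Sum the parsed sizes and express the result in terabytes."""
    return sum(size_gb for _, _, size_gb in rows) / 1000.0


# Three rows copied from the table above, for illustration only.
sample = """\
ID | File Type | Size | Located in | |
---|---|---|---|---|
EGAF50000311237 | bam | 221.3 GB | ||
EGAF50000311240 | bam | 73.7 GB | ||
EGAF50000311261 | bam | 1.3 TB | ||
"""

rows = parse_file_rows(sample)
print(len(rows), round(total_size_tb(rows), 3))
```

Passing the full table text to `parse_file_rows` gives an estimate for the whole dataset. Since the listed sizes are rounded to one decimal place, the computed sum may differ slightly from the total shown on the page.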