DACs
EGAC00001001544
UAE Genomes Data access Committee
Contact Information
Habiba Alsafar
habiba.alsafar@ku.ac.ae
This DAC controls 2 datasets
Dataset ID: EGAD00010001886
Description: This dataset contains PLINK-processed (PED and MAP) genotype data from 1000 samples from the UAE, generated using the Illumina Omni5 Exome BeadChip.
Technology: Illumina
Samples: 1000

Dataset ID: EGAD50000001558
Description: The Emirati Genome Project (EGP) Variome comprises allele frequency data derived from 43,608 individuals sequenced as part of the national genome program in the United Arab Emirates. Samples were processed at the M42 EGP Facility and sequenced to a minimum of 30x coverage using Illumina NovaSeq 6000 short-read technology. Alignment and variant calling were performed using the DRAGEN pipeline (v3.9) against the GRCh38 reference genome. This dataset contains a total of 421,605,069 short variants (SNVs and indels), stored in VCF format. Each variant is annotated with population-level metrics in the INFO field, including AC (alternate allele count), AF (alternate allele frequency), RC (reference allele count), and RF (reference allele frequency). For convenience, VCF files are split by chromosome (chr1–22, X, Y), compressed with bgzip, and indexed with tabix.
Technology: Illumina NovaSeq 6000
Samples: 24
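
The description of EGAD50000001558 names four population-level keys in the VCF INFO field (AC, AF, RC, RF). A minimal sketch of reading them from a single VCF data line is shown below, using only the Python standard library. The function names, the example line, and all of its values are made up for illustration and are not taken from the dataset; the sketch also assumes one ALT allele per line, so each key holds a single value.

```python
def parse_info(info_field):
    """Parse a VCF INFO string such as 'AC=12;AF=0.000138' into a dict."""
    values = {}
    for entry in info_field.split(";"):
        if not entry or entry == ".":
            continue  # skip empty entries and the missing-value marker
        key, sep, raw = entry.partition("=")
        values[key] = raw if sep else True  # flag entries carry no value
    return values


def population_metrics(vcf_line):
    """Extract position, alleles, and AC/AF/RC/RF from one VCF data line.

    Assumption: a single ALT allele per line, so AC/AF/RC/RF are scalars.
    """
    fields = vcf_line.rstrip("\n").split("\t")
    info = parse_info(fields[7])  # INFO is the 8th tab-separated column
    return {
        "chrom": fields[0],
        "pos": int(fields[1]),
        "ref": fields[3],
        "alt": fields[4],
        "AC": int(info["AC"]),
        "AF": float(info["AF"]),
        "RC": int(info["RC"]),
        "RF": float(info["RF"]),
    }


# Hypothetical data line; the values are illustrative only.
example_line = "chr1\t10177\t.\tA\tAC\t.\tPASS\tAC=12;AF=0.000138;RC=87204;RF=0.999862"
metrics = population_metrics(example_line)
print(metrics)
```

Because bgzip output is gzip-compatible, the per-chromosome files could in principle be streamed line by line with Python's `gzip.open(path, "rt")`, skipping header lines that start with `#`. Random access by genomic region would need the tabix index and a tool that understands it.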