Need Help?

Emirati Genome Project Population Variome (MAF Table)

The Emirati Genome Project (EGP) Variome comprises allele frequency data derived from 43,608 individuals sequenced as part of the national genome program in the United Arab Emirates. Samples were processed at the M42 EGP Facility and sequenced to a minimum of 30x coverage using Illumina NovaSeq 6000 short-read technology. Variant calling and alignment were performed using the DRAGEN pipeline (v3.9) against the GRCh38 reference genome. This dataset contains a total of 421,605,069 short variants (SNVs and indels), stored in VCF format. Each variant is annotated with population-level metrics in the INFO field, including: AC (alternate allele count) AF (alternate allele frequency) RC (reference allele count) RF (reference allele frequency) For convenience, VCF files are split by chromosome (chr1–22, X, Y), compressed and indexed using bgzip and tabix.

Request Access

Data access policy for UAE Genomes data.

Submissions for data access is granted upon written request for research purposes (non-commercial) use only. Bona fide, academic users can apply for data access under the following conditions: 1. Data is provided for non-clinical, non-commercial research purposes only. 2. Data is not distributed to any other individual or entity without the UAE Genomes Data access Committee's permission. 3. Data present is experimental in nature, and must not be used to make any clinical decisions. 4. Data that you are accessing is done so with no warranties, expressed or implied, and employees or agents of Khalifa University of Science and Technology have no liability in connection with its use. If you agree with the conditions above, please apply for access by email indicating your consent.

Studies are experimental investigations of a particular phenomenon, e.g., case-control studies on a particular trait or cancer research projects reporting matching cancer normal genomes from patients.

Study ID Study Title Study Type
EGAS50000001071 Population Genomics

This table displays only public information pertaining to the files in the dataset. If you wish to access this dataset, please submit a request. If you already have access to these data files, please consult the download documentation.

ID File Type Size Quality Report
Located in
EGAF50000384237 tbi 184.3 kB
EGAF50000384238 vcf.gz 238.6 MB
EGAF50000384239 tbi 188.0 kB
EGAF50000384240 vcf.gz 253.3 MB
EGAF50000384241 tbi 157.6 kB
EGAF50000384242 vcf.gz 190.5 MB
EGAF50000384243 tbi 150.7 kB
EGAF50000384244 vcf.gz 183.9 MB
EGAF50000384245 tbi 141.7 kB
EGAF50000384246 vcf.gz 167.8 MB
EGAF50000384247 tbi 134.4 kB
EGAF50000384248 vcf.gz 157.9 MB
EGAF50000384249 tbi 127.4 kB
EGAF50000384250 vcf.gz 156.1 MB
EGAF50000384251 tbi 114.4 kB
EGAF50000384252 vcf.gz 189.2 MB
EGAF50000384253 tbi 97.7 kB
EGAF50000384254 vcf.gz 164.9 MB
EGAF50000384255 tbi 105.1 kB
EGAF50000384256 vcf.gz 145.0 MB
EGAF50000384257 tbi 108.0 kB
EGAF50000384258 vcf.gz 153.5 MB
EGAF50000384259 tbi 104.6 kB
EGAF50000384260 vcf.gz 176.9 MB
EGAF50000384261 tbi 79.4 kB
EGAF50000384262 vcf.gz 78.3 MB
EGAF50000384263 tbi 71.3 kB
EGAF50000384264 vcf.gz 107.8 MB
EGAF50000384265 tbi 68.1 kB
EGAF50000384266 vcf.gz 103.0 MB
EGAF50000384267 tbi 64.0 kB
EGAF50000384268 vcf.gz 111.3 MB
EGAF50000384269 tbi 65.3 kB
EGAF50000384270 vcf.gz 97.1 MB
EGAF50000384271 tbi 65.8 kB
EGAF50000384272 vcf.gz 127.0 MB
EGAF50000384273 tbi 42.7 kB
EGAF50000384274 vcf.gz 62.9 MB
EGAF50000384275 tbi 50.5 kB
EGAF50000384276 vcf.gz 44.5 MB
EGAF50000384277 tbi 29.5 kB
EGAF50000384278 vcf.gz 87.5 MB
EGAF50000384279 tbi 27.8 kB
EGAF50000384280 vcf.gz 48.0 MB
EGAF50000384281 tbi 123.8 kB
EGAF50000384282 vcf.gz 23.3 MB
EGAF50000384283 tbi 15.4 kB
EGAF50000384284 vcf.gz 133.7 MB
48 Files (3.2 GB)