Login
Register
Need Help?
ABOUT
ABOUT THE EGA
EGA
Privacy Notice
Security
Team
STATISTICS
Bibliography
Growth
Community
Archive
Distribution
Catalogue
PROJECTS AND FUNDERS
Projects
Funders
GA4GH
Federated EGA
Beacon
DISCOVERY
CATALOGUE
Studies
Datasets
DACs
Synthetic Data
METADATA
Search Box
Public Metadata API
SUBMISSION
DATA
File preparation
Uploading files
METADATA
EGA Schema
Sequencing & Phenotype
Submitter Portal
Submitter Portal API
Array
Programmatic Submission XML
ACCESS
DATA ACCESS COMMITTEE
What is a DAC?
Best Practices
DAC Portal
Data Use Conditions
REQUEST DATA
How to request data?
Quality Control Reports
DOWNLOAD
Metadata
Files
PyEGA3
Live Outbox
Visualisation
FUSE Client
EGA QuickView
Tips on how to search
DACs
EGAC50000000931
TenK10K project
Contact Information
Mr Blake Bowen
b.bowen@garvan.org.au
Request Access
This DAC controls 4 datasets
Dataset ID
Description
Technology
Samples
EGAD50000002377
SNP and indel variants were called for 1,925 samples from phase 1 of the TenK10K project. Variant calling from WGS alignments to the GRCh38 reference assembly was performed using GATK4 HaplotypeCaller in DRAGEN mode. These VCFs consist of common (>=1% minor allele frequency [MAF]) and rare (<1% MAF) variants . This dataset is comprised of autosomal variants provided as multisample compressed VCF format files. Principal component scores were derived by using gnomAD’s run_pca_with_relateds method.
1925
EGAD50000002378
Tandem repeat variants were called for 1,925 samples from phase 1 of the TenK10K project. Variant calling from WGS alignments to the GRCh38 reference assembly was completed with ExpansionHunter v5. This dataset is comprised of autosomal tandem repeat variants provided as multisample compressed VCF format files. SNV-derived principal component scores used in the manuscript (Tanudisastro et al.) are also provided in this dataset.
1925
EGAD50000002379
Single cell RNA-sequencing data for PBMCs from the Tenk10k Phase 1 cohort (1925 individuals post-QC). Libraries were prepared using the 10x Genomics 3’ Chromium Next GEM Single Cell HT V3.1 kit and sequenced on the NovaSeq 6000 platform. Reads were mapped to the GRCh38 reference genome with Cellranger, and count matrices were preprocessed using Scanpy.
1925
EGAD50000002466
Whole genome sequencing was performed for 1,925 samples from phase 1 of the TenK10K project. Sequencing was done with Illumina 2 x 150bp chemistry on a NovaSeq 6000 instrument to achieve mean 30x coverage. Sequence reads were aligned to the GRCh38 reference assembly with a fork of DRAGMAP v1.3.1. This dataset is comprised of 1,925 alignment files in cram format, and their corresponding index files.
1925