Need Help?

TenK10K project

Contact Information

Dr Joseph E Powell
joseph.powell@unsw.edu.au

Request Access

This DAC controls 6 datasets

Dataset ID	Description	Technology	Samples
EGAD50000002377	SNP and indel variants were called for 1,925 samples from phase 1 of the TenK10K project. Variant calling from WGS alignments to the GRCh38 reference assembly was performed using GATK4 HaplotypeCaller in DRAGEN mode. These VCFs consist of common (>=1% minor allele frequency [MAF]) and rare (<1% MAF) variants . This dataset is comprised of autosomal variants provided as multisample compressed VCF format files. Principal component scores were derived by using gnomAD’s run_pca_with_relateds method.		1925
EGAD50000002378	Tandem repeat variants were called for 1,925 samples from phase 1 of the TenK10K project. Variant calling from WGS alignments to the GRCh38 reference assembly was completed with ExpansionHunter v5. This dataset is comprised of autosomal tandem repeat variants provided as multisample compressed VCF format files. SNV-derived principal component scores used in the manuscript (Tanudisastro et al.) are also provided in this dataset.		1925
EGAD50000002379	Single cell RNA-sequencing data for PBMCs from the Tenk10k Phase 1 cohort (1925 individuals post-QC). Libraries were prepared using the 10x Genomics 3’ Chromium Next GEM Single Cell HT V3.1 kit and sequenced on the NovaSeq 6000 platform. Reads were mapped to the GRCh38 reference genome with Cellranger, and count matrices were preprocessed using Scanpy.		1925
EGAD50000002466	Whole genome sequencing was performed for 1,925 samples from phase 1 of the TenK10K project. Sequencing was done with Illumina 2 x 150bp chemistry on a NovaSeq 6000 instrument to achieve mean 30x coverage. Sequence reads were aligned to the GRCh38 reference assembly with a fork of DRAGMAP v1.3.1. This dataset is comprised of 1,925 alignment files in cram format, and their corresponding index files.		1925
EGAD50000002517	Single cell RNA-sequencing data for PBMCs from the Tenk10k Phase 1 cohort (1925 individuals post-QC). Libraries were prepared using the 10x Genomics 3’ Chromium Next GEM Single Cell HT V3.1 kit and sequenced on the NovaSeq 6000 platform. This analysis consists of 4,768 fastq format files in 1,192 runs, and an analysis which provides the relationship of 298 library pool samples to 1,925 individual samples.	Illumina NovaSeq 6000	298
EGAD50000002682	Raw FASTQ files for single cell ATAC-sequencing data for PBMCs from the Tenk10k Phase 1 cohort (952 individuals pre-QC). Libraries were prepared using the 10x Genomics Chromium Next GEM Epi ATAC v2 kit and sequenced on the NovaSeq 6000 platform.	Illumina NovaSeq 6000	238