Dataset

SNV and indel calls from 8086 individuals in the British Autozygosity Populations BioResource dataset

Dataset ID Technology Samples
EGAD00001004581 N/A N/A

Dataset Description

This includes variant calls (single nucleotide variants and small insertions/deletions) from 8086 (mostly British Pakistani/British Bangladeshi) individuals from the following studies:
1. 3781 British Pakistani/British Bangladeshi adults from East London Genes and Health
2. 2791 British South Asian mothers from Born in Bradford
3. 1428 British South Asian adults from Birmingham
4. 86 individuals (mixed ancestries) from families with rare diseases, from Queen Mary University London

All of the Birmingham and most of the Born in Bradford samples were previously sequenced as part of PMID: 26940866.

Mapping was done with bwa-mem and variant calling was carried out with GATK HaplotypeCaller. We removed variant sites for which the following was true:
SNPs: "QD < 2.0 || FS > 30 || MQ < 40.0 || MQRankSum < -12.5 || ReadPosRankSum < -8.0"
Indels: "QD < 2.0 || FS > 30 || ReadPosRankSum < -20.0"

Who controls access to this dataset

For each dataset that requires controlled access, there is a corresponding Data Access Committee (DAC) who determine access permissions. Access to actual data files is not managed by the EGA. If you need to request access to this data set, please contact:

Wellcome Trust Sanger Institute
Contact person: Data Sharing
Email: datasharing [at] sanger [dot] ac [dot] uk
Access information: https://www.sanger.ac.uk/legal/DAA/MasterController
More details: EGAC00001000205

Downloads

You don't have access to the download section.