SNV and indel calls from 8921 individuals in the British Autozygosity Populations BioResource dataset
This includes variant calls (single nucleotide variants and small insertions/deletions) from 8086 (mostly British Pakistani/British Bangladeshi) individuals from the following studies: 1. 5236 British Pakistani/British Bangladeshi adults from East London Genes and Health (ELGH) 2. 2624 British South Asian mothers from Born in Bradford (mostly Pakistani) (BiB) 3. 1061 British South Asian adults from Birmingham (mostly Pakistani) (Birm) All of the Birmingham and most of the Born in Bradford samples were previously sequenced as part of PMID: 26940866. In the sample list file, the columns of interest to most people will be: vcf.id - sample ID from the vcf cohort - which cohort they're in sex.assigned - sex inferred from coverage on the X and Y chromosomes. Individuals for whom this did not match their reported sex have been discarded total, chrX and chrY - coverage within bait regions across all chromosomes, chrX and chrY respectively Mapping was done with bwa-mem and variant calling was carried out with GATK HaplotypeCaller. We removed variant sites for which the following was true: SNPs: "QD < 2.0 || FS > 30 || MQ < 40.0 || MQRankSum < -12.5 || ReadPosRankSum < -8.0" Indels: "QD < 2.0 || FS > 30 || ReadPosRankSum < -20.0"
- DAC: EGAC00001000205
Wellcome Trust Sanger Institute Data Sharing Policy
Studies are experimental investigations of a particular phenomenon, e.g., case-control studies on a particular trait or cancer research projects reporting matching cancer normal genomes from patients.
Study ID | Study Title | Study Type |
---|---|---|
EGAS00001001565 | Other |
This table displays only public information pertaining to the files in the dataset. If you wish to access this dataset, please submit a request. If you already have access to these data files, please consult the download documentation.