Next Generation Sequencing in an IBD Pedigree Whole Genome Data

Dataset ID: EGAD00001000399
Technology: Illumina HiSeq 2000
Samples: 8

Dataset Description

In 2009 we identified a four-generation family with over 700 members, 41 of them affected by Crohn's disease (CD). At the time we sequenced the exomes of 6 affected individuals but did not identify any coding variants that appear to explain the high prevalence of disease. Since then we have collected DNA from a large number of additional family members, genotyped the entire family on linkage arrays to refine the genomic regions shared identical by descent, and genotyped affected and unaffected members at known CD risk loci identified by genome-wide association studies (GWAS). These analyses have confirmed that a significant unexplained excess of disease remains after accounting for all known genetic factors, and that several regions of the genome are shared by a large fraction of affected individuals. We therefore perform whole-genome sequencing of 8 individuals, which will allow us to impute the complete sequence of nearly all members of the two largest and most severely affected branches of the family.

Who controls access to this dataset

For each dataset that requires controlled access, there is a corresponding Data Access Committee (DAC) that determines access permissions. Access to the actual data files is not managed by the EGA. If you need to request access to this dataset, please contact:

Wellcome Trust Sanger Institute
Contact person: Data Sharing
Email: datasharing [at] sanger [dot] ac [dot] uk
More details: EGAC00001000205

