Identification of Modifiers of 22q11.2 Deletion Syndrome in Whole Genome Sequence - CIDR

This project aims to identify genetic modifiers of congenital heart disease (CHD), which occurs among 22q11.2 deletion syndrome (22q11.2DS; also known as DiGeorge syndrome/velo-cardio-facial syndrome) patients. In contrast to the prevalence of ~0.8% in the general population, about 60%-70% of 22q11.2DS patients have CHD, of which most are conotruncal heart defects, making 22q11.2DS a very ideal model to study the genetic underpins of CHD. The chromosome 22q11.2 region is susceptible to meiotic chromosome rearrangements leading to hemizygous deletions, because there are large blocks of low copy repeats, termed LCR22s, that are dispersed in the region. Non-allelic homologous recombination during meiosis can lead to specific-sized deletions. Approximately 90% have a typical hemizygous 3 million base pair (Mb) deletion between LCR22-A and -D, while some have nested deletions between LCR22-A and B or LCR22-A and C.

Our goal is to perform a case-control association study, in which cases have CHD, while controls have normal heart structures, all with 22q11.2DS. All individuals are unrelated and have been collected from collaborators in the US, Canada and Europe. All subjects received a clinical diagnosis of 22q11.2DS and signed informed consent to allow a blood sample to be collected for research. The DNA samples were de-identified in 2 steps to protect their privacy. We then determined deletion type using multiplex ligation-dependent probe amplification (MLPA). This information is provided for the great majority of subjects. We also obtained cardiac phenotype information from echocardiogram reports as well as reported race and ethnicity (Hispanic vs. non-Hispanic information), which is required for the analysis.

We will obtain 30X whole-genome sequence data from NIH CIDR. We will first annotate the sequence variants identified to uncover rare (<1%) variants that are potentially damaging. We have chosen a gene set approach because of the relatively small sample size of the study. Gene sets will include those previously used to study the genetics of sporadic CHD in the general population (PMID:31624253). The first analysis will focus on coding variants and we will then follow up on examining non-coding variants. This will be followed by other statistical approaches and we will compare findings to that published for sporadic CHD in the general population. Our hypothesis is that some modifiers of CHD in 22q11.2DS might be risk factors for sporadic CHD. We will also use this data as an independent replication to previous whole-genome sequence data that was obtained on 22q11.2DS subjects.

For genotype data, we will provide the raw fastq data from whole-genome sequencing as well as the called variants file in a VCF file. For phenotype data, we will provide the sample relationship files (ped files), CHD affection status, deletion type, and race.

Type: Case-Control
Archiver: The database of Genotypes and Phenotypes (dbGaP)