Kids First: Whole Genome Sequencing in Structural Defects of the Neural Tube
Myelomeningocele (aka meningomyelocele, MM) is the most severe form of spina bifida, a neural tube defect (NTD) in humans and the most common CNS birth defect. MM is considered a genetically complex disease, and occurs in 3.72/10,000 live US birth, and is partly preventable with prenatal folate, but the genetic basis and the mechanisms by which folate work to reduce disease incidence remain obscure. MM is associated nearly uniformly with prenatal hydrocephalus and the Arnold-Chiari malformation, as well as paraplegia and lifelong neuromotor disability. The genes for several rare syndromic forms of NTDs are known, but the causes for the majority with sporadic MM remain unknown.
Despite the importance of MM, most previous research has been limited to targeted sequencing and association studies of folate metabolism genes, or very small-scale exome sequencing. We hypothesize that de novo mutations (DNMs) produce likely gene disrupting (LGD) events that contribution to MM risk. Using conservative estimates of between 50-100 recurrently mutated discoverable genes contributing to risk, and our preliminary data demonstrating an excess of LGD DNMs in MM compared with control individuals, we estimate that with a cohort size of 1000 trios, we should uncover between 5-20 new recurrently mutated genes underlying MM, with minimal false-discovery. With this in mind, we formed the Spina Bifida Sequencing Consortium, and established a platform for data and sample sharing. Preliminary analysis of our first batch of 100 trios analyzed by WGS from GMKF suggests a wealth of important gene mutations. We have embarked on a new recruitment effort of an additional cohort of 400 new simplex MM trios, in collaboration with the US Spina Bifida Association, consented trios to allow for data sharing, and have performed detailed sample quality control. We also have preliminary data that use of dried bloodspot DNA from trios performs comparably to whole blood DNA, so will be happy to swap saliva for bloodspot recruitment at NIH’s preference. This cohort is now half-way assembled, with the remaining cohort to be ascertained in the next 6 months. We have established a workflow for de novo SNP/INDEL/SV detection from WGS and have ample computer storage and nodes to see the project to completion. We also plan to continue recruitment into the future with the goal of 2000 trios in the next 5 years. We propose a detailed bioinformatics workflow to identify gene mutations within a statistical framework, considering detailed scRNA expression profiling from developing mammalian neural tube, and have developed a robust functional validation workflow using Xenopus and mouse gene targeting. Our project has the potential to uncover a host of causes for this most common of the CNS birth defects, paving the way for future breakthroughs in detection, treatment, and prevention.
- Type: Parent-Offspring Trios
- Archiver: The database of Genotypes and Phenotypes (dbGaP)