Phenotype Risk Scores Identify Patients with Unrecognized Mendelian Disease Patterns
This study was designed to assess the phenotypic effects of rare variants. Rare variants are difficult to study in a high-throughput manner because most cohorts are underpowered to detect associations. In order to gain power to test rare variants, we use Phenotype Risk Scores (PheRS) based on features of Mendelian diseases. PheRS is calculated using claims data from an EHR that is mapped to clinical features from OMIM's clinical descriptions of Mendelian disease. We calculated PheRS for 1,204 Mendelians diseases in a cohort of 21,701 individuals genotyped on the HumanExome BeadChip. We then tested for association between these diseases and rare variants in causal genes.
A phenotype risk score is calculated as the weighted sum of features for a given disease. The features are defined as a set of phecodes (consolidated ICD codes) associated with a particular Mendelian disease as described in OMIM. The weights for each phecode are calculated as the log inverse prevalence of the phecode in our cohort. For an individual, their score equals the sum of the weights for each phecode that is present in their medical record.
Our study was based on a cohort of individuals with genotype data linked to de-identified electronic health records (EHR) from Vanderbilt's BioVU resource. We hope that this method will help generate phenotype-genotype correlation data on rare variants which in turn may inform rare variant interpretations use for diagnostics.
- Type: Cohort
- Archiver: The database of Genotypes and Phenotypes (dbGaP)