Neuropsychiatric and autoimmune disorders have substantial epidemiological correlations (Benros et al., JAMA Psychiatry 2013, PMID: 23760347) and broad, genome-wide patterns of shared genetic risk (Pouget et al., Hum Mol Genet, PMID: 31211845; Tylee et al., Am J Med Genet B Neuropsychiatr Genet 2018, PMID: 30325587). Some cases of psychosis involve autoantibodies against the NMDA receptor, AMPA receptor, or other synaptic proteins (reviewed in Dalmau et al, Lancet Neurol 2011, PMID: 21163445). The related autoimmune conditions lupus and Sjogren's Syndrome also involve the development of autoantibodies. The possibility that neuropsychiatric disorders can have molecular mechanisms in common with autoimmune disorders - for example, that psychosis involves an inflammatory or autoimmune component in some patients, or that immune molecules are re-used in the brain to underlie other important biological activities (Stevens et al., Cell 2007, PMID: 18083105) - could open novel therapeutic possibilities for neuropsychiatric disorders. At a genetic level, the strongest genetic associations of schizophrenia, lupus, and Sjogren's Syndrome to common genetic variation involve associations to genetic markers in the Major Histocompatibility Complex (MHC) locus. Bipolar disorder in some studies also associates with variation in or near the MHC locus, though less strongly than schizophrenia does. Intriguingly, the same specific SNPs appear to associate strongly with schizophrenia, lupus, and Sjogren's; these strongly associating SNPs span a genomic segment that includes the HLA class II genes (which have an important role in antibody production) and the complement component 4 (C4) genes. The specific genes and alleles responsible for these associations need to be completely defined, and the extent to which they represent shared or distinct genetic influences in neuropsychiatric and autoimmune illnesses needs to be clarified. The complement component 4 (C4A and C4B) genes are present in the MHC locus, between the class I and class II HLA genes. C4A and C4B commonly vary in genomic copy number and encode complement proteins with distinct affinities for molecular targets. The complex genetic variation at C4 - arising from many alleles with different numbers of C4A and C4B genes - has been challenging to analyze in large cohorts. We recently developed an approach to this problem based on imputation: people share long haplotypes with the same combinations of SNP and C4 alleles, such that C4A and C4B gene copy numbers can be imputed from SNP data (Sekar et al., Nature 2016, PMID: 26814963). In the current work, to analyze C4 in large cohorts, we developed a way to identify C4 alleles from whole-genome sequence (WGS) data, then analyzed WGS data from 1,234 individuals to create a large multi-ancestry panel of 2,530 reference haplotypes of MHC SNPs and C4 alleles that can then be imputed into still-larger cohorts for which SNP data are available. With this dbGaP submission, we make this reference panel available for other studies. Protocols and software for imputing C4 alleles into genome-wide SNP data, and for performing molecular analyses on the C4 genes (such as direct measurement of copy number from genomic DNA), can be found on the McCarroll Lab web site (http://mccarrolllab.org/resources). We are also working to create additional reference panels for imputation of C4 alleles that will be based on still-larger and more diverse population samples; links to these will also be available on the McCarroll Lab web site as we create and validate them.
Amebiasis is a common cause of diarrhea and is associated with malnutrition in grade-school aged children in an urban slum of Dhaka, Bangladesh. Field Studies of Human Immunity to Amebiasis in Bangladesh was designed to determine the contribution of amebiasis to illness in the first 2 years of life when most deaths due to diarrhea occur, and understand the immunologic and genetic factors that protect children from amebiasis. The hypothesis underlying the study is that susceptibility to amebiasis is determined by host innate and acquired immune responses that vary between individuals in part due to: human genetic polymorphisms; environmental influences including malnutrition and concurrent geohelminth infection; and virulence differences among Entamoeba histolytica genotypes. Specific aims proposed in the design of the study were to: a) Measure the incidence of amebiasis and correlate it with human and parasite genetic polymorphisms, immune responses, and environmental factors such as geohelminth infection and malnutrition; b) Test the hypothesis that protective immunity is mediated both by innate immune responses initiated via TLR stimulation as well as by mucosal IgA against the Gal/GalNAc lectin and systemic IFN-γ; c) Test for the association of common genetic polymorphisms in host innate and acquired immune genes with incidence of amebiasis. 629 newborn babies were enrolled and followed regularly through bi-weekly surveillance for diarrheal episodes, anthropometry at 3-month interval until 60 months of age. The infants that were consented for GWAS analysis were genotyped in 3 separate batches at different times, on 3 different arrays. Quality control was performed on the 3 separate data sets and then jointly after merging. Genetic data available on 447 infants together with their phenotype data is made available in this submission.
WES ON GASTRO-ESOPHAGEAL TUMORS TO IDENTIFY BIOMARKERS OF RESPONSE TO EGFR INHIBITION
DAC to regulate access to Olink targeted proteomics data in Chronic rhinosinusitis (CRS) patient cohort.
To identify the causative germline mutations of congenital macrothrombocytopenia, whole-exome study of 6 families (21 individuals) with autosomal dominant mode of transmission.
The goal of this study was to identify immune correlates of clinical protection against SARS-CoV-2 infection.
Here we sequenced biopsies from a patient with ALK-positive NSCLC progressing on first-line lorlatinib. This dataset contains one Exome-seq (the tumor and its normal). This aims to investigate the impact of the K1150dup mutation on lorlatinib progression and to explore potential therapeutic strategies to overcome resistance.
Bulk RNA sequencing was performed on mCRC organoids subjected to cetuximab treatment, comparing ATOH1 knockout and control conditions. The dataset is intended to elucidate the contribution of ATOH1-regulated secretory cell population to cetuximab persistence mechanisms.