WTCCC genome-wide case-control association study for Type 1 Diabetes (T1D) using the 1958 British Birth Cohort and the UK National Blood Service collections as controls.
WTCCC genome-wide case-control association study for Type 2 Diabetes (T2D) using the 1958 British Birth Cohort and the UK National Blood Service collections as controls.
WTCCC genome-wide case-control association study for Bipolar disorder (CAD) using the 1958 British Birth Cohort and the UK National Blood Service collections as controls.
Neuropsychiatric and autoimmune disorders have substantial epidemiological correlations (Benros et al., JAMA Psychiatry 2013, PMID: 23760347) and broad, genome-wide patterns of shared genetic risk (Pouget et al., Hum Mol Genet, PMID: 31211845; Tylee et al., Am J Med Genet B Neuropsychiatr Genet 2018, PMID: 30325587). Some cases of psychosis involve autoantibodies against the NMDA receptor, AMPA receptor, or other synaptic proteins (reviewed in Dalmau et al, Lancet Neurol 2011, PMID: 21163445). The related autoimmune conditions lupus and Sjogren's Syndrome also involve the development of autoantibodies. The possibility that neuropsychiatric disorders can have molecular mechanisms in common with autoimmune disorders - for example, that psychosis involves an inflammatory or autoimmune component in some patients, or that immune molecules are re-used in the brain to underlie other important biological activities (Stevens et al., Cell 2007, PMID: 18083105) - could open novel therapeutic possibilities for neuropsychiatric disorders. At a genetic level, the strongest genetic associations of schizophrenia, lupus, and Sjogren's Syndrome to common genetic variation involve associations to genetic markers in the Major Histocompatibility Complex (MHC) locus. Bipolar disorder in some studies also associates with variation in or near the MHC locus, though less strongly than schizophrenia does. Intriguingly, the same specific SNPs appear to associate strongly with schizophrenia, lupus, and Sjogren's; these strongly associating SNPs span a genomic segment that includes the HLA class II genes (which have an important role in antibody production) and the complement component 4 (C4) genes. The specific genes and alleles responsible for these associations need to be completely defined, and the extent to which they represent shared or distinct genetic influences in neuropsychiatric and autoimmune illnesses needs to be clarified. The complement component 4 (C4A and C4B) genes are present in the MHC locus, between the class I and class II HLA genes. C4A and C4B commonly vary in genomic copy number and encode complement proteins with distinct affinities for molecular targets. The complex genetic variation at C4 - arising from many alleles with different numbers of C4A and C4B genes - has been challenging to analyze in large cohorts. We recently developed an approach to this problem based on imputation: people share long haplotypes with the same combinations of SNP and C4 alleles, such that C4A and C4B gene copy numbers can be imputed from SNP data (Sekar et al., Nature 2016, PMID: 26814963). In the current work, to analyze C4 in large cohorts, we developed a way to identify C4 alleles from whole-genome sequence (WGS) data, then analyzed WGS data from 1,234 individuals to create a large multi-ancestry panel of 2,530 reference haplotypes of MHC SNPs and C4 alleles that can then be imputed into still-larger cohorts for which SNP data are available. With this dbGaP submission, we make this reference panel available for other studies. Protocols and software for imputing C4 alleles into genome-wide SNP data, and for performing molecular analyses on the C4 genes (such as direct measurement of copy number from genomic DNA), can be found on the McCarroll Lab web site (http://mccarrolllab.org/resources). We are also working to create additional reference panels for imputation of C4 alleles that will be based on still-larger and more diverse population samples; links to these will also be available on the McCarroll Lab web site as we create and validate them.
WTCCC genome-wide case-control association study using two disease collections together with the 1958 British Birth Cohort and the UK National Blood Service collections as controls.
This is a comprehensive genomic analysis of a rare set of secondary histiocytic/dendritic sarcomas from the consult service of the Hematopathology section of the Laboratory of Pathology, NCI, NIH.
WTCCC2 project genome-wide case-control association study for Ulcerative Colitis (UC) using the 1958 British Birth Cohort and the UK National Blood Service collections as controls.
WTCCC genome-wide case-control association study using three disease collections together with the 1958 British Birth Cohort and the UK National Blood Service collections as controls.
Somatic RNA for 40 samples matched to the WGS was extracted using the Qiagen Qiasymphony RNA protcol (cat no 931636). The tissue was initially homogenised using a Qiagen Bioruptor, followed by the manufacturers recommended protocol (including DNase digestion). The resulting RNA the underwent quality control as follows: firstly, A260 and A280nm were measured on a Denovix DS-11 Fx to qualitatively illustrate A260/280nm and A260/230nm ratios as measures of RNA purity. A260/280 had to be 2.0 and A260/230 had to be 2.0-2.2. Then RNA was quantified using LifeTechnologies Qubit RNA BR kit (cat no Q10210). RNAseq was carried out by the Edinburgh Clinical Research Facility on an Illumina NExtSeq500. Total RNA samples were assessed on the Agilent Bioanalyser (Agilent Technologies, #G2939AA) with the RNA 6000 Nano Kit (#5067-1512) for quality and integrity of total RNA, and then quantified using the Qubit 2.0 Fluorometer (Thermo Fisher Scientific Inc, #Q32866) and the Qubit RNA HS assay kit (#Q32855). Libraries were prepared from total-RNA sample using the NEBNext Ultra 2 Directional RNA library prep kit for Illumina (#E7760S) with the NEBNext rRNA Depletion kit (#E6310) according to the provided protocol. 400ng of totalRNA was then added to the ribosomal RNA (rRNA) depletion reaction using the NEBNext rRNA depletion kit (Human/mouse/rat) (#E6310). This step uses specific probes that bind to the rRNA in order to cleave it. rRNA-depleted RNA was then DNase treated and purified using Agencourt RNAClean XP beads (Beckman Coulter Inc, #66514). RNA was then fragmented using random primers before undergoing first strand and second strand synthesis to create cDNA. cDNA was end repaired before ligation of sequencing adapters, and libraries were enriched by PCR using the NEBNext Multiplex oligos for Illumina set 1 and 2 (#E7500). Final libraries had an average peak size of 271bp. Libraries were quantified by fluorometry using the Qubit dsDNA HS assay and assessed for quality and fragment size using the Agilent Bioanalyser with the DNA HS Kit (#5067-4626). Sequencing was performed using the NextSeq 500/550 High-Output v2 (150 cycle) Kit (# FC- 404-2002) on the NextSeq 550 platform (Illumina Inc, #SY-415-1002). Libraries were combined in an equimolar pool based on the library quantification results and run across 5 High-Output Flow Cell v2.5.