Even though whole genome sequence (WGS) data has been generated and published in many studies, much of this information is not yet processed for use in down-stream analysis. This study's aim is to provide 1342 WGS normal-tumor paired single nucleotide variations (SNP) over 18 different cancer types provided by The Cancer Genome Atlas (TCGA) project. Individual level data for TCGA can be accessed by requesting access for phs000178. This data includes variations within self-reported white and African-American populations. Variations that exist within the tumor tissue but are absent in associated normal organ tissue (as compared to the human reference genome) are reported. Data published includes SNP and small insertions and deletions, which were generated through a pipeline including the VarScan2 variant calling software.
Here, we performed genome-wide 5-mC profiling in plasma of metastatic ALK-rearranged non-small cell lung cancer (NSCLC) patients receiving tyrosine kinase inhibitor therapy. We established a strategy to identify ALK-specific 5-mC changes from cfDNA and demonstrated the suitability of the identified markers for cancer detection, prognosis and therapy monitoring. Longitudinal plasma samples (n = 79) of 21 ALK-positive NSCLC patients and 13 healthy donors were collected alongside 15 ALK-positive tumor tissue and 10 healthy lung tissue specimens. All plasma and tissue samples were analyzed by cell-free DNA methylation immunoprecipitation sequencing to generate genome-wide 5-mC profiles. Information on genomic alterations (i.e., somatic mutations/fusions and copy number alterations) determined in matched plasma samples were available from previous studies.
To understand intrinsic cancer cell signatures and the surrounding microenvironemt and their interactions, we performed single-cell RNA sequencing on 63,689 cells from 23 patients with 23 primary colorectal cancer and 10 matched normal mucosa samples. Analyzing of primary colorectal cancer and normal mucosa samples show a comprehensive cellular landscape of colon cancer, which is a valuable resource for the development of therapeutic strategies.
samples using Affymetrix HG_U133_+2
Study 1 2R01-NS050375 (PI: DOBYNS, William B.) The genetic basis of mid-hindbrain malformations Our general goal for this project is to advance our understanding of human developmental disorders that involve the brainstem and cerebellum - brain structures derived from the embryonic midbrain and hindbrain - that affect a minimum of 2.4 per 1000 resident births based on data from the CDC. Importantly, this large class of disorders co-occurs with more common developmental disorders such as autism, mental retardation and some forms of infantile epilepsy, and shares some of the same causes. With this renewal, we propose to expand the scope of our work beyond single phenotypes and genes to focus on delineating the critical phenotype spectra to which the most common MHM belong, and defining the underlying biological networks that are disrupted. To pursue these goals, we will use our large and growing cohort of human subjects to map additional MHM loci using SNP microarrays that provide both high-resolution autozygosity and linkage data in informative families as well as detect critical copy number variants in sporadic subjects. The causative genes will be identified using traditional Sanger or new high-throughput sequencing methods as appropriate abased on size of the critical region. We will use these and other known MHM causative genes to construct and revise model biological networks of genes and proteins, and test these genes and networks in additional patients as a candidate gene or more accurately a candidate network approach. These approaches need to be supported by ongoing active subject recruitment, as studies of comparable disorders such as mental retardation and autism have benefited from even larger numbers of subjects that we have so far collected. We need to use new high-throughput sequencing methods to more efficiently test larger critical regions, and to test entire gene networks rather than individual genes in matched cohorts of subjects. At every step; phenotype analysis, CNV analysis, model network construction and high-throughput sequencing, we will need expanded bioinformatics capabilities. Finally, we need to test the biological function of new genes and networks to support our gene identification studies. We expect that these studies will contribute immediately to more accurate diagnosis and counseling, and over time will lead to development of specific treatments for a subset of these disorders. We further expect that studies of mid-hindbrain development will have broad significance for human developmental disorders generally, providing compelling evidence for a connection between cerebellar development and other classes of developmental disorders such as autism, mental retardation and epilepsy. Study 2 R01-NS058721 (PI: DOBYNS, William B.) De novo copy number variation and gene discovery in human brain malformations Project Summary/Abstract The number of recognized brain malformations and syndromes has grown rapidly during the past several decades, yet relatively few causative genes have been identified, especially for three common malformations that have been associated with numerous cytogenetically visible chromosome deletions and duplications, and that often occur together: agenesis of the corpus callosum (ACC), cerebellar vermis hypoplasia (CVH) including Dandy-Walker malformation (DWM), and polymicrogyria (PMG). We propose to perform high-resolution array comparative genome hybridization (aCGH), emerging technology able to detect small copy number variants (CNV), in 700 probands with one or more of these three malformations. Our central hypothesis states that more than 10% of patients with ACC, CVH or PMG will have de novo CNV below the resolution of routine cytogenetic analysis, but detectable by current array platforms. We therefore expect to identify 70-100 patients with small CNV. We will distinguish CNV found in normal individuals from potentially disease-associated changes, and will confirm CNV using fluorescence in situ hybridization (FISH) and microsatellite (STRP) analysis. We will give highest priority to CNV that are de novo and involve 2 or more BACs, and secondary priority to familial and smaller CNV excluding known polymorphisms. After that, we will evaluate and rank candidate genes in the critical regions using information from public databases and our own expression studies, and perform mutation analysis of the best candidate genes from well-defined critical regions by sequencing in a large panel of subjects with phenotypes that match the phenotypes of the patients whose CNV define the critical regions. Here, we will use more refined criteria to supplement our clinical classification, such as the developmental level and presence of epilepsy or other birth defects. Any abnormalities found will be analyzed using existing data regarding polymorphisms (i.e. dbSNP), cross-species comparisons, and functional assays appropriate for the specific sequence change. Study 2A In 1995, we described a novel multiple congenital anomaly syndrome associated with facial dysmorphism (congenital ptosis, high arched eyebrows, shallow orbits, trigonocephaly), colobomas of the eyes, neuronal migration malformation (frontal predominant lissencephaly) and variable hearing loss. We hypothesized from de novo mutations and used trio-based exome sequencing to identify de novo mutations in the ACTB and ACTG1 genes. Study 2B In 1997 and 2004, we and others defined two novel developmental syndromes associated with markedly enlarged brain size, or megalencephaly, and other highly recognizable features. The megalencephaly-capillary malformation syndrome (MCAP) consists of megalencephaly and associated growth dysregulation with variable asymmetry, developmental vascular anomalies, distal limb malformations, variable cortical malformation, and a mild connective tissue dysplasia. The megalencephaly-polymicrogyria-polydactyly-hydrocephalus syndrome (MPPH) resembles MCAP but lacks vascular malformations and syndactyly. We hypothesized that MCAP and MPPH result from mutations - including postzygotic events - in the same pathway, and studied them together. Using a combination of exome sequencing, Sanger sequencing, restriction-enzyme assays, and targeted ultra-deep sequencing in 50 families with MCAP or MPPH, we identified de novo germline or postzygotic mutations in three core components of the phosphatidylinositol-3-kinase/AKT pathway. These include two mutations in AKT3, a recurrent mutation in PIK3R2, and multiple mostly postzygotic mutations in PIK3CA (Rivière JB, Mirzaa GM, O'Roak BJ, Beddaoui M, Alcantara D, Conway RL, St-Onge J, Schwartzentruber JA, Gripp KW, Nikkel SM, Worthylake T, Sullivan CT, Ward TR, Butler HE, Kramer NA, Albrecht B, Armour CM, Armstrong L, Caluseriu O, Cytrynbaum C, Drolet BA, Innes AM, Lauzon JL, Lin AE, Mancini GMS, Meschino WS, Reggin JD, Saggar AK, Lerman-Sagie T, Uyanik G, Weksberg R, Zirn B, Beaulieu CL, FORGE Canada Consortium, Majewski J, Bulman DE, O'Driscoll M, Shendure J, Graham Jr. JM, Boycott KM, Dobyns WB. De novo germline and postzygotic mutations in AKT3, PIK3R2 and PIK3CA cause a spectrum of related megalencephaly syndromes. Nat. Genet. In press). Study 3 2R01-NS046616 (PI: GOLDEN, Jeffrey A) The role of ARX in normal and abnormal brain development This subcontract from the Children's Hospital of Philadelphia to the University of Chicago (UC) is intended to support research studies of the ARX and functionally related genes in human subjects with any one of several specific developmental disorders. The Co-investigator at UC (W.B. Dobyns) will identify a series of patients with mental retardation and severe infantile epilepsy, some of whom will have specific brain malformations and others who will have normal brain structure by brain imaging studies, and collect research samples from these subjects with informed consent. The studies to be performed will include mutation analysis of ARX, mutation analysis of specific downstream target genes, X inactivation studies in humans and X inactivation studies in mutant mice. The results will be analyzed to determine the significance of any changes found in the gene.
Lymphoblastoid cell lines established using either wildtype or BALF5-deficient Epstein-Barr virus were analyzed by RNA sequencing.
Transcriptome sequencing for WEHI-AML-1 and WEHI-AML-2. RNA libraries were generated using the Illumina TruSeq RNA Sample Preparation Kit v2 and sequenced on an Illumina HiSeq2500.
Our current understanding of autism spectrum disorders (ASD) delineates a highly heritable, yet etiologically heterogeneous disease. Forward genetic approaches to find disease associated mutations or common variation have been successful and continue to offer considerable power. Yet, given the accumulating evidence for very significant heterogeneity and environmental influences, complementary approaches to classic forward genetics become necessary. Genetic polymorphism and mutation data to date have identified dozens of causal or contributory variants, yet our preliminary data from autism brain suggest that common molecular pathways are involved in a significant subset of cases. This convergence at the tissue level suggests that other mechanisms, specifically epigenetic changes, combined with genetic background, are contributing to such final common pathways. We further tested this hypothesis by taking a comprehensive and integrative genome-wide approach to assessing brain gene-expression, miRNA levels and the related, causal epigenetic mechanisms in ASD etiology. We performed RNA-seq analyses of four cerebral cortical regions and cerebellum from ASD cases and controls, to assess mRNA, miRNA, and splicing isoform regulation. In parallel, we identified key differences in chromatin state and DNA methylation across multiple brain regions in the same ASD and control individuals used in the expression analyses using ChIP-Seq and MeDIP. We assessed the mechanisms by which changes in DNA methylation, histone modification, and DNA sequence contribute to the observed differences in gene expression. This work, which represents an unprecedented effort to unify these often disparate data (usually produced without integration in mind), delineates potential shared molecular pathways in ASD and the underlying mechanism of these differences at the level of miRNA, the chromatin regulatory apparatus, and DNA methylation. The following substudies are part of the PsychENCODE release at dbGaP and offer additional molecular data: PsychENCODE: RNA-Sequencing - SRRM4 Splicing Study phs000872 PsychENCODE: Global Changes in Patterning, Splicing and lncRNAs phs001061 PsychENCODE: Chromatin Contact Map in Fetal Cortical Laminae phs001190 PsychENCODE: Epigenetic Dysregulation in Autism Spectrum Disorder phs001220
Based on the kinetic characteristics of single-molecule real-time sequencing, we implement a method to directly detect DNA modifications such as 5mC. However, there are room to further improve the accuracy of whole-genome-wide DNA modifications. Therefore, this study aims to utilize deep learning methods for the comprehensive profiling of multiple DNA modifications, including 5mC, 5hmC, and 6mA, using SMRT-seq.
To investigate the effect of cryopreservation on PBMCs, we conducted an experiment using samples from three healthy donors under four conditions: fresh PBMCs, PBMCs subjected to a single freeze–thaw cycle, PBMCs refrozen in freezing mix, and PBMCs refrozen in TRIzol. This resulted in a total of 12 samples (three replicates per condition). RNA was extracted from all PBMC samples using the Qiagen miRNeasy Mini Kit, followed by polyA RNA sequencing on the Illumina NovaSeq 6000 platform. Blood was collected from one male and two female adult donors. Sequencing yielded high-quality data, with an average of 27.8 million reads per sample and over 94% of bases exceeding a Q30 quality score.