Human placenta samples from 52: 5 first trimester , 7 second trimester, and 40 term placenta. Data is uploaded as BAM files.
A deeper understanding of the pathological mechanisms of SARS-CoV-2 infection is required to combat COVID-19. Through this dataset, we analyze postmortem lung cells from patients that are infected/uninfected with SARS-CoV-2 with snRNA-seq.
TCRab sequencing was performed on viably frozen cells from 11 T-LGLL samples from 9 T-LGLL patients and 6 age-matched healthy samples. The raw data is available as fastq files.
This dataset contains the methylation sequencing data of 60 nonCancer and 70 colorectal cancer cfDNA samples. The methylation library is constructed by using NEBNext Enzymatic-seq Kit.
Single-cell RNA sequencing of 18 peripheral blood samples from six melanoma patients. The raw data is available as fastq files.
This dataset is derived from whole-transcriptome sequencing (RNA-seq) of RNA from 57 BCR-ABL1 lymphoblastic leukemias (53 diagnostic, 4 relapse).
The objective of the colonoscopy study is to carry out the 16s sequencing of colon biopsies and faecal samples provided in ExHiBITT study to compare potential fluctuations in the microbiota of different sites.
The Type 2 Diabetes (T2D) Genetic Exploration by Next-generation sequencing in Ethnic Samples (T2D-GENES) Consortium is a collaborative international effort to identify genes influencing susceptibility to type 2 diabetes in multiple ethnic groups using next generation sequencing. To fulfill this objective, T2D-GENES Consortium undertook two large sequencing studies, called T2D-GENES Projects 1 and 2. Project 1 has carried out whole exome sequencing of 12,940 individuals, 6,504 with T2D and 6,436 non-diabetic controls, equally divided among five continental ancestry groups: Europeans, East Asians, South Asians, Hispanic Americans, and African Americans. The goal of Project 1 is to identify all genetic variants in the complete coding regions of the genomes (i.e., whole exome) by sequencing, including rare variants. Project 2 (i.e., SAMAFS substudy 2) is a pedigree-based study designed to identify low frequency or rare variants influencing susceptibility to T2D, using whole genome sequence information from approximately 600 individuals in 20 Mexican American T2D-enriched pedigrees from San Antonio, Texas, augmented with family-based imputation into approximately 440 additional family members. The major objectives of Project 2 are to identify low frequency or rare variants in and around known common variant signals for T2D, as well as to find novel low frequency or rare variants influencing susceptibility to T2D. Both T2D-GENES Projects 1 and 2 involve the San Antonio Mexican American Family Studies (SAMAFS), which are composed of four San Antonio, Texas-based family studies: the San Antonio Family Heart Study (SAFHS), San Antonio Family Diabetes/Gallbladder Study (SAFDGS), Veterans Administration Genetic Epidemiology Study (VAGES), and Family Investigation of Nephropathy and Diabetes - San Antonio (FIND-SA) Component and its extension called the Extended FIND [E-FIND]. The SAFHS and SAFDGS began in 1991 and have followed participants with extensive clinical phenotyping related to T2D for over 20 years. The VAGES was initiated in 1994 and a large battery of T2D-related phenotypic data has been obtained from its participants. The FIND-SA began in 2000, a part of the multicenter FIND study which was designed to identify genetic determinants of diabetic kidney disease; data from its participants related to T2D were used for this project. Non-overlapping subsets of SAMAFS participants are part of the T2D-GENES Projects 1 and 2, henceforth referred to as SAMAFS substudies 1 and 2, respectively. The SAMAFS substudies 1 and 2 are part of one of the five awards funded by NIDDK under a cooperative agreement award mechanism, which is governed by the Steering Committee of the T2D-GENES Consortium. Since Project 1 relies on population based subsets of cases and controls, 491 unrelated participants are drawn from the four SAMAFS as part of the T2D-GENES Project 1 Mexican American sample (i.e., SAMAFS substudy 1). The whole exome sequencing was performed at the Broad Institute. For Project 2, 1048 individuals are drawn from two SAMAFS (SAFHS and SAFDGS), representing 20 large families for substudy 2. The substudy 2 strategy is to sequence approximately 600 individuals at an average of 50x coverage across the entire genome, then impute genome wide genotypes for about 440 additional family members. The 600 sequenced individuals are specifically chosen for their value in imputing sequence information into other family members. By studying large pedigrees, we expect to find multiple individuals carrying each genetic variant, even if this variant is very rare in the population at large. Thus, a pedigree-based approach provides an excellent opportunity for identifying rare novel variants influencing risk of T2D and quantitative variation in T2D-related phenotypes. The whole genome sequencing has been done commercially by Complete Genomics, Inc. (CGI). The available sample of 1,048 includes 5 sequenced individuals who do not belong to any of the 20 large pedigrees. The final family data of 1,043 individuals includes whole genome sequence data for 607 individuals. After quality control, 590 sequenced individuals provide data for family based imputation using Merlin linkage analysis software into approximately 440 additional family members for whom chip based genotypes are available to indicate which parental haplotype is transmitted. The complete SAMAFS data including phenotype, genotype, sequence, other T2D-related trait data utilized for Projects 1 and 2 are available. These data can readily be viewed by clicking on the substudy title shown below or in the box: "Substudies", located on the right hand side of this parent or top study page phs000847.v1.p1, titled T2D-GENES Consortium: San Antonio Mexican American Family Studies (SAMAFS). phs000849 T2D-GENES Project 1: San Antonio Mexican American Family Studies (SAMAFS), Substudy 1: Whole Exome Sequencing phs000462 T2D-GENES Project 2: San Antonio Mexican American Family Studies (SAMAFS), Substudy 2: Whole Genome Sequencing in Pedigrees
RNA sequencing (RNAseq) of peripheral blood lymphocytes was used to develop a means to assess immune function in a way that can be used in discovery science and applied to patients individually in clinical settings. The premise is that profiles of RNA present in immune cells is reflective of the combined influence of genetic and environmental variation on immune potential of individuals and that this potential can be tapped to understand human immunity in a variety of biological contexts. CD4+ cells were isolated from fresh whole blood via positive magnetic bead selection and cell lysates were prepared using Qiazol (QIAGEN) and stored at -80ºC for 3 to 8 months. RNA was extracted in batches for cDNA library preparation and RNA-Seq. For this study, we developed standard operating procedures for handling human blood samples and determined: a) the best way to enrich for CD4+ T cells from whole blood and yield high quality RNA, b) the sensitivity of this RNA profiling strategy, and c) the reproducibility of generated immune profiles from healthy subjects. We then developed bioinformatics processes to establish immune response signatures and immune response phenotypes within cohorts of individuals.
Genetic mutations causing disease may be inherited, newly acquired in parental gametes and present in the zygote or acquired as somatic events at some point in development after fertilization. The burden and localization of an acquired mutation depend on when the mutation arises. As is the case for inherited genetic variation, there is accumulating evidence that somatic mutations can lead to severe tissue-specific disease. Malformations of cortical development (MCD) represent a group of disorders characterized by a range of morphological and structural abnormalities of the cerebral cortex reflecting errors in embryonic cortical development. MCD are associated with intellectual disability, as well as refractory epilepsy, and may require the surgical removal of the affected tissue. Inherited gene mutations involved in neuronal development explain only a fraction of MCD cases. In this project we sought to evaluate the hypothesis that somatic mutations disrupt embryonic cortical development and are responsible for a substantial fraction of MCD. Using high coverage next generation sequencing of protein-coding regions in DNA extracted from abnormal brain tissue and unaffected tissue (leukocytes) from patients with three forms of MCD, we sought to identify somatic mutations within each patient with MCD. Exome sequence data from study participants are provided to facilitate research into the genetic underpinnings of MCD.