The dataset includes raw RNA-seq data (fastq files) for miRNAs of monocytes before and after 6-hour exposure to four different immune stimuli, measured in 200 African- and European-descent healthy donors from Belgium. The stimuli include ligands for TLR4 (LPS), TLR1/2 (Pam3CSK4) and TLR7/8 (R848) and to a human seasonal influenza A virus (IAV).
To elucidate the timing and mechanism of the clonal expansion of somatic mutations in cancer-associated genes in the normal endometrium, we conducted target sequencing of 112 genes for 1,298 endometrial glands and matched blood samples from 36 women. By collecting endometrial glands from different parts of the endometrium, we showed that multiple glands with the same somatic mutations occupied substantial areas of the endometrium. The 112 genes are as follows: ABCC1, ACRC, ANK3, ARHGAP35, ARID1A, ARID5B, ATCAY, ATM, ATR, BARD1, BCOR, BRCA1, BRCA2, BRD4, BRIP1, CAMTA1, CDC23, CDYL, CFAP54, CHD4, CHEK1, CHEK2, CTCF, CTNNB1, CUX1, DGKA, DISP2, DYNC2H1, EMSY, FAAP24, FAM135B, FAM175A, FAM65C, FANCA, FANCB, FANCC, FANCD2, FANCE, FANCF, FANCG, FANCI, FANCL, FANCM, FAT1, FAT3, FBN2, FBXW7, FGFR2, FRG1, GPR50, HEATR1, HIST1H4B, HNRNPCL1, HOOK3, KIAA1109, KIF26A, KMT2B, KMT2C, KRAS, LAMA2, LRP1B, MLH1, MON2, MRE11A, MSH2, MSH6, MTOR, NBN, PALB2, PHEX, PIK3CA, PIK3R1, PLXNB2, PLXND1, PMS2, POLE, POLR3B, PPP2R1A, PTEN, PTPN13, RAD50, RAD51, RAD51B, RAD51C, RAD51D, RAD52, RAD54B, RAD54L, RICTOR, SACS, SIGLEC9, SLC19A1, SLX4, SPEG, STT3A, TAF1, TAF2, TAS2R31, TFAP2C, TNC, TONSL, TP53, TTC6, UBA7, VNN1, WT1, XIRP2, ZBED6, ZC3H13, ZFHX3, ZFHX4, ZMYM4.
This study combined whole exome (103 samples) and whole genome (22 samples) sequencing over a total of 108 breast tumors and matched normal DNA to identify novel mutations and translocations. Samples were subjected to paired-end Illumina sequencing with goal of 30x coverage of tumor/normal for whole genomes and 100x tumor/normal coverage for whole exomes. From these sequences, we used various computational techniques to identify somatic point mutations, insertion/deletions and structural rearrangements in these tumors. From these data, we identified new insights into the rates of background mutations in these cancers, novel recurrent mutated genes, and multiple gene rearrangements. One of these rearrangements appears to be a recurrent event in breast cancer.
The Epilepsy Genetics Initiative (EGI) was created in 2014 with the goal of amassing a database of clinically-generated exome sequence data for patients with epilepsy who have had diagnostic whole exome sequencing. Most of the subjects enrolled in the study did not receive a secure genetic diagnosis from their initial sequencing. Following participant/family consent, EGI transfers raw data from the respective clinical lab to the Institute for Genomic Medicine at Columbia University Medical Center. The strategy of the initiative has been to carry out systematic reanalysis of data to identify new diagnoses that were not possible or missed at the time of initial sequencing and to aid in novel gene discovery in epilepsy.
Samples, in a form of PAXgene fixed and paraffin-embedded biopsies, were collected from the multi-site, double-blind, randomized, placebo-controlled trial, aimed at dose-finding and assessing the efficacy and tolerability of a 6-week treatment with ZED1227 capsules vs. placebo in subjects with well-controlled celiac disease undergoing gluten challenge. Total RNA was extracted from the PaxFPE biopsy specimens (n = 116) using additional cuttings from the samples on which histomorphometry was previously assessed. For the extraction, an RNeasy Kit (Qiagen, Hilden, Germany) was used according to the manufacturer’s instructions. Library preparation and next-generation sequencing (NGS) were performed by the Qiagen NGS Service. A total of 10 ng of purified RNA was converted into cDNA NGS libraries. Library preparation was quality controlled using capillary electrophoresis. Based on the quality of the inserts and the concentration measurements, the libraries were pooled in equimolar ratios and then sequenced on a NextSeq (Illumina Inc., San Diego, USA) sequencing instrument according to the manufacturer’s instructions, with 100 bp read length for read 1 and 27bp for read 2. The raw data were de-multiplexed, and FASTQ files for each sample were generated using bcl2fastq2 software (Illumina Inc., San Diego, USA).
Mother and infant pairs were enrolled in the Rhode Island Child Health Study (RICHS) at the Woman and Infant's Hospital from 2009 through 2014. This population consists of singleton, term infants (≥37 weeks gestation) born without serious pregnancy complications or congenital or chromosomal abnormalities. Given a priority interest to study fetal growth, the RICHS population was oversampled for both large for gestational age (LGA, >90% 2013 Fenton Growth Curve) and small for gestational age (SGA, <10% 2013 Fenton Growth Curve) infants.
T cells from five healthy human donors were stimulated with a peptide pool (CEFX, JPT) and anti-CD28/CD49d co-stimulation. Cells were then sorted based on activation (OX40 and PD-L1 for CD4s, CD137 for CD8s) and labeled with corresponding hashing antibodies prior to sequencing. In total, this dataset encompasses 42,370 cells (12,743 activation-induced marker (AIM)-positive, 15,369 AIM-negative, 14,258 Mock) for CITE-sequencing and TCR-sequencing of sorted activated, unactivated, and mock T cells derived from blood.
In this study, we explore the potential of classifying pediatric brain tumors based on methylation profiling of the cell-free DNA in cerebrospinal fluid (CSF). For this proof-of-concept study, we collected 20 cerebrospinal fluid samples of pediatric brain cancer patients via a ventricular drain placed for reasons of increased intracranial pressure. For 11 patients in this study we collected matched tumor DNA. This cohort contains fastQ files of cfRRBS data of these samples.
This dataset includes all data produced in the study describing "scEC&T-seq", a method for parallel sequencing of extrachromosomal circular DNA and transcriptome in single cells. This dataset includes: - Illumina scEC&T-seq Circle-seq data (scCircle-seq) for a total of 626 single cells / nuclei - bam files - Illumina scEC&T-seq RNA-seq data (scRNA-seq-Illumina) for the same single cells / nuclei - bam files - Nanopore scCircle-seq data for 18 single cells - bam files - Nanopore bulk WGS for 2 cell lines and 2 primary tumor samples - bam files - Illumina bulk WGS for 2 cell lines - bam files - Illumina bulk Circle-seq data from 1 cell line - bam file - Illumina ChIP-seq H3K27me3 data from 1 cell line - fasta files + peaks bed file + coverage bw file