Genotype data for 140 present-day individuals from five populations in Pakistan in The first horse herders and the impact of early Bronze Age steppe expansions into Asia DOI: 10.1126/science.aar7711. Sampling details are presented in supplementary section S2.1 Data generation
We have genotyped 140 individuals from 5 populations in Pakistan, using the OmniExpressExome-8 array, which includes approximately one million autosomal SNP markers.
Paired end shallow whole genome sequencing (sWGS) data for the identification of somatic copy number alterations (SCNA) and the estimation of tumor fractions in plasma DNA of renal cell carcinoma (RCC) patients (MonRec Cohort)
RNA-Seq data of 25 HNSCC specimens from patients treated at The Netherlands Cancer Institute, Amsterdam and enrolled in the ARTFORCE trial. HNSCC biopsy samples obtained prior to treatment (chemo-radiotherapy) were used for polyA mRNA sequencing.
The dataset contains raw miRNA sequencing data of plasma samples from 20 newly diagnosed colorectal cancer cases and 20 controls free of colorectal neoplasms matched by age and sex. It includes files in the FASTQ compressed (.gz) format.
Four PAIRED WGS samples, tumor and control, were sequenced on a HiSeq X Ten and the library preparation kit used was Illumina TruSeq Nano DNA. The tumor was multiple myeloma from bone marrow.
The dataset contains RNAseq profiles of 386 patients from the CheckMate-577 (CA209-577) clinical trial whose ICF allows data deposition into a public repository. The Allprep DNA/RNA FFPE kit was used to simultaneously purify genomic DNA and total RNA from formalin-fixed, paraffin embedded (FFPE) tissue sections. RNA was analyzed using the Illumina TruSeq RNA Access method for library preparation, followed by sequencing on the Illumina NovaSeq platform with a 50bp paired-end strategy and a target read depth of 50M per sample. Fastq files are included.
PDX samples of breast cancer IDC mouse models created from patient tumor material of the NKI. Genomescan prepared the samples according to the procedure for Hybridization Capture using an Agilent SureSelect custom 0.5-2.9Mb kit. The prepared libraries were sequenced with Illumina sequencing technology. The samples are in fastq format and consist of the following PDX samples: IDC025, IDC026, IDC029, IDC031, IDC032, IDC038, IDC057, IDC062, IDC065, IDC069, IDC072, IDC090B, IDC092, IDC097, IDC099, IDC107, IDC113, IDC117, IDC143, IDC152, IDC159A, IDC159B, IDC180, IDC186, IDC192, IDC197, IDC198, IDC207, IDC209, IDC216, IDC218, IDC222, IDC229, IDC232, IDC274, IDC282, IDC290, IDC299, IDC307, IDC338, IDC344, IDC346, ILC006, ILC012, ILC083 and ILC248.
The NCI Experimental Therapeutics Clinical Trials Network (ETCTN)/Cancer Therapy Evaluation Program (CTEP) 10026 study (NCT02890329) tested combined ipilimumab and decitabine treatment in two cohorts: Arm A consisted of patients with relapsed acute myeloid leukemia (AML) after allogeneic hematopoietic stem cell transplantation. Arm B consisted of patients with advanced myelodysplastic syndrome (MDS) or AML that did not previously undergo stem cell transplantation. During the study, peripheral blood and bone marrow samples were obtained from study participants at study entry (screening), following the first lead-in cycle of decitabine monotherapy, during combined treatment with decitabine and ipilimumab, and at the end of treatment.This study tested the combination of the CTLA-4 blocking antibody ipilimumab and the DNA methyltransferase inhibitor decitabine in advanced myeloid disease in the transplant-naive and post-transplant context. The samples were used for bulk and single cell RNA sequencing (scRNA-seq) as well as whole exome sequencing (WES).
In 2009 we identified a four-generation family with over 700 members and 41 affected with Crohn's disease (CD). At the time we sequenced the exome of 6 affected individuals but did not identify any coding variants which appear to explain the high prevalence of disease. Since then we have collected DNA from a large number of additional family members, genotyped linkage arrays on the entire family to refine genomic regions shared by identity by descent and genotyped affected and unaffected members at known CD risk loci identified by Genome Wide Association Studies (GWAS). These analyses have confirmed that a significant unexplained excess of disease remains after accounting for all known genetic factors, and that several regions of the genome are shared by a large fraction of affected individuals. We therefore perform whole genomes sequencing from 8 individuals which will allow us to impute the complete sequence of nearly all the members of the two largest and most severely affected branches of the family.