Deep single-cell RNA sequencing data for 11,138 T cells from tumour, adjacent normal tissue and peripheral blood of treatment-naive CRC patients. The DATA ACCESS AGREEMENT is provided at https://github.com/zhangyybio/single-T-cell-data-access. Applicants can request access to the data by directly downloading it or by sending an email to cancerpku@pku.edu.cn. The process that is used to approve an application includes verifying the institution, participants and research purposes of the application. In general this process will take about two weeks. In principal, any academic research institutions complying with the laws and bioethic regulation policies of China will be approved.
We have generated and analyzed genomic data from a cohort of metastatic urothelial carcinoma patients treated with ICI such as anti-PD-(L)1 monoclonal antibodies. The dataset contains whole exome sequencing data of 27 whole blood samples and 27 FFPE tumor samples. Further, it includes RNA sequencing data from 21 tumor samples. Following the RECIST criteria, 10 patients were classified as non-responders to the treatment, and 17 were responders. The dataset also contains a merged vcf file containing somatic mutations called by Strelka2 and Mutect2 following the gatk best practice pipeline.
Increasing evidence shows the value of circulating tumour DNA (ctDNA) to detect cancer and monitor its progression. Somatic genomic structural variations (SVs) are promising personalized biomarkers for sensitive and specific detection of ctDNA in liquid biopsies. However, accurate, affordable, and fast identification of such SV biomarkers is challenging, which hinders routine use in the clinic. Here, we demonstrate a novel approach - termed SHARC - for rapid discovery of somatic SV breakpoints as personalized tumour biomarkers. SHARC combines low-coverage cancer genome sketching by using Oxford Nanopore portable sequencing with a random forest classification and a dedicated filtering pipeline to enrich for somatic SVs. Our method leverages the real-time and long-read capabilities of Nanopore sequencing to identify somatic SV breakpoints at nucleotide resolution from a tumour biopsy within two days. We applied SHARC to tumour samples of high-grade ovarian and prostate cancer and validated on average 10 somatic SVs per sample with the use of PCR mini-amplicons. Finally, we demonstrate that these somatic SV biomarkers can be used to detect tumour presence from liquid biopsies in a quantitative manner and we retrospectively monitored treatment response in patients with prostate cancer, demonstrating its potential benefit for clinical practice.
This dataset includes 4 samples with scRNA-seq and 3 sample with CITE-seq, scTCR-seq and scBCR-seq.
This study is part of the Gene Environment Association Studies initiative (GENEVA) funded by the National Human Genome Research Institute. The overarching goal is to identify novel genetic factors that contribute to addiction through a large-scale genome-wide association study of DSM-IV alcohol dependent (and frequently illicit drug dependent) cases and non-dependent, unrelated control subjects of European and African American descent. The focus of this proposal is a case-control design of unrelated individuals for a genetic association study of addiction. Cases are defined as individuals with DSM-IV alcohol dependence (lifetime) and potentially other illicit drug dependence. In addition to the categorical diagnosis, we have data on ordinal measurements of number of DSM-IV symptoms for alcohol, nicotine, marijuana, cocaine, opiates and other drugs so that we will able to construct quantitative measurements of addiction severity over a wide range of substances. Controls are defined as individuals who have been exposed to alcohol (and possibly to other drugs), but have never met lifetime diagnosis for alcohol dependence or dependence on other illicit substances. Analyses that include refinement of the phenotype and incorporation of important demographic and environmental factors into association studies will be pursued. Cases and controls were selected from three large, complementary datasets: the Collaborative Study on the Genetics of Alcoholism (COGA), the Family Study of Cocaine Dependence (FSCD), and the Collaborative Genetic Study of Nicotine Dependence (COGEND). COGA: COGA was initiated in 1989 and is a large-scale family study that has had as its primary aim the identification of genes that contribute to alcoholism susceptibility and related characteristics. COGA is funded through the National Institute on Alcohol Abuse and Alcoholism (NIAAA). Subjects were recruited from 7 sites across the U.S. Alcohol dependent probands were recruited from treatment facilities and assessed by personal interview. After securing permission, other family members were also assessed. A set of comparison families was drawn from the same communities as the families recruited through the alcohol dependent probands. Assessment involved a comprehensive personal interview developed for this project, the Semi-Structured Assessment for the Genetics of Alcoholism (SSAGA), which gathers detailed information on alcoholism related symptoms along with other drugs and psychiatric symptoms. Families with three or more first-degree relatives who were alcohol dependent were invited for more extensive testing, including neurophysiology evaluations (ERPs and EEGs) and a battery of neuropsychological assessments. Blood was obtained for genetic studies. Institutional Review Boards at all sites approved the protocols, including sharing in the NIAAA national repository. COGA has four Co-Principal Investigators Bernice Porjesz, Victor Hesselbrock, Howard Edenberg, and Laura Bierut. COGA includes nine different centers where data collection, analysis, and storage take place. The nine sites and Principal Investigators and Co-investigators are: University of Connecticut (Victor Hesselbrock); Indiana University (Howard Edenberg, John Nurnberger, Jr., Tatiana Foroud); University of Iowa (Samuel Kuperman); SUNY Downstate (Bernice Porjesz); Washington University in St. Louis (Laura Bierut, Alison Goate, John Rice); University of California at San Diego (Marc Schuckit); Howard University (Robert Taylor); Rutgers University (Jay Tischfield); Southwest Foundation (Laura Almasy). Q. Max Guo serves as the NIAAA Staff Collaborator. This national collaborative study is supported by the NIH Grant U10AA008401 from the National Institute on Alcohol Abuse and Alcoholism (NIAAA) and the National Institute on Drug Abuse (NIDA). Family Study of Cocaine Dependence (FSCD): This project was initiated in 2000 as a case-control family study of cocaine dependence funded through the National Institute on Drug Abuse (NIDA; PI: Laura Bierut). The primary goal was to increase the understanding of the familial and non-familial antecedents and consequences of cocaine dependence. Cocaine dependent individuals were systematically recruited from chemical dependency treatment units (both public and private; residential and outpatient) in the greater St. Louis metropolitan area. Community based control subjects were identified through a Missouri Driver's License Registry (maintained at Washington University for research purposes) and matched by age, race, gender, and residential zip code. As a supplement to this project, blood samples were collected for future genetic analysis and were included in the NIDA Genetics Consortium. Phenotypic data, DNA, and cell lines are in the NIDA Center for Genetics Studies. Collaborative Genetic Study of Nicotine Dependence (COGEND): COGEND was initiated in 2001 as a three-part program project grant funded through the National Cancer Institute (NCI; PI: Laura Bierut). The three projects included a study of the familial transmission of nicotine dependence, a genetic study of nicotine dependence, and a study of the relationship of nicotine dependence with nicotine metabolism. The primary goal is to detect, localize, and characterize genes that predispose or protect an individual with respect to heavy tobacco consumption, nicotine dependence, and related phenotypes and to integrate these findings with the family transmission and nicotine metabolism findings. The primary design is a community based case-control family study. All subjects were recruited from Detroit and St. Louis. Nicotine dependent cases and non-dependent smoking controls were identified and recruited. In addition, one sibling for each case and control subject was recruited in a subset of the sample. Over 56,000 subjects aged 25-44 years were screened by telephone, over 3,100 subjects were personally interviewed, and over 2,900 donated blood samples for genetic studies. All three studies (COGA, COGEND, FSCD) include measures of basic socio-demographic variables, including age, sex, race/ethnicity, family income, educational attainment, religious participation, and family structure. Other important covariates and/or potential moderators of genetic effects include comorbid addictions and age at initiation of use for cigarettes, alcohol and drugs. The assessments also include measures of various life stressors, such as physical and sexual abuse, which have been implicated in gene-environment interactions for several disorders. Coding for both individual variables and indices has been standardized across studies. All subjects were assessed in person by trained research assistants. Funding support for genotyping, which was performed at the Johns Hopkins University Center for Inherited Disease Research (CIDR), was provided by the NIH GEI (U01HG004438), the National Institute on Alcohol Abuse and Alcoholism, the National Institute on Drug Abuse, and the NIH contract "High throughput genotyping for studying the genetic contributions to human disease"(HHSN268200782096C). Note for Publications Related to Study: The Study of Addiction: Genetics and Environment (SAGE) has not yet generated publications. Below is a listing of publications related to the three studies from which the SAGE sample was selected. COGA has over 228 publications listed at www.niaaagenetics.org This study is part of the Gene Environment Association Studies initiative (GENEVA, http://www.genevastudy.org) funded by the trans-NIH Genes, Environment, and Health Initiative (GEI). The overarching goal is to identify novel genetic factors that contribute to addiction through a large-scale genome-wide association study of DSM-IV alcohol dependent (and frequently illicit drug dependent) cases and non-dependent, unrelated control subjects of European and African American descent. Genotyping was performed at the Johns Hopkins University Center for Inherited Disease Research (CIDR). Data cleaning and harmonization were done at the GEI-funded GENEVA Coordinating Center at the University of Washington.
This dataset contains the results of sequencing 9 samples (7 human/PDX, 2 cell lines) of medulloblastoma tumours (or 2 Neural Stem Cell lines). The data was collected to characterize the intratumour heterogeneity of chromothripsis in this tumour type. The 7 tumour samples underwent single cell DNA and RNA sequencing using the 10x CNVkit and 10xRNA protocols, and 1 of the tumour samples as well as the Neural Stem Cell lines underwent StrandSeq sequencing. All experiments were sequenced on Illumina sequencers. The provided files are paired-end fastq files from the sequencing experiments.
The datasets consists of RNA sequencing data of T cells from CLL patients or age-matched healthy donors. In brief, CLL PBMCs are thawed, the sample is split in two, one part is left as it is and stained for sorting and from the other part is stimulated using anti-CD3/CD28 soluble antibodies. After 2 days the stimulated condition is also stained and FACS sorted. The T cell fraction from healthy donors and CLL patients at baseline and after stimulation were sent for bulk sequencing.
This dataset contains fastq-files from single cell 5' RNA sequencing of the AML cell line HNT34 and normal T cells following co-culture with and without an antibody blocking SLAMF6 (TNC-1). The libraries were prepared using 10X GEM-X Universal 5' Gene Expression v3 Reagent Kit. In total, the dataset contains sequenced gene expression libraries from four samples (HNT34 co-cultured with T cells from two different donors; for both donors there is one sample with and one sample without the blocking antibody).
Between November 2010 and May 2014, 20 cases—17 thymomas and 3 thymic carcinomas (including 2 squamous and 1 neuroendocrine carcinoma)—were newly sequenced from surgically removed TETs at Seoul National University Hospital, with written informed consent. Tumors and normal tissues were carefully separated and immediately preserved in liquid nitrogen after resection. DNA and RNA were extracted from the tumors, adjacent normal tissues, and/or blood samples. Libraries for WES were prepared using the SureSelect XT (Human All Exon + UTR v5) Library Prep Kit following standard Illumina protocols.
Using single-nucleus RNA sequencing, we characterized the transcriptome of 880,000 nuclei from 18 control and 61 failing, nonischemic human hearts with pathogenic variants in DCM and ACM genes or idiopathic disease.