Deposited here are whole-genome sequencing data for 26 paired breast cancer DCIS and matched-normal samples taken from the same individual. Average sequence coverage is 118x for DCIS samples and 41x for matched-normals. Matched-normal samples are from blood. Sequencing was performed on an Illumina HiseqX. Due to specific restrictions imposed by the ethical approval at sample collection, the use of the germline data is restricted to filtering of somatic mutation calls only and cannot be used outside this purpose.They are a part of a broader project that uses high-depth WGS to investigate the somatic mutation genomic landscape of DCIS in order to uncover biological insights into breast cancer progression as well as possible methods to stratify DCIS patients for individualised therapy or disease monitoring.
26 cell lines derived from human Diffuse Large B Cell lymphomas (DLBCL) or Burkit Lymphomas (BL) were subjected to whole exome sequencing. Exome capture was carried out using the SeqCap EZ Exome Library 2.0 kit (Roche/Nimblegen) and 100 bp single-read sequencing was performed on a HiSeq2500 (Illumina). 82% of the coding region was covered at least 30x.
This dataset contains snRNA-seq data of 26 regionally sampled GBM tissue (peritumoral region, tumor edge, and tumor core). Regionally sampled GBM patient tissue was dissociated and nuclei were processed unsorted or by sorting with 7AAD to remove debris and dead cells. Single-nuclei suspension was prepared following the nuclei isolation protocol Single Cell Multiome ATAC + Gene Expression Sequencing protocol using Chromium Nuclei Isolation Kit. Single-nuclei suspension was prepared following the nuclei isolation protocol Single Cell Multiome ATAC + Gene Expression Sequencing protocol using Chromium Nuclei Isolation Kit, nuclei were barcoded and RNA and ATAC libraries were constructed, allowing for simultaneous capture of transcriptome and epigenome from the same cells.
The aim of this study is to investigate the somatic mutations in twins with BRCA1/2 negative breast cancer with no strong family history. . This dataset contains all the data available for this study on 2019-04-03.
We explored the relationship between genetic variants influencing predisposition to type 2 diabetes (T2D) and related glycemic traits, and human pancreatic islet transcription using RNA-Seq and genotyping data from 420 islet donors and 26 FAC sorted beta-cells.
This is only the location. For the raw data and any check if the dataset may be relevant please refer to the Harvard Dataverse (https://dataverse.harvard.edu/dataverse/lemola).
1. Great Smoky Mountains Study (GSMS; Costello et al. 1996, 1997) Three cohorts of boys and girls, aged 9, 11, and 13 years at intake in 1993, were selected from a rural population of some 20,000 children using a household equal probability design. A two-phase procedure was used for White and African-American youth to increase power by oversampling children at risk for psychiatric and SUDs. Parents (usually mothers) of the first stage random population sample completed a questionnaire about their child's behavioral problems. Of 4,195 subjects selected, 95% (N=3,896) of parents completed the screen. All children scoring above a predetermined threshold (the top 25% of the total scores), plus a 10% random sample of the remaining 75%, were recruited for detailed interviews. Results can be back-weighted to population levels for analyses. Half of the sample consists of females, and 6% are African Americans, reflecting the population of the study area. The interviewed sample of white and African-American subjects was 1,070 (80% of those recruited). American Indian youth were oversampled (100%) because they are an understudied group known to be at high risk for stressful events, substance disorders, and mood disorders. Of 431 age-eligible children, 350 (81% boys, 49% girls) participated. Thus, the size of total GSMS sample is 1,070 + 350 = 1,420. Data collection is complete for ages 9-26, and age 30 interviews are in progress. By age 26 a total of 9,858 interviews had been completed; the average number of interviews per subject was seven, and by age 26, 97.3% completed two or more interviews. 2. The Caring for Children in the Community Study (CCC; Angold et al., 2002) This representative study of psychiatric illness and service use in African-American and White youth took place in four rural counties in the southeastern USA. The two-stage sampling design and methods are similar to those used in the GSMS. Of 4,500 youth randomly selected from the 17,117 9- to 17-year-olds in the public school's database, 3,613 (80.0%) were successfully contacted and agreed to complete the behavioral screen. Of the 1,302 selected to participate in the study, 920 (70.7%) interviews were completed. Because CCC was also the only study in GEDI to contain more than a very few African-American participants, these were omitted from the multi-site analyses. Reprinted with permission from Cambridge University Press from Costello et al., 2013: PMID: 23461817 References: Costello et al., 1996: PMID: 8956679 Costello et al., 1997: PMID: 9184514 Angold et al., 2002: PMID: 12365876