1. Great Smoky Mountains Study (GSMS; Costello et al. 1996, 1997) Three cohorts of boys and girls, aged 9, 11, and 13 years at intake in 1993, were selected from a rural population of some 20,000 children using a household equal probability design. A two-phase procedure was used for White and African-American youth to increase power by oversampling children at risk for psychiatric and SUDs. Parents (usually mothers) of the first stage random population sample completed a questionnaire about their child's behavioral problems. Of 4,195 subjects selected, 95% (N=3,896) of parents completed the screen. All children scoring above a predetermined threshold (the top 25% of the total scores), plus a 10% random sample of the remaining 75%, were recruited for detailed interviews. Results can be back-weighted to population levels for analyses. Half of the sample consists of females, and 6% are African Americans, reflecting the population of the study area. The interviewed sample of white and African-American subjects was 1,070 (80% of those recruited). American Indian youth were oversampled (100%) because they are an understudied group known to be at high risk for stressful events, substance disorders, and mood disorders. Of 431 age-eligible children, 350 (81% boys, 49% girls) participated. Thus, the size of total GSMS sample is 1,070 + 350 = 1,420. Data collection is complete for ages 9-26, and age 30 interviews are in progress. By age 26 a total of 9,858 interviews had been completed; the average number of interviews per subject was seven, and by age 26, 97.3% completed two or more interviews. 2. The Caring for Children in the Community Study (CCC; Angold et al., 2002) This representative study of psychiatric illness and service use in African-American and White youth took place in four rural counties in the southeastern USA. The two-stage sampling design and methods are similar to those used in the GSMS. Of 4,500 youth randomly selected from the 17,117 9- to 17-year-olds in the public school's database, 3,613 (80.0%) were successfully contacted and agreed to complete the behavioral screen. Of the 1,302 selected to participate in the study, 920 (70.7%) interviews were completed. Because CCC was also the only study in GEDI to contain more than a very few African-American participants, these were omitted from the multi-site analyses. Reprinted with permission from Cambridge University Press from Costello et al., 2013: PMID: 23461817 References: Costello et al., 1996: PMID: 8956679 Costello et al., 1997: PMID: 9184514 Angold et al., 2002: PMID: 12365876
Contains test sample 1-26
DAC-2020-03-26-Lemola (DAC-039)), raw data in EGA, metadata in Harvard Dataverse
There are two datasets: 1. scRNA-seq of human cutaneous immune cells from psoriasis patients. These include pre- and post-Tildrakizumab treated patients and come in a BAM file format. 19006FL-25-01 19006FL-38-01 19006FL-32-01-03 19006FL-33-01 19006FL-28-01-05 19006FL-35-01-01 2. RNA-seq of ZFP36L2 CRISPIR deleted Human T cells are FASTQ files. 19006XR-30-05 19006XR-30-04 19006XR-30-02 19006XR-30-01 19006XR-26-05 19006XR-26-04 19006XR-26-02 19006XR-26-01 19006R-22-04 19006R-22-08 19006R-22-05 19006R-22-01
Single-cell RNA-Sequencing of 26 primary breast cancers from Wu et al. (2021) study. Data was generated using the Chromium controller (10X Genomics) and sequenced on the NextSeq 500 platform.
WGS - PCR-free library preparation (whole blood gDNA), paired-end (2x100bp) sequencing on DNBseq sequencing platform (BGI) Number of samples: 26
ATAC-seq data for 26 CLL samples (7 controls, 19 tumor) of the CancerEpiSys-PRECiSe project.
This dataset contains all the data available for this study on 2019-03-26.
Gene expression profiles of single cells from 26 tumor and ascites samples samples from 17 patients
26 samples from Cameroon generated for the H3Africa Chip Design Study. The dataset includes BAM, FASTQ and decompressed gVCF files.