This project includes RNA-sequencing data from human FSHD and control skeletal muscle biopsies. This project includes data from 28 FSHD patients (total 37 samples, including vastus lateralis and tibialis anterior muscles) and 12 control individuals (total 24 samples, including vastus lateralis and tibialis anterior muscles).
Sequencing data of 39 tumor and control runs (different tumors and blood controls), which were uploaded to EGAS00001004813 and reused in this ImmuNEO publication. The sequencing was always paired.
Dataset contains 5 exome BAM files from a child with neurofibromatosis and relapsed refractory acute lymphoblastic leukaemia. The samples are CD19 positive and CD19 negative bone marrow mononuclear cells at both diagnosis and relapse as well as mesenchymal stem cells as the germline control. Libraries were prepared using the SureSelect Clinical Research Exome v2 kit (Agilent Technologies, Santa Clara, CA, USA) and run on the Illumina NextSeq 500 platform.
RNA-sequencing data: 5 normal thyroid tissues, 14 papillary thyroid carcinomas, 2 lymph node metastases, 19 poorly differentiated thyroid carcinomas and 17 anaplastic thyroid carcinomas; Targeted DNA-sequencing of the 165 genes included in the “Solid and Haematological tumors” panel (BRIGHTCore, Brussels, Belgium): 2 normal thyroid tissues, 2 poorly differentiated thyroid carcinomas and 7 anaplastic thyroid carcinomas; 2 FASTQ files for each sample (paired).
The genetic basis of many rare childhood cancers remains unknown. These include a spectrum of infant soft tissue tumors without canonical gene fusions, encompassing congenital mesoblastic nephroma (CMN) of the kidney and infantile fibrosarcoma (IFS). Here, we integrated whole genome and transcriptome sequencing and identified diagnostic markers and novel therapeutic strategies.
Phenotype determination by SNP-Typing using PCR and snapshotPCR with subsequent fragment analysis. We investigated 400 individuals from Northern Germany and detected up to 12 different SNPs to determine eye, hair and skin colour. More than 1000 different runs on a ABI3130 were performedThis dataset includes:- Phenotype information for 400 samples- Summary and complete genotype calls for 12 SNPs on 400 samples.
Five subjects from pedigree with co-occurrence of neurofibromatosis type 1 and moyamoya were sequenced in duplicate (0 and1). Kinship and phenotype: NF025, NF026 and NF027 were sibling all affected by neurofibromatosis type 1. NF026 also presented moyamoya. NF0262 and NF0263 were sibling both affected by neurofibromatosis type 1. NF0262 also presented moyamoya. NF026 and NF0262 were first cousins.
We profiled the whole genome methylome of SCLC preclinical models and cfDNA samples. We assessed the sensitivity of cfDNA methylation profiling in detecting SCLC, especially at earlier stage of disease and the ability in discriminate among different SCLC molecular subtypes.
This dataset represents two combined study populations. Serrated Colorectal Cancer: An Emerging Disease Subtype (called the Advanced Colorectal Cancer of Serrated Subtype Study or ACCESS Study) was a grant awarded to investigate a newly-recognized, biologically-distinct subtype of colorectal cancer (CRC) called “serrated CRC.” The objective of this project was to characterize factors related to the genetic predisposition, clinical presentation, and prognosis of serrated CRC. The study recruited incident invasive CRC cases diagnosed between April 2016 and December 2018, aged 20-74 years at diagnosis. Cases were identified through the Surveillance, Epidemiology and End Results (SEER) cancer registry serving 13 counties in western Washington State. Eligibility for all individuals was limited to those who were English-speaking and could consent. Participation included completing a baseline epidemiologic questionnaire shortly after diagnosis, optional donation of a saliva sample for genetic analysis, and optional consent to release of medical records and tissue specimens related to their diagnosis. Tumor specimens were tested for serrated CRC-defining molecular characteristics. Further, we have vital status on all participants and cause of death on those that have died since enrollment. Hormones and Colon Cancer: Epigenetic Subtypes, Risks, and Survival (called the Post-Menopausal Hormones Study or PMH Study) was a grant awarded to investigate the impact of post-menopausal hormone use on colon cancer risk, tumor molecular characteristics, and outcomes. Eligible cases were females, newly diagnosed with invasive colorectal adenocarcinoma between October 1998 and February 2002, aged 50 to 74 years. Cases were residents of 10 out of the 13 counties in western Washington State served by the Surveillance, Epidemiology and End Results (SEER) cancer registry. Eligibility for all individuals was limited to those who were English-speaking with available telephone numbers, in which they could be contacted. Unrelated population-based controls were randomly selected according to age distribution (in 5-year age intervals) of the eligible cases by using lists of licensed drivers from the Washington State Department of Licensing (for individuals aged 50 to 64 years) and rosters from the Health Care Financing Administration (now the Centers for Medicare and Medicaid, for individuals older than 64 years). Participation included completing a baseline epidemiologic questionnaire, optional donation of a saliva sample for genetic analysis, and (for cases only) optional consent to release of medical records and tissue specimens related to their diagnosis. Tumor specimens were tested for epigenetic and other molecular characteristics. The ACCESS study was supported by funding from the National Cancer Institute of the National Institutes of Health (NCI/NIH) (R01CA196337, PI: Newcomb, PA), as was the PMH Study (R01CA076366, PI: Newcomb, PA). Additional support for the PMH Study came from the Seattle site of the Colon Cancer Family Registry (SCCFR) (U01CA167551, PI: Jenkins, M, and U01/U24CA074794, PI: Newcomb, PA). Additional support for case ascertainment was provided by the Cancer Surveillance System of the Fred Hutchinson Cancer Center, which is funded by Contract Number HHSN261201300012I; NCI Control Number: N01 PC-2013-00012; Contract Number HHSN261201800004I; and NCI Control Number: N01 PC-2018-00004 from the Surveillance, Epidemiology and End Results (SEER) Program of the National Cancer Institute with additional support from the Fred Hutchinson Cancer Center and the State of Washington. This research was also supported by the Genomics and Bioinformatics, Comparative Medicine, Specialized Pathology, Collaborative Data Services, and Experimental Histopathology Shared Resources of the Fred Hutch/University of Washington Cancer Consortium (P30 CA015704).Tumor marker testing was performed using formalin-fixed paraffin-embedded diagnostic tumor tissue specimens, and DNA extracted from those specimens. Testing for microsatellite instability (MSI) was based on either a 10-gene panel (BAT25, BAT26, BAT40, MYCL, D5S346, D17S250, ACTC, D18S55, D10S197, BAT34C4) or a 4-marker immunohistochemistry panel of DNA mismatch repair proteins (MLH1, MSH2, MSH6, PMS2). CpG island methylator phenotype (CIMP) testing was based on a validated quantitative DNA methylation assay using a five-gene panel (CACNA1G, IGF2, NEUROG1, RUNX3, SOCS1) or eight-gene panel (CACNA1G, IGF2, NEUROG1, RUNX3, SOCS1, MLH1, CRABP1, CDKN2A). Somatic p.V600E BRAF mutation status was tested for using a fluorescent allele-specific PCR assay. KRAS mutations in codons 12 and 13 were also assessed through forward and reverse sequencing of amplified tumor DNA. DNA was extracted from blood/saliva samples using conventional methods. The genotyping panel completed was the Build37 OncoArray500K-C, including 1%-6% blinded duplicates to monitor the quality of the genotyping. Quality control procedures were performed to 1) make sure that there were no patterns of missing data by batch, study, or plate, 2) check for gender discrepancies and kinship, 3) complete Principal Component Analysis, and 4) test for Hardy-Weinberg equilibrium (HWE). Samples were excluded based on call rate, heterozygosity, unexpected duplicates, gender discrepancy, and unexpectedly high identity-by-descent or unexpected genotypic concordance (>65%) with another individual. In addition, variants were excluded based on call rate (98%), lack of HWE in controls (P
The genome-wide association study (GWAS) includes participants enrolled into two different studies. The first study, the San Francisco Bay Area Cancer Study (SFBCS) is a population-based case-control study of breast cancer conducted in the San Francisco Bay Area and included women ages 35-79 years from three racial/ethnic groups: Non-Hispanic whites, African Americans, and Hispanics/Latinas. For the GWAS, only Hispanic/Latina women were included. Women diagnosed with invasive breast cancer between 1995 and 2002 were identified through the Greater Bay Area Cancer Registry. Controls were identified by random digit dialing and were frequency-matched to cases by age in 5 year increments and by race/ethnicity. Hispanic/Latina ethnicity was assessed by self-report. 175 Hispanic/Latina cases and 307 Hispanic/Latina controls from the SFBCS had given adequate consent and provided biospecimens that were used in the GWAS to be included in this data submission. The second study is the Northern California site of the Breast Cancer Family Registry (NC-BCFR). This population-based family study recruited breast cancer cases ages 18-64 years diagnosed from 1995-2009 that were identified through the Greater Bay Area Cancer Registry. Cases included all women at increased genetic susceptibility for breast cancer who met one or more of the following criteria: (a) being diagnosed with breast cancer at age <35 years; b) having a personal history of ovarian cancer or childhood cancer; (c) being diagnosed with two different breast cancers (bilateral breast cancers), with the first one diagnosed at age <50 years; and d) having one or more first-degree relatives with breast cancer, ovarian cancer or childhood cancer. Cases not meeting these criteria were randomly sampled and racial/ethnic minorities were oversampled. Controls were recruited by random digit dialing and were matched by 5-year age increments and by race/ethnicity. For the current GWAS only Latina/Hispanic cases and controls were included. Latina/Hispanic ethnicity was assessed by self-report. 631 Hispanic/Latina cases and 61 Hispanic/Latina controls from the NC-BCFR had given adequate consent and provided biospecimens that were used in the GWAS to be included in this data submission.