No. of samples: 80 (28 ULP-WGS, 26 WES, 26 RNA-SEQ) File types: FASTQ (28 ULP-WGS, 19 WES, 18 RNA) and BAM (7 WES, 8 RNA) Technology used: Sequencing - Illumina Novoseq 6000; Map/Align - Illumina DRAGEN v3.7.5; Genome assembly - GrCh38p13 Filename nomenclature: - SampleName_Passage_SampleType_TissueType_SequencingType - Passage of: PX = unknown; PZ = from patient; P0 = first passage from patient on plastic; P1 = first passage from plastic/PDX/organoid - SampleType: STN = normal; STT = tumor - SampleType STN: 00 = tissue unknown; 01 = adjacent normal; 02 = fibroblast; 03 = germline blood; 21 = cell line from patient tissue; 22 = cell line from PDX; 23 = cell line from patient fibroblast - SampleType STT: 00 = tissue unknown; 01 = primary tumor; 21 = cell line from patient; 22 = cell line from PDX - TissueType: WT = Wilm's tumor; 00 = kidney unknown; 01 = kidney left; 02 = kidney right - SequencingType: 00 = unknown; 02 = ultra-low pass whole-genome sequencing; 20 = whole-exome; 61 = bulk RNA-sequencing
This GWAS of HCC relied on existing biological and data resources from 7 USA sites and one Canadian site which allows for the integration of genetic and environmental data. Total of 2199 case patients and 1103 non cancer controls were genotyped. Case patients are defined as patients with pathological or radiological diagnosis of HCC with and without chronic hepatitis C virus infection. Study was restricted to Caucasian population without prior infection with chronic HBV infection. All DNA samples were extracted from peripheral blood. Study population of the participated sites are: Site-1: University of Texas MD Anderson Cancer Center A hospital based-case-control study was initiated at UT MD Anderson Cancer Center in 2000 and approved by the Institutional Review Board of The University of Texas M. D. Anderson Cancer Center. Study design was previously described in detailed (1-3). Written informed consent for an interview and for a biological sample was obtained from each participant. At time of diagnosis prior to treatment exposure. Case patients were recruited from the population of patients with newly diagnosed HCC who were evaluated and treated at the institution's gastrointestinal medical and surgical oncology outpatient clinics. Inclusion criteria were as follows: a pathologically confirmed diagnosis of HCC, U.S. residency, and the ability to communicate in English. Patients with concurrent or past history of other types of cancers were excluded. Control subjects were healthy spouses of patients with cancers other than liver, gastrointestinal, lung, or head and neck (smoking-related cancers) undergoing treatment at our institution. Eligibility criteria for control subjects were the same as those for patients, except that control subjects could not have ever had cancer. A short, structured questionnaire was used to screen potential control subjects on the basis of eligibility criteria. Control subjects and patients were recruited simultaneously. Case patients and control subjects were interviewed by well-trained interviewers who followed a written protocol to guide ascertainment and reduce surveillance, interviewer, and recall bias. No proxy interviews were conducted. The interviewers used a structured and validated questionnaire to collect information about demographic characteristics and potential risk factors for HCC such as personal smoking history, alcohol use, medical history, occupational history, and family history of cancer. Blood samples from cases and controls were tested for HBV and HCV. HCV antibodies, hepatitis B surface antigen, and antibodies to hepatitis B core antigen were detected by use of a third-generation enzyme-linked immunosorbent assay (ELISA) (Abbott Laboratories, North Chicago, IL). Important clinical information were retrieved from patients' medical records. Total of 1188 case patients and 278 controls from MD Anderson study were genotyped. Site-2: Mayo Clinic Case patients and controls included in the study from Mayo clinic were recruited as part of Mayo International Hepatobiliary Neoplasia Register and Biorepository. Only USA patients and controls were participated. All cases and controls signed informed consent indicating their willingness to participate in genetic studies. Epidemiological and clinical data were collected from participants and retrieved from medical records. Total of 522 HCC case patients and 182 controls were genotyped in this study. Site-3: Toronto University Molecular Epidemiology of Hepatobiliary Tumors (HBT study, CAPCR 09-0289) is a hospital-based study that includes 1710 patients diagnosed with liver, bile duct or gallbladder adenocarcinoma (Hepatocellular carcinoma - HCC; cholangiocarcinoma - CCA; and gall bladder adenocarcinoma -GBCa) recruited at University Health Network (UHN) between 2009 and 2018. HBT study is a prospective study of cancer patients that collects self-reported demographic and epidemiological data, medical records/clinical information and blood samples for research purposes. Eligible patients recruited between 2009 and March 15 2016 were selected to participate in the Genome-Wide Association Study in Patients with Hepatocellular Carcinoma (GWAS in HCC, CAPCR 15-9943-CE), a sub-study of the HBT study. Eligible prospective patients recruited after March 15 2016 signed the Addendum to the HBT consent form and became eligible to participate in the GWAS in HCC study. Total of 271 cases and 21 controls were genotyped from Toronto University Site-4: University of Pittsburgh The design of the study has been described previously (4-6). Briefly, this population-based study enrolled 120 HCC patients and 230 matched controls from black, Hispanic, and non-Hispanic white residents of Los Angeles County who were between 18 and 74 years of age at diagnosis from January 1995 through December 2001. HCC Cases were identified through the Los Angeles County Cancer Surveillance Program. We sought to recruit up to two control subjects per case from the neighborhoods where HCC patients resided at the time of diagnosis, who were matched to the index case by sex, age (within 5 years), and race (Hispanic white, non-Hispanic white, black). Blood samples (plasma and buffy coat), medical and lifestyle factors were collected from all consenting participants. Total of 65 cases and 60 controls were genotyped in this study Site-5: South Western Dallas Under IRB approval, HCC Cases and controls were prospectively collected since 2015 from the outpatient Liver clinics from Parkland Health and Hospital System and UT Southwestern Medical Center, two large health systems in Dallas TX. A Total of 31 cases and 29 controls were genotyped from site 5. Site-6: Columbia University HCC patients from Columbia are those recruited as part of the Herbert Irving Comprehensive Cancer Center Database Shared Resource which seeks to recruit all cancer patients for potential future studies. The PI is Dr. Katherine Crew AAAL5871. This resource collects sociodemographic, lifestyle and clinical data on patients as well as a blood sample. Those who indicated on their consent form that they would participate in genetic studies were included. Total of 79 case patients with HCC were genotyped from Columbia University. Site-7: University of Michigan The study population included from University of Michigan were included from a prospective study of with chronic HCV infection recruited in Ann Arbor Patients who had under- gone liver transplantation, known coinfection with HIV, life expectancy <12 months due to extra-hepatic illnesses, or were receiving HCV treatment at enrolment, were excluded. Protocol, surveys, and data forms were developed where each enrolled patient completed the same questionnaire All patients provided written informed consent before enrolment in the study. The study was approved by the institutional review board or ethics committee at the University of Michigan. Detailed study description was previously published (7). Total of 44 cases and 347 controls were genotyped Site-8: Veteran Administration Medical Center in Houston This research including written informed consent form was jointly approved by Institutional Review Boards for the Baylor College of Medicine and the Michael E. DeBakey VA Medical Center in Houston, Texas. Study details have been previously published (8). Briefly, we prospectively recruited consecutive HCV-infected veterans prior to their previously scheduled HCV clinic visit at a large tertiary care VA medical center between May 1, 2009 and December 31, 2012. Patients completed a research assistant (RA) administered survey interrogating medical and risk factor history including lifetime alcohol use, had anthropometric measurements taken, and completed a fasting venipuncture for performance of the FibroSure-ActiTest as a measure of hepatic pathology. We restricted our current analysis to individuals who were: (1) White male veterans between 18 and 70 years; (2) had no history of HCC, liver transplant, decompensated liver disease including ascites, dementia, or psychosis; (3) were serologically-confirmed to have chronic HCV and to be negative for both HIV and active HBV infection; (4) were not currently receiving anti-HCV pharmacotherapy; and (5) had FibroSURE testing shown F3/F4 fibrosis consistent with cirrhosis.
This is only the location. For the raw data and any check if the dataset may be relevant please refer to the Harvard Dataverse (https://dataverse.harvard.edu/dataverse/lemola).
Subcutaneous panniculitis-like T-cell lymphoma (SPTCL) is a rare subtype of peripheral T-cell lymphoma affecting younger cases and associated with hemophagocytic lymphohistiocytosis. To clarify the molecular pathogenesis of SPTCL, we analyzed paired tumor and germline DNAs from 13 patients by whole exome sequencing.
All libraries were sequenced on Illumina NextSeq or NovaSeq6000 until sufficient saturation was reached.
Low-pass whole genome sequencing samples from pediatric solid tumor patients who are deceased
RNA-Seq is an effective method to study the transcriptome, but specialized methods are required to identify 5' ends of transcripts. Several published strategies exist for this specific purpose, but their relative merits have not been systematically analyzed. Here, we directly compare the performance of six such methods - testing five with cellular RNA as well as a novel spike-in RNA assay that helps address interpretation challenges that arise from uncertainties in annotation or RNA processing. Using a single human RNA sample, we constructed and sequenced 18 libraries with these methods and one standard, control RNA-Seq library. We find that the CAGE method performed best for mRNA and that most of its unannotated peaks are supported by evidence from other genomic methods. We then applied CAGE to eight brain-related samples and revealed sample-specific transcription start site (TSS) usage as well as a transcriptome-wide shift in TSS usage between fetal and adult brain.
The majority of human genomic research studies have been conducted in European-ancestry cohorts, reducing the likelihood of detecting potentially novel and globally impactful findings. Here, we present mid-pass whole-genome sequencing data and a genome-wide association study in a cohort of 264 self-reported Malagasy individuals from three locations on the island of Madagascar. We describe genetic variation in this Malagasy cohort, providing insight into the shared and unique patterns of genetic variation across the island. We observe phenotypic variation by location, and find high rates of hypertension particularly in the Southern Highlands sampling site as well as elevated self-reported malaria prevalence in the West Coast site relative to other sites. After filtering to a subset of 214 minimally-related individuals, we find a number of genetic associations with body composition traits, including many variants that are only observed in African populations or populations with admixed African ancestry from the 1000 Genomes Project. This study highlights the importance of including diverse populations in genomic research for the potential to gain novel insights, even with small cohort sizes. This project was conducted in partnership and consultation with local stakeholders in Madagascar and serves as an example of genomic research that prioritizes community engagement and that has potential impacts on our understanding of human health and disease.
This dataset includes an analyzed DMP file that provide the information about differential methylation positions based on Illumina Infinium MethylationEPIC BeadChip. All samples (5 lung cancer cases vs. 5 benign lung disease controls) were obtained from bronchial washings at the site of the lesion under bronchoscopy manipulation. Of the five lung cancer cases, 3 are adenocarcinoma and 2 are squamous carcinoma.
This dataset contains the BAM files of WES of 26 myeloid neoplasm patients with der(1;7)(q10;p10)(+) cases. The dataset was used to identify key driver genes in patients with der(1;7)(q10;p10). Already known drivers along with novel driver genes were identified through the use of this WES.