HCC array for cnv
We have used a "chemistry first" approach to discover druggable acquired vulnerabilities that arised in the pathogenesis of non-small cell lung cancer (NSCLC). We screened chemical libraries (~200,000 compounds) for chemical toxins that killed subsets of NSCLC but not normal human lung epithelial cells (HBECs). We first screened a panel of 12 NSCLC lines that represented a variety of known oncogenotypes and identified chemicals with large Z scores and appropriate properties including re-supply, chemistry, and reproducible drug response phenotypes. This was then narrowed down to a list of 202 chemicals and 18 drugs with known targeting (henceforth called "Precision Oncology Probe Set", or POPS). These, and a panel of 30 clinically available drugs, targeted therapies, and drug combinations, already in use or in trials for NSCLC treatment, were then tested on a panel of 96 NSCLC lines for their drug response phenotypes in 12-point dose response curves. This information was analyzed using scanning ranked KS (Kolmogorov-Smirnov) and elastic net biostatistics approaches to identify molecular biomarkers (mutations, mRNA expression, copy number variation, protein expression, and metabolomics) which could predict for sensitivity or resistance to a particular chemical toxin or treatment regimen. From this we have discovered that: our approach identifies already known molecular biomarker of drug sensitivities (e.g. EGFR mutations and EGFR TK inhibitors); many clinically available chemotherapy agents have molecular biomarkers predicting preclinical model drug responses; the POP set of chemical toxins provides novel drug response phenotype patterns in the large NSCLC panel different from those found with clinically available agents including a therapeutic window; many of the POP toxins only hit a small percentage (~5%) of the NSCLC panel but the POP set as a whole provides "coverage" of the entire NSCLC panel; there are simple, one or 2 component molecular biomarkers (mutations, mRNA expression) that predict responses to the different chemical toxins in the NSCLC panel; and that the molecular biomarkers provide some information on the targets and pathways involved in response to the chemical toxins. Thus, we have identified a group of chemical toxins with selectivity for subsets of NSCLC and associated tumor molecular biomarkers to facilitate their development for precision medicine, and also, in some cases, information on the targets and pathways interdicted by these chemical compounds. In addition, we have discovered NSCLC predictive biomarkers for clinically available agents.
The case set is from a case-control study designed to identify common genetic risk factors for multiple myeloma in African Americans, the population with the highest risk for this cancer. We conducted two GWAS and combined each of these with convenience controls consisting of unaffected African American participants in cohorts with existing GWAS data. We then conducted a meta-analysis of the two sets. Cases were persons of African ancestry with smoldering or active multiple myeloma identified at participating oncology clinics or through SEER registries, as part of the African American Multiple Myeloma Study (AAMMS) diagnosed from Jan 1, 1988 through July 31, 2016. The majority of the samples were collected from incident and prevalent cases diagnosed since Jan 1, 2008 (80.9%), with a minority obtained from biobanks from cases diagnosed prior to 2008 (19.1%). Additional samples were obtained from the Multiethnic Cohort (USC and University of Hawaii) (n=40), the University of California at San Francisco Multiple Myeloma Study (n=27), and from the Multiple Myeloma Research Consortium for secondary analysis (samples originally provided to MMRC by 8 additional sites (n=84)). We have identified the phenotype (smoldering myeloma, plasma cell multiple myeloma, or myeloma not otherwise specified (myeloma NOS) when myeloma phenotype was not known), sex and age at diagnosis in this data set. The initial GWAS set consisted of 1308 (1,305 passed QC) cases with DNA samples, with a GWAS performed on the Illumina Human Core. Controls consisted of 7,078 unaffected African American subjects who were participants in the African American Prostate and Breast Consortium, with existing GWAS data from the Illumina1M Duo BeadChip. The second GWAS set consisted of 529 African American multiple myeloma patients with samples (406 from University of Arkansas, results not contained in this dataset because NCI funds were not used for the collection and genotyping) with GWAS data resulting from the Illumina Mega-BeadChip v1.1. Controls were 2,390 unaffected African American participants in the Multiethnic Cohort with existing GWAS data from the same array. After QC and removal of duplicates within sets, sex mismatches, and removal of plasmacytoma cases (ICD-0 code 9731), this deposited data set contains the typed GWAS data for the Illumina Human Core (set 1) (n=1298), and for the MegaBead Chip v1.1 (set 2) (n=123), with the University of Arkansas samples removed, for a total of 1,421 case genotypes. Note that 13 samples that overlap set 1 and set 2 were included.