TCF3-PBX1 (E2A-PBX1) is a recurrent gene fusion in B-cell precursor lymphoblastic leukemia (BCP-ALL), which is caused by the translocation t(1;19)(q23;p13). TCF3-PBX1 BCP-ALL patients typically benefit from chemotherapy; however, many relapse and subsequently develop resistant disease with few effective treatment options. Mechanisms driving disease progression and therapy resistance have not been studied in TCF3-PBX1 BCP-ALL. Here, we aimed to identify novel treatment options for TCF3-PBX1 BCP-ALL by profiling leukemia cells from a relapsed patient, and determine molecular mechanisms underlying disease pathogenesis and progression. By drug sensitivity testing of leukemic blasts from the index patient, control samples and TCF3-PBX1 positive and negative BCP-ALL cell lines, we identified the phosphatidylinositide 3-kinase delta (p110δ) inhibitor idelalisib as an effective treatment for TCF3-PBX1 BCP-ALL. This was further supported by evidence showing TCF3-PBX1 directly regulates expression of PIK3CD, the gene encoding p110δ. Other somatic mutations to TP53 and MTOR, as well as aberrant expression of CXCR4, may influence additional drug sensitivities specific to the index patient and accompanied progression of the disease. Our results suggest that idelalisib is a promising treatment option for patients with TCF3-PBX1 BCP-ALL while other drugs could be useful depending on the genetic context of individual patients.
Mantle cell lymphoma (MCL) is an aggressive tumor, but a subset of patients may follow an indolent clinical course. To understand the mechanisms underlying this biological heterogeneity, we performed whole-genome and/or whole-exome sequencing on 29 MCL cases and their respective matched normal DNA, as well as 6 MCL cell lines. Recurrently mutated genes were investigated by targeted sequencing in an independent cohort of 172 MCL patients. We identified 25 significantly mutated genes, including known drivers such as ataxia-telangectasia mutated (ATM), cyclin D1 (CCND1), and the tumor suppressor TP53; mutated genes encoding the anti-apoptotic protein BIRC3 and Toll-like receptor 2 (TLR2); and the chromatin modifiers WHSC1, MLL2, and MEF2B. We also found NOTCH2 mutations as an alternative phenomenon to NOTCH1 mutations in aggressive tumors with a dismal prognosis. Analysis of two simultaneous or subsequent MCL samples by whole-genome/whole-exome (n = 8) or targeted (n = 19) sequencing revealed subclonal heterogeneity at diagnosis in samples from different topographic sites and modulation of the initial mutational profile at the progression of the disease. Some mutations were predominantly clonal or subclonal, indicating an early or late event in tumor evolution, respectively. Our study identifies molecular mechanisms contributing to MCL pathogenesis and offers potential targets for therapeutic intervention.
Recent efforts reclassified B-Cell Precursor Acute Lymphoblastic Leukemia (BCP-ALL) into more refined subtypes. Nevertheless, outcomes of relapsed BCP-ALL remain unsatisfactory, particularly in adult patients where the molecular basis of relapse is still poorly understood. To elucidate the evolution of relapse in BCP-ALL, we established a comprehensive multi-omics dataset including DNA-sequencing, RNA-sequencing, DNA methylation array and proteome MASS-spec data from matched diagnosis and relapse samples of BCP-ALL patients (n = 50) including the subtypes DUX4, Ph-like and two aneuploid subtypes. Relapse-specific alterations were enriched for chromatin modifiers, nucleotide and steroid metabolism including the novel candidates FPGS, AGBL and ZNF483. The proteome expression analysis unraveled deregulation of metabolic pathways at relapse including the key proteins G6PD, TKT, GPI and PGD. Moreover, we identified a novel relapse-specific gene signature specific for DUX4 BCP-ALL patients highlighting chemotaxis and cytokine environment as a possible driver event at relapse. This study presents novel insights at distinct molecular levels of relapsed BCP-ALL based on a comprehensive multi-omics integrated data set including a valuable proteomics data set. The relapse specific aberrations reveal metabolic signatures on genomic and proteomic levels in BCP-ALL relapse. Furthermore, the chemokine expression signature in DUX4 relapse underscores the distinct status of DUX4-fusion BCP-ALL.
Immune regulatory metabolites are key features of the tumor microenvironment (TME), yet with a few notable exceptions, their identities remain largely unknown. We uncovered the immune regulatory metabolic states and metabolomes of sorted tumor and stromal, CD4+, and CD8+ cells from the tumor and ascites of patients with high-grade serous ovarian cancer (HGSC) using high-dimensional flow cytometry and metabolomics supplemented with single cell RNA sequencing. Flow cytometry revealed that tumor cells show a consistently greater uptake of glucose than T cells, but similar mitochondrial activity. Cells within the ascites and tumor had pervasive metabolite differences, with a striking enrichment in 1-methylnicotinamide (MNA) in T cells infiltrating the tumor compared to ascites. Despite the elevated levels of MNA in T cells, the expression of nicotinamide N-methyltransferase, the gene encoding the enzyme that catalyses the transfer of a methyl group from S-adenosylmethionine to nicotinamide, was restricted to fibroblasts and tumor cells. Treatment of T cells with MNA resulted in an increase in T cell-mediated secretion of the tumor promoting cytokine tumor necrosis factor alpha. Thus, the TME-derived metabolite MNA contributes to an alternative and non-cell autonomous mechanism of immune modulation of T cells in HGSC. Collectively, uncovering the tumor-T cell metabolome may reveal metabolic vulnerabilities that can be exploited using T cell-based immunotherapies to treat human cancer.
Patients with small-cell lung cancer (SCLC) have an exceptionally poor prognosis, calling for improved real-time non-invasive biomarkers of therapeutic response. We performed targeted error-correction sequencing on 171 serial plasmas and matched white blood cell (WBC) DNA from 33 patients with metastatic SCLC who received systemic treatment with chemotherapy or immunotherapy-containing regimens. Tumor-derived sequence alterations and plasma aneuploidy were evaluated serially and combined to assess changes in total cell-free tumor load (cfTL). Longitudinal dynamic changes in cfTL were monitored to determine circulating cell-free tumor DNA (ctDNA) molecular response during therapy. Combined tiered analyses of tumor-derived sequence alterations and plasma aneuploidy allowed for assessment of ctDNA molecular response in all patients. Molecular responses captured the therapeutic effect and long-term clinical outcomes in a more accurate and rapid manner compared to radiographic imaging. Patients with sustained molecular responses had longer overall (log-rank p=0.0006) and progression-free (log-rank p<0.0001) survival, with molecular responses detected on average 4 weeks earlier than imaging. ctDNA analyses provide a rapid approach for the assessment of early on-therapy molecular responses and have important implications for the management of patients with SCLC, including the development of improved strategies for real-time tumor burden monitoring.
Original description of the study: From ELLIPSE (linked to the PRACTICAL consortium), we contributed ~78,000 SNPs to the OncoArray. A large fraction of the content was derived from the GWAS meta-analyses in European ancestry populations (overall and aggressive disease; ~27K SNPs). We also selected just over 10,000 SNPs from the meta-analyses in the non-European populations, with a majority of these SNPs coming from the analysis of overall prostate cancer in African ancestry populations as well as from the multiethnic meta-analysis. A substantial fraction of SNPs (~28,000) were also selected for fine-mapping of 53 loci not included in the common fine-mapping regions (tagging at r2>0.9 across ±500kb regions). We also selected a few thousand SNPs related with PSA levels and/or disease survival as well as SNPs from candidate lists provided by study collaborators, as well as from meta-analyses of exome SNP chip data from the Multiethnic Cohort and UK studies. The Contributing Studies: Aarhus: Hospital-based, Retrospective, Observational. Source of cases: Patients treated for prostate adenocarcinoma at Department of Urology, Aarhus University Hospital, Skejby (Aarhus, Denmark). Source of controls: Age-matched males treated for myocardial infarction or undergoing coronary angioplasty, but with no prostate cancer diagnosis based on information retrieved from the Danish Cancer Register and the Danish Cause of Death Register. AHS: Nested case-control study within prospective cohort. Source of cases: linkage to cancer registries in study states. Source of controls: matched controls from cohort ATBC: Prospective, nested case-control. Source of cases: Finnish male smokers aged 50-69 years at baseline. Source of controls: Finnish male smokers aged 50-69 years at baseline BioVu: Cases identified in a biobank linked to electronic health records. Source of cases: A total of 214 cases were identified in the VUMC de-identified electronic health records database (the Synthetic Derivative) and shipped to USC for genotyping in April 2014. The following criteria were used to identify cases: Age 18 or greater; male; African Americans (Black) only. Note that African ancestry is not self-identified, it is administratively or third-party assigned (which has been shown to be highly correlated with genetic ancestry for African Americans in BioVU; see references). Source of controls: Controls were identified in the de-identified electronic health record. Unfortunately, they were not age matched to the cases, and therefore cannot be used for this study. Canary PASS: Prospective, Multi-site, Observational Active Surveillance Study. Source of cases: clinic based from Beth Israel Deaconness Medical Center, Eastern Virginia Medical School, University of California at San Francisco, University of Texas Health Sciences Center San Antonio, University of Washington, VA Puget Sound. Source of controls: N/A CCI: Case series, Hospital-based. Source of cases: Cases identified through clinics at the Cross Cancer Institute. Source of controls: N/A CerePP French Prostate Cancer Case-Control Study (ProGene): Case-Control, Prospective, Observational, Hospital-based. Source of cases: Patients, treated in French departments of Urology, who had histologically confirmed prostate cancer. Source of controls: Controls were recruited as participating in a systematic health screening program and found unaffected (normal digital rectal examination and total PSA < 4 ng/ml, or negative biopsy if PSA > 4 ng/ml). COH: hospital-based cases and controls from outside. Source of cases: Consented prostate cancer cases at City of Hope. Source of controls: Consented unaffected males that were part of other studies where they consented to have their DNA used for other research studies. COSM: Population-based cohort. Source of cases: General population. Source of controls: General population CPCS1: Case-control - Denmark. Source of cases: Hospital referrals. Source of controls: Copenhagen General Population Study CPCS2: Source of cases: Hospital referrals. Source of controls: Copenhagen General Population Study CPDR: Retrospective cohort. Source of cases: Walter Reed National Military Medical Center. Source of controls: Walter Reed National Military Medical Center ACS_CPS-II: Nested case-control derived from a prospective cohort study. Source of cases: Identified through self-report on follow-up questionnaires and verified through medical records or cancer registries, identified through cancer registries or the National Death Index (with prostate cancer as the primary cause of death). Source of controls: Cohort participants who were cancer-free at the time of diagnosis of the matched case, also matched on age (±6 mo) and date of biospecimen donation (±6 mo). EPIC: Case-control - Germany, Greece, Italy, Netherlands, Spain, Sweden, UK. Source of cases: Identified through record linkage with population-based cancer registries in Italy, the Netherlands, Spain, Sweden and UK. In Germany and Greece, follow-up is active and achieved through checks of insurance records and cancer and pathology registries as well as via self-reported questionnaires; self-reported incident cancers are verified through medical records. Source of controls: Cohort participants without a diagnosis of cancer EPICAP: Case-control, Population-based, ages less than 75 years at diagnosis, Hérault, France. Source of cases: Prostate cancer cases in all public hospitals and private urology clinics of département of Hérault in France. Cases validation by the Hérault Cancer Registry. Source of controls: Population-based controls, frequency age matched (5-year groups). Quotas by socio-economic status (SES) in order to obtain a distribution by SES among controls identical to the SES distribution among general population men, conditionally to age. ERSPC: Population-based randomized trial. Source of cases: Men with PrCa from screening arm ERSPC Rotterdam. Source of controls: Men without PrCa from screening arm ERSPC Rotterdam ESTHER: Case-control, Prospective, Observational, Population-based. Source of cases: Prostate cancer cases in all hospitals in the state of Saarland, from 2001-2003. Source of controls: Random sample of participants from routine health check-up in Saarland, in 2000-2002 FHCRC: Population-based, case-control, ages 35-74 years at diagnosis, King County, WA, USA. Source of cases: Identified through the Seattle-Puget Sound SEER cancer registry. Source of controls: Randomly selected, age-frequency matched residents from the same county as cases Gene-PARE: Hospital-based. Source of cases: Patients that received radiotherapy for treatment of prostate cancer. Source of controls: n/a Hamburg-Zagreb: Hospital-based, Prospective. Source of cases: Prostate cancer cases seen at the Department of Oncology, University Hospital Center Zagreb, Croatia. Source of controls: Population-based (Croatia), healthy men, older than 50, with no medical record of cancer, and no family history of cancer (1st & 2nd degree relatives) HPFS: Nested case-control. Source of cases: Participants of the HPFS cohort. Source of controls: Participants of the HPFS cohort IMPACT: Observational. Source of cases: Carriers and non-carriers (with a known mutation in the family) of the BRCA1 and BRCA2 genes, aged between 40 and 69, who are undergoing prostate screening with annual PSA testing. This cohort has been diagnosed with prostate cancer during the study. Source of controls: Carriers and non-carriers (with a known mutation in the family) of the BRCA1 and BRCA2 genes, aged between 40 and 69, who are undergoing prostate screening with annual PSA testing. This cohort has not been diagnosed with prostate cancer during the study. IPO-Porto: Hospital-based. Source of cases: Early onset and/or familial prostate cancer. Source of controls: Blood donors Karuprostate: Case-control, Retrospective, Population-based. Source of cases: From FWI (Guadeloupe): 237 consecutive incident patients with histologically confirmed prostate cancer attending public and private urology clinics; From Democratic Republic of Congo: 148 consecutive incident patients with histologically confirmed prostate cancer attending the University Clinic of Kinshasa. Source of controls: From FWI (Guadeloupe): 277 controls recruited from men participating in a free systematic health screening program open to the general population; From Democratic Republic of Congo: 134 controls recruited from subjects attending the University Clinic of Kinshasa KULEUVEN: Hospital-based, Prospective, Observational. Source of cases: Prostate cancer cases recruited at the University Hospital Leuven. Source of controls: Healthy males with no history of prostate cancer recruited at the University Hospitals, Leuven. LAAPC: Subjects were participants in a population-based case-control study of aggressive prostate cancer conducted in Los Angeles County. Cases were identified through the Los Angeles County Cancer Surveillance Program rapid case ascertainment system. Eligible cases included African American, Hispanic, and non-Hispanic White men diagnosed with a first primary prostate cancer between January 1, 1999 and December 31, 2003. Eligible cases also had (a) prostatectomy with documented tumor extension outside the prostate, (b) metastatic prostate cancer in sites other than prostate, (c) needle biopsy of the prostate with Gleason grade ≥8, or (d) needle biopsy with Gleason grade 7 and tumor in more than two thirds of the biopsy cores. Eligible controls were men never diagnosed with prostate cancer, living in the same neighborhood as a case, and were frequency matched to cases on age (± 5 y) and race/ethnicity. Controls were identified by a neighborhood walk algorithm, which proceeds through an obligatory sequence of adjacent houses or residential units beginning at a specific residence that has a specific geographic relationship to the residence where the case lived at diagnosis. Malaysia: Case-control. Source of cases: Patients attended the outpatient urology or uro-onco clinic at University Malaya Medical Center. Source of controls: Population-based, age matched (5-year groups), ascertained through electoral register, Subang Jaya, Selangor, Malaysia MCC-Spain: Case-control. Source of cases: Identified through the urology departments of the participating hospitals. Source of controls: Population-based, frequency age and region matched, ascertained through the rosters of the primary health care centers MCCS: Nested case-control, Melbourne, Victoria. Source of cases: Identified by linkage to the Victorian Cancer Registry. Source of controls: Cohort participants without a diagnosis of cancer MD Anderson: Participants in this study were identified from epidemiological prostate cancer studies conducted at the University of Texas MD Anderson Cancer Center in the Houston Metropolitan area. Cases were accrued in the Houston Medical Center and were not restricted with respect to Gleason score, stage or PSA. Controls were identified via random-digit-dialing or among hospital visitors and they were frequency matched to cases on age and race. Lifestyle, demographic, and family history data were collected using a standardized questionnaire. MDACC_AS: A prospective cohort study. Source of cases: Men with clinically organ-confined prostate cancer meeting eligibility criteria for a prospective cohort study of active surveillance at MD Anderson Cancer Center. Source of controls: N/A MEC: The Multiethnic Cohort (MEC) is comprised of over 215,000 men and women recruited from Hawaii and the Los Angeles area between 1993 and 1996. Between 1995 and 2006, over 65,000 blood samples were collected from participants for genetic analyses. To identify incident cancer cases, the MEC was cross-linked with the population-based Surveillance, Epidemiology and End Results (SEER) registries in California and Hawaii, and unaffected cohort participants with blood samples were selected as controls MIAMI (WFPCS): Prostate cancer cases and controls were recruited from the Departments of Urology and Internal Medicine of the Wake Forest University School of Medicine using sequential patient populations as described previously (PMID:15342424). All study subjects received a detailed description of the study protocol and signed their informed consent, as approved by the medical center's Institutional Review Board. The general eligibility criteria were (i) able to comprehend informed consent and (ii) without previously diagnosed cancer. The exclusion criteria were (i) clinical diagnosis of autoimmune diseases; (ii) chronic inflammatory conditions; and (iii) infections within the past 6 weeks. Blood samples were collected from all subjects. MOFFITT: Hospital-based. Source of cases: clinic based from Moffitt Cancer Center. Source of controls: Moffitt Cancer Center affiliated Lifetime cancer screening center NMHS: Case-control, clinic based, Nashville TN. Source of cases: All urology clinics in Nashville, TN. Source of controls: Men without prostate cancer at prostate biopsy. PCaP: The North Carolina-Louisiana Prostate Cancer Project (PCaP) is a multidisciplinary population-based case-only study designed to address racial differences in prostate cancer through a comprehensive evaluation of social, individual and tumor level influences on prostate cancer aggressiveness. PCaP enrolled approximately equal numbers of African Americans and Caucasian Americans with newly-diagnosed prostate cancer from North Carolina (42 counties) and Louisiana (30 parishes) identified through state tumor registries. African American PCaP subjects with DNA, who agreed to future use of specimens for research, participated in OncoArray analysis. PCMUS: Case-control - Sofia, Bulgaria. Source of cases: Patients of Clinic of Urology, Alexandrovska University Hospital, Sofia, Bulgaria, PrCa histopathologically confirmed. Source of controls: 72 patients with verified BPH and PSA<3,5; 78 healthy controls from the MMC Biobank, no history of PrCa PHS: Nested case-control. Source of cases: Participants of the PHS1 trial/cohort. Source of controls: Participants of the PHS1 trial/cohort PLCO: Nested case-control. Source of cases: Men with a confirmed diagnosis of prostate cancer from the PLCO Cancer Screening Trial. Source of controls: Controls were men enrolled in the PLCO Cancer Screening Trial without a diagnosis of cancer at the time of case ascertainment. Poland: Case-control. Source of cases: men with unselected prostate cancer, diagnosed in north-western Poland at the University Hospital in Szczecin. Source of controls: cancer-free men from the same population, taken from the healthy adult patients of family doctors in the Szczecin region PROCAP: Population-based, Retrospective, Observational. Source of cases: Cases were ascertained from the National Prostate Cancer Register of Sweden Follow-Up Study, a retrospective nationwide cohort study of patients with localized prostate cancer. Source of controls: Controls were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. PROGReSS: Hospital-based, Prospective, Observational. Source of cases: Prostate cancer cases from the Hospital Clínico Universitario de Santiago de Compostela, Galicia, Spain. Source of controls: Cancer-free men from the same population ProMPT: A study to collect samples and data from subjects with and without prostate cancer. Retrospective, Experimental. Source of cases: Subjects attending outpatient clinics in hospitals. Source of controls: Subjects attending outpatient clinics in hospitals ProtecT: Trial of treatment. Samples taken from subjects invited for PSA testing from the community at nine centers across United Kingdom. Source of cases: Subjects who have a proven diagnosis of prostate cancer following testing. Source of controls: Identified through invitation of subjects in the community. PROtEuS: Case-control, population-based. Source of cases: All new histologically-confirmed cases, aged less or equal to 75 years, diagnosed between 2005 and 2009, actively ascertained across Montreal French hospitals. Source of controls: Randomly selected from the Provincial electoral list of French-speaking men between 2005 and 2009, from the same area of residence as cases and frequency-matched on age. QLD: Case-control. Source of cases: A longitudinal cohort study (Prostate Cancer Supportive Care and Patient Outcomes Project: ProsCan) conducted in Queensland, through which men newly diagnosed with prostate cancer from 26 private practices and 10 public hospitals were directly referred to ProsCan at the time of diagnosis by their treating clinician (age range 43-88 years). All cases had histopathologically confirmed prostate cancer, following presentation with an abnormal serum PSA and/or lower urinary tract symptoms. Source of controls: Controls comprised healthy male blood donors with no personal history of prostate cancer, recruited through (i) the Australian Red Cross Blood Services in Brisbane (age range 19-76 years) and (ii) the Australian Electoral Commission (AEC) (age and post-code/ area matched to ProsCan, age range 54-90 years). RAPPER: Multi-centre, hospital based blood sample collection study in patients enrolled in clinical trials with prospective collection of radiotherapy toxicity data. Source of cases: Prostate cancer patients enrolled in radiotherapy trials: CHHiP, RT01, Dose Escalation, RADICALS, Pelvic IMRT, PIVOTAL. Source of controls: N/A SABOR: Prostate Cancer Screening Cohort. Source of cases: Men >45 yrs of age participating in annual PSA screening. Source of controls: Males participating in annual PSA prostate cancer risk evaluations (funded by NCI biomarkers discovery and validation grant), recruited through University of Texas Health Science Center at San Antonio and affiliated sites or through study advertisements, enrolment open to the community SCCS: Case-control in cohort, Southeastern USA. Prospective, Observational, Population-based. Source of cases: SCCS entry population. Source of controls: SCCS entry population SCPCS: Population-based, Retrospective, Observational. Source of cases: South Carolina Central Cancer Registry. Source of controls: Health Care Financing Administration beneficiary file SEARCH: Case-control - East Anglia, UK. Source of cases: Men < 70 years of age registered with prostate cancer at the population-based cancer registry, Eastern Cancer Registration and Information Centre, East Anglia, UK. Source of controls: Men attending general practice in East Anglia with no known prostate cancer diagnosis, frequency matched to cases by age and geographic region SNP_Prostate_Ghent: Hospital-based, Retrospective, Observational. Source of cases: Men treated with IMRT as primary or postoperative treatment for prostate cancer at the Ghent University Hospital between 2000 and 2010. Source of controls: Employees of the University hospital and members of social activity clubs, without a history of any cancer. SPAG: Hospital-based, Retrospective, Observational. Source of cases: Guernsey. Source of controls: Guernsey STHM2: Population-based, Retrospective, Observational. Source of cases: Cases were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. Source of controls: Controls were selected among men referred for PSA testing in laboratories in Stockholm County, Sweden, between 2010 and 2012. PCPT: Case-control from a randomized clinical trial. Source of cases: Randomized clinical trial. Source of controls: Randomized clinical trial SELECT: Case-cohort from a randomized clinical trial. Source of cases: Randomized clinical trial. Source of controls: Randomized clinical trial TAMPERE: Case-control - Finland, Retrospective, Observational, Population-based. Source of cases: Identified through linkage to the Finnish Cancer Registry and patient records; and the Finnish arm of the ERSPC study. Source of controls: Cohort participants without a diagnosis of cancer UGANDA: Uganda Prostate Cancer Study: Uganda is a case-control study of prostate cancer in Kampala Uganda that was initiated in 2011. Men with prostate cancer were enrolled from the Urology unit at Mulago Hospital and men without prostate cancer (i.e. controls) were enrolled from other clinics (i.e. surgery) at the hospital. UKGPCS: ICR, UK. Source of cases: Cases identified through clinics at the Royal Marsden hospital and nationwide NCRN hospitals. Source of controls: Ken Muir's control- 2000 ULM: Case-control - Germany. Source of cases: familial cases (n=162): identified through questionnaires for family history by collaborating urologists all over Germany; sporadic cases (n=308): prostatectomy series performed in the Clinic of Urology Ulm between 2012 and 2014. Source of controls: age-matched controls (n=188): age-matched men without prostate cancer and negative family history collected in hospitals of Ulm WUGS/WUPCS: Cases Series, USA. Source of cases: Identified through clinics at Washington University in St. Louis. Source of controls: Men diagnosed and managed with prostate cancer in University based clinic. Acknowledgement Statements: Aarhus: This study was supported by the Danish Strategic Research Council (now Innovation Fund Denmark) and the Danish Cancer Society. The Danish Cancer Biobank (DCB) is acknowledged for biological material. AHS: This work was supported by the Intramural Research Program of the NIH, National Cancer Institute, Division of Cancer Epidemiology and Genetics (Z01CP010119). ATBC: This research was supported in part by the Intramural Research Program of the NIH and the National Cancer Institute. Additionally, this research was supported by U.S. Public Health Service contracts N01-CN-45165, N01-RC-45035, N01-RC-37004, HHSN261201000006C, and HHSN261201500005C from the National Cancer Institute, Department of Health and Human Services. BioVu: The dataset(s) used for the analyses described were obtained from Vanderbilt University Medical Center's BioVU which is supported by institutional funding and by the National Center for Research Resources, Grant UL1 RR024975-01 (which is now at the National Center for Advancing Translational Sciences, Grant 2 UL1 TR000445-06). Canary PASS: PASS was supported by Canary Foundation and the National Cancer Institute's Early Detection Research Network (U01 CA086402) CCI: This work was awarded by Prostate Cancer Canada and is proudly funded by the Movember Foundation - Grant # D2013-36.The CCI group would like to thank David Murray, Razmik Mirzayans, and April Scott for their contribution to this work. CerePP French Prostate Cancer Case-Control Study (ProGene): None reported COH: SLN is partially supported by the Morris and Horowitz Families Endowed Professorship COSM: The Swedish Research Council, the Swedish Cancer Foundation CPCS1 & CPCS2: Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev Ringvej 75, DK-2730 Herlev, DenmarkCPCS1 would like to thank the participants and staff of the Copenhagen General Population Study for their important contributions. CPDR: Uniformed Services University for the Health Sciences HU0001-10-2-0002 (PI: David G. McLeod, MD) CPS-II: The American Cancer Society funds the creation, maintenance, and updating of the Cancer Prevention Study II cohort. CPS-II thanks the participants and Study Management Group for their invaluable contributions to this research. We would also like to acknowledge the contribution to this study from central cancer registries supported through the Centers for Disease Control and Prevention National Program of Cancer Registries, and cancer registries supported by the National Cancer Institute Surveillance Epidemiology and End Results program. EPIC: The coordination of EPIC is financially supported by the European Commission (DG-SANCO) and the International Agency for Research on Cancer. The national cohorts are supported by the Danish Cancer Society (Denmark); the Deutsche Krebshilfe, Deutsches Krebsforschungszentrum and Federal Ministry of Education and Research (Germany); the Hellenic Health Foundation, Greek Ministry of Health; Greek Ministry of Education (Greece); the Italian Association for Research on Cancer (AIRC) and National Research Council (Italy); the Dutch Ministry of Public Health, Welfare and Sports (VWS), Netherlands Cancer Registry (NKR), LK Research Funds, Dutch Prevention Funds, Dutch ZON (Zorg Onderzoek Nederland), World Cancer Research Fund (WCRF); the Statistics Netherlands (The Netherlands); the Health Research Fund (FIS), Regional Governments of Andalucía, Asturias, Basque Country, Murcia and Navarra, Spanish Ministry of Health ISCIII RETIC (RD06/0020), Red de Centros RCESP, C03/09 (Spain); the Swedish Cancer Society, Swedish Scientific Council and Regional Government of Skåne and Västerbotten, Fundacion Federico SA (Sweden); the Cancer Research UK, Medical Research Council (United Kingdom). EPICAP: The EPICAP study was supported by grants from Ligue Nationale Contre le Cancer, Ligue départementale du Val de Marne; Fondation de France; Agence Nationale de sécurité sanitaire de l'alimentation, de l'environnement et du travail (ANSES). The EPICAP study group would like to thank all urologists, Antoinette Anger and Hasina Randrianasolo (study monitors), Anne-Laure Astolfi, Coline Bernard, Oriane Noyer, Marie-Hélène De Campo, Sandrine Margaroline, Louise N'Diaye, and Sabine Perrier-Bonnet (Clinical Research nurses). ERSPC: This study was supported by the DutchCancerSociety (KWF94-869,98-1657,2002-277,2006-3518, 2010-4800), The Netherlands Organisation for Health Research and Development (ZonMW-002822820, 22000106, 50-50110-98-311, 62300035), The Dutch Cancer Research Foundation (SWOP), and an unconditional grant from Beckman-Coulter-HybritechInc. ESTHER: The ESTHER study was supported by a grant from the Baden Württemberg Ministry of Science, Research and Arts. The ESTHER group would like to thank Hartwig Ziegler, Sonja Wolf, Volker Hermann, Heiko Müller, Karina Dieffenbach, Katja Butterbach for valuable contributions to the study. FHCRC: The FHCRC studies were supported by grants R01-CA056678, R01-CA082664, and R01-CA092579 from the US National Cancer Institute, National Institutes of Health, with additional support from the Fred Hutchinson Cancer Research Center. FHCRC would like to thank all the men who participated in these studies. Gene-PARE: The Gene-PARE study was supported by grants 1R01CA134444 from the U.S. National Institutes of Health, PC074201 and W81XWH-15-1-0680 from the Prostate Cancer Research Program of the Department of Defense and RSGT-05-200-01-CCE from the American Cancer Society. Hamburg-Zagreb: None reported HPFS: The Health Professionals Follow-up Study was supported by grants UM1CA167552, CA133891, CA141298, and P01CA055075. HPFS are grateful to the participants and staff of the Physicians' Health Study and Health Professionals Follow-Up Study for their valuable contributions, as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, and WY. IMPACT: The IMPACT study was funded by The Ronald and Rita McAulay Foundation, CR-UK Project grant (C5047/A1232), Cancer Australia, AICR Netherlands A10-0227, Cancer Australia and Cancer Council Tasmania, NIHR, EU Framework 6, Cancer Councils of Victoria and South Australia, and Philanthropic donation to Northshore University Health System. We acknowledge support from the National Institute for Health Research (NIHR) to the Biomedical Research Centre at The Institute of Cancer Research and Royal Marsden Foundation NHS Trust. IMPACT acknowledges the IMPACT study steering committee, collaborating centres, and participants. IPO-Porto: The IPO-Porto study was funded by Fundaçäo para a Ciência e a Tecnologia (FCT; UID/DTP/00776/2013 and PTDC/DTP-PIC/1308/2014) and by IPO-Porto Research Center (CI-IPOP-16-2012 and CI-IPOP-24-2015). MC and MPS are research fellows from Liga Portuguesa Contra o Cancro, Núcleo Regional do Norte. SM is a research fellow from FCT (SFRH/BD/71397/2010). IPO-Porto would like to express our gratitude to all patients and families who have participated in this study. Karuprostate: The Karuprostate study was supported by the the Frech National Health Directorate and by the Association pour la Recherche sur les Tumeurs de la ProstateKarusprostate thanks Séverine Ferdinand. KULEUVEN: F.C. and S.J. are holders of grants from FWO Vlaanderen (G.0684.12N and G.0830.13N), the Belgian federal government (National Cancer Plan KPC_29_023), and a Concerted Research Action of the KU Leuven (GOA/15/017). TVDB is holder of a doctoral fellowship of the FWO. LAAPC: This study was funded by grant R01CA84979 (to S.A. Ingles) from the National Cancer Institute, National Institutes of Health. Malaysia: The study was funded by the University Malaya High Impact Research Grant (HIR/MOHE/MED/35). Malaysia thanks all associates in the Urology Unit, University of Malaya, Cancer Research Initiatives Foundation (CARIF) and the Malaysian Men's Health Initiative (MMHI). MCCS: MCCS cohort recruitment was funded by VicHealth and Cancer Council Victoria. The MCCS was further supported by Australian NHMRC grants 209057, 251553, and 504711, and by infrastructure provided by Cancer Council Victoria. Cases and their vital status were ascertained through the Victorian Cancer Registry (VCR) and the Australian Institute of Health and Welfare (AIHW), including the National Death Index and the Australian Cancer Database. MCC-Spain: The study was partially funded by the Accion Transversal del Cancer, approved on the Spanish Ministry Council on the 11th October 2007, by the Instituto de Salud Carlos III-FEDER (PI08/1770, PI09/00773-Cantabria, PI11/01889-FEDER, PI12/00265, PI12/01270, and PI12/00715), by the Fundación Marqués de Valdecilla (API 10/09), by the Spanish Association Against Cancer (AECC) Scientific Foundation and by the Catalan Government DURSI grant 2009SGR1489. Samples: Biological samples were stored at the Parc de Salut MAR Biobank (MARBiobanc; Barcelona) which is supported by Instituto de Salud Carlos III FEDER (RD09/0076/00036). Also sample collection was supported by the Xarxa de Bancs de Tumors de Catalunya sponsored by Pla Director d'Oncologia de Catalunya (XBTC). MCC-Spain acknowledges the contribution from Esther Gracia-Lavedan in preparing the data. We thank all the subjects who participated in the study and all MCC-Spain collaborators. MD Anderson: Prostate Cancer Case-Control Studies at MD Anderson (MDA) supported by grants CA68578, ES007784, DAMD W81XWH-07-1-0645, and CA140388. MDACC_AS: None reported MEC: Funding provided by NIH grant U19CA148537 and grant U01CA164973. MIAMI (WFPCS): ACS MOFFITT: The Moffitt group was supported by the US National Cancer Institute (R01CA128813, PI: J.Y. Park). NMHS: Funding for the Nashville Men's Health Study (NMHS) was provided by the National Institutes of Health Grant numbers: RO1CA121060. PCaP only data: The North Carolina - Louisiana Prostate Cancer Project (PCaP) is carried out as a collaborative study supported by the Department of Defense contract DAMD 17-03-2-0052. For HCaP-NC follow-up data: The Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study is carried out as a collaborative study supported by the American Cancer Society award RSGT-08-008-01-CPHPS. For studies using both PCaP and HCaP-NC follow-up data please use: The North Carolina - Louisiana Prostate Cancer Project (PCaP) and the Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study are carried out as collaborative studies supported by the Department of Defense contract DAMD 17-03-2-0052 and the American Cancer Society award RSGT-08-008-01-CPHPS, respectively. For any PCaP data, please include: The authors thank the staff, advisory committees and research subjects participating in the PCaP study for their important contributions. For studies using PCaP DNA/genotyping data, please include: We would like to acknowledge the UNC BioSpecimen Facility and LSUHSC Pathology Lab for our DNA extractions, blood processing, storage and sample disbursement (https://genome.unc.edu/bsp). For studies using PCaP tissue, please include: We would like to acknowledge the RPCI Department of Urology Tissue Microarray and Immunoanalysis Core for our tissue processing, storage and sample disbursement. For studies using HCaP-NC follow-up data, please use: The Health Care Access and Prostate Cancer Treatment in North Carolina (HCaP-NC) study is carried out as a collaborative study supported by the American Cancer Society award RSGT-08-008-01-CPHPS. The authors thank the staff, advisory committees and research subjects participating in the HCaP-NC study for their important contributions. For studies that use both PCaP and HCaP-NC, please use: The authors thank the staff, advisory committees and research subjects participating in the PCaP and HCaP-NC studies for their important contributions. PCMUS: The PCMUS study was supported by the Bulgarian National Science Fund, Ministry of Education and Science (contract DOO-119/2009; DUNK01/2-2009; DFNI-B01/28/2012) with additional support from the Science Fund of Medical University - Sofia (contract 51/2009; 8I/2009; 28/2010). PHS: The Physicians' Health Study was supported by grants CA34944, CA40360, CA097193, HL26490, and HL34595. PHS members are grateful to the participants and staff of the Physicians' Health Study and Health Professionals Follow-Up Study for their valuable contributions, as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, and WY. PLCO: This PLCO study was supported by the Intramural Research Program of the Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIHPLCO thanks Drs. Christine Berg and Philip Prorok, Division of Cancer Prevention at the National Cancer Institute, the screening center investigators and staff of the PLCO Cancer Screening Trial for their contributions to the PLCO Cancer Screening Trial. We thank Mr. Thomas Riley, Mr. Craig Williams, Mr. Matthew Moore, and Ms. Shannon Merkle at Information Management Services, Inc., for their management of the data and Ms. Barbara O'Brien and staff at Westat, Inc. for their contributions to the PLCO Cancer Screening Trial. We also thank the PLCO study participants for their contributions to making this study possible. Poland: None reported PROCAP: PROCAP was supported by the Swedish Cancer Foundation (08-708, 09-0677). PROCAP thanks and acknowledges all of the participants in the PROCAP study. We thank Carin Cavalli-Björkman and Ami Rönnberg Karlsson for their dedicated work in the collection of data. Michael Broms is acknowledged for his skilful work with the databases. KI Biobank is acknowledged for handling the samples and for DNA extraction. We acknowledge The NPCR steering group: Pär Stattin (chair), Anders Widmark, Stefan Karlsson, Magnus Törnblom, Jan Adolfsson, Anna Bill-Axelson, Ove Andrén, David Robinson, Bill Pettersson, Jonas Hugosson, Jan-Erik Damber, Ola Bratt, Göran Ahlgren, Lars Egevad, and Roy Ehrnström. PROGReSS: The PROGReSS study is founded by grants from the Spanish Ministry of Health (INT15/00070; INT16/00154; FIS PI10/00164, FIS PI13/02030; FIS PI16/00046); the Spanish Ministry of Economy and Competitiveness (PTA2014-10228-I), and Fondo Europeo de Desarrollo Regional (FEDER 2007-2013). ProMPT: Founded by CRUK, NIHR, MRC, Cambride Biomedical Research Centre ProtecT: Founded by NIHR. ProtecT and ProMPT would like to acknowledge the support of The University of Cambridge, Cancer Research UK. Cancer Research UK grants (C8197/A10123) and (C8197/A10865) supported the genotyping team. We would also like to acknowledge the support of the National Institute for Health Research which funds the Cambridge Bio-medical Research Centre, Cambridge, UK. We would also like to acknowledge the support of the National Cancer Research Prostate Cancer: Mechanisms of Progression and Treatment (PROMPT) collaborative (grant code G0500966/75466) which has funded tissue and urine collections in Cambridge. We are grateful to staff at the Welcome Trust Clinical Research Facility, Addenbrooke's Clinical Research Centre, Cambridge, UK for their help in conducting the ProtecT study. We also acknowledge the support of the NIHR Cambridge Biomedical Research Centre, the DOH HTA (ProtecT grant), and the NCRI/MRC (ProMPT grant) for help with the bio-repository. The UK Department of Health funded the ProtecT study through the NIHR Health Technology Assessment Programme (projects 96/20/06, 96/20/99). The ProtecT trial and its linked ProMPT and CAP (Comparison Arm for ProtecT) studies are supported by Department of Health, England; Cancer Research UK grant number C522/A8649, Medical Research Council of England grant number G0500966, ID 75466, and The NCRI, UK. The epidemiological data for ProtecT were generated though funding from the Southwest National Health Service Research and Development. DNA extraction in ProtecT was supported by USA Dept of Defense award W81XWH-04-1-0280, Yorkshire Cancer Research and Cancer Research UK. The authors would like to acknowledge the contribution of all members of the ProtecT study research group. The views and opinions expressed therein are those of the authors and do not necessarily reflect those of the Department of Health of England. The bio-repository from ProtecT is supported by the NCRI (ProMPT) Prostate Cancer Collaborative and the Cambridge BMRC grant from NIHR. We thank the National Institute for Health Research, Hutchison Whampoa Limited, the Human Research Tissue Bank (Addenbrooke's Hospital), and Cancer Research UK. PROtEuS: PROtEuS was supported financially through grants from the Canadian Cancer Society (13149, 19500, 19864, 19865) and the Cancer Research Society, in partnership with the Ministère de l'enseignement supérieur, de la recherche, de la science et de la technologie du Québec, and the Fonds de la recherche du Québec - Santé.PROtEuS would like to thank its collaborators and research personnel, and the urologists involved in subjects recruitment. We also wish to acknowledge the special contribution made by Ann Hsing and Anand Chokkalingam to the conception of the genetic component of PROtEuS. QLD: The QLD research is supported by The National Health and Medical Research Council (NHMRC) Australia Project Grants (390130, 1009458) and NHMRC Career Development Fellowship and Cancer Australia PdCCRS funding to J Batra. The QLD team would like to acknowledge and sincerely thank the urologists, pathologists, data managers and patient participants who have generously and altruistically supported the QLD cohort. RAPPER: RAPPER is funded by Cancer Research UK (C1094/A11728; C1094/A18504) and Experimental Cancer Medicine Centre funding (C1467/A7286). The RAPPER group thank Rebecca Elliott for project management. SABOR: The SABOR research is supported by NIH/NCI Early Detection Research Network, grant U01 CA0866402-12. Also supported by the Cancer Center Support Grant to the Cancer Therapy and Research Center from the National Cancer Institute (US) P30 CA054174. SCCS: SCCS is funded by NIH grant R01 CA092447, and SCCS sample preparation was conducted at the Epidemiology Biospecimen Core Lab that is supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA68485). Data on SCCS cancer cases used in this publication were provided by the Alabama Statewide Cancer Registry; Kentucky Cancer Registry, Lexington, KY; Tennessee Department of Health, Office of Cancer Surveillance; Florida Cancer Data System; North Carolina Central Cancer Registry, North Carolina Division of Public Health; Georgia Comprehensive Cancer Registry; Louisiana Tumor Registry; Mississippi Cancer Registry; South Carolina Central Cancer Registry; Virginia Department of Health, Virginia Cancer Registry; Arkansas Department of Health, Cancer Registry, 4815 W. Markham, Little Rock, AR 72205. The Arkansas Central Cancer Registry is fully funded by a grant from National Program of Cancer Registries, Centers for Disease Control and Prevention (CDC). Data on SCCS cancer cases from Mississippi were collected by the Mississippi Cancer Registry which participates in the National Program of Cancer Registries (NPCR) of the Centers for Disease Control and Prevention (CDC). The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the CDC or the Mississippi Cancer Registry. SCPCS: SCPCS is funded by CDC grant S1135-19/19, and SCPCS sample preparation was conducted at the Epidemiology Biospecimen Core Lab that is supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA68485). SEARCH: SEARCH is funded by a program grant from Cancer Research UK (C490/A10124) and supported by the UK National Institute for Health Research Biomedical Research Centre at the University of Cambridge. SNP_Prostate_Ghent: The study was supported by the National Cancer Plan, financed by the Federal Office of Health and Social Affairs, Belgium. SPAG: Wessex Medical ResearchHope for Guernsey, MUG, HSSD, MSG, Roger Allsopp STHM2: STHM2 was supported by grants from The Strategic Research Programme on Cancer (StratCan), Karolinska Institutet; the Linné Centre for Breast and Prostate Cancer (CRISP, number 70867901), Karolinska Institutet; The Swedish Research Council (number K2010-70X-20430-04-3) and The Swedish Cancer Society (numbers 11-0287 and 11-0624); Stiftelsen Johanna Hagstrand och Sigfrid Linnérs minne; Swedish Council for Working Life and Social Research (FAS), number 2012-0073STHM2 acknowledges the Karolinska University Laboratory, Aleris Medilab, Unilabs and the Regional Prostate Cancer Registry for performing analyses and help to retrieve data. Carin Cavalli-Björkman and Britt-Marie Hune for their enthusiastic work as research nurses. Astrid Björklund for skilful data management. We wish to thank the BBMRI.se biobank facility at Karolinska Institutet for biobank services. PCPT & SELECT are funded by Public Health Service grants U10CA37429 and 5UM1CA182883 from the National Cancer Institute. SWOG and SELECT thank the site investigators and staff and, most importantly, the participants who donated their time to this trial. TAMPERE: The Tampere (Finland) study was supported by the Academy of Finland (251074), The Finnish Cancer Organisations, Sigrid Juselius Foundation, and the Competitive Research Funding of the Tampere University Hospital (X51003). The PSA screening samples were collected by the Finnish part of ERSPC (European Study of Screening for Prostate Cancer). TAMPERE would like to thank Riina Liikanen, Liisa Maeaettaenen and Kirsi Talala for their work on samples and databases. UGANDA: None reported UKGPCS: UKGPCS would also like to thank the following for funding support: The Institute of Cancer Research and The Everyman Campaign, The Prostate Cancer Research Foundation, Prostate Research Campaign UK (now Prostate Action), The Orchid Cancer Appeal, The National Cancer Research Network UK, The National Cancer Research Institute (NCRI) UK. We are grateful for support of NIHR funding to the NIHR Biomedical Research Centre at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust. UKGPCS should also like to acknowledge the NCRN nurses, data managers, and consultants for their work in the UKGPCS study. UKGPCS would like to thank all urologists and other persons involved in the planning, coordination, and data collection of the study. ULM: The Ulm group received funds from the German Cancer Aid (Deutsche Krebshilfe). WUGS/WUPCS: WUGS would like to thank the following for funding support: The Anthony DeNovi Fund, the Donald C. McGraw Foundation, and the St. Louis Men's Group Against Cancer.
The goal of this study was to identify genetic variants in a genome-wide association study (GWAS) that are associated with previously obtained platelet function phenotypes measured in each individual under baseline conditions and following 2 weeks of low dose aspirin (ASA). During this study we performed a high density 1 million-SNP genome scan on subjects from GeneSTAR (representing 800 2-generational families with a family history of premature coronary artery disease, 60% white and 40% African American, N=3232). We identified genomic loci associated with quantitative platelet phenotypes prioritized for their biological interest, determined associations between genomic loci and baseline platelet phenotypes (primary phenotypes are platelet aggregation in platelet rich plasma induced by collagen, adenosine diphosphate (ADP), arachidonic acid (AA), and epinephrine (Epi), determined associations between genomic loci and post-ASA platelet phenotypes, ie, measures of ASA "resistance" (primary phenotypes are as above, plus urinary levels of the prostaglandin metabolite, 11-dehydro-thromboxane B2 ), and compared peaks of association between genomic loci and three common baseline platelet phenotypes (collagen-, epinephrine-, and ADP-induced aggregation in platelet rich plasma) with associations found for these same phenotypes in the Framingham Heart Study. We also determined whether any significant genotype-phenotype associations in the 1 million SNP genome scan could be localized to any specific genes or potential genes of interest using publicly available databases and further examined whether candidate genes previously associated with a specific platelet phenotype are located in a genomic region of interest as determined from the SNP genome scan. We conducted replications of our findings, and are presently involved in larger scale meta-analysis of findings for pre-aspirin and post-aspirin associations. The study population is constituted of full siblings (SIBS) (ages 35-78 years) identified from The Johns Hopkins Sibling Study, now called GeneSTAR, the spouses of the SIBS, and their adult offspring (>21 years of age). They include 3200 individuals from approximately 300 African American and 500 white extended families. "Spouse" refers to the other parent of any SIB offspring, independent of marital status. Sibship sizes among families range from 1 to 16 (excluding index cases). On average, each SIB has 2 potentially eligible offspring. A family for this study is defined by all of the full sibships and the total numbers of offspring that come from all SIBS, rather than just the nuclear family.
Neuroblastoma, a clinically heterogeneous pediatric cancer, is characterized by distinct genomic profiles but few recurrent mutations. As neuroblastoma is expected to have high degree of genetic heterogeneity, study of neuroblastoma's clonal evolution with deep coverage whole-genome sequencing of diagnosis and relapse samples will lead to a better understanding of the molecular events associated with relapse. Samples were included in this study if sufficient DNA from constitutional, diagnosis and relapse tumors was available for WGS. Whole genome sequencing was performed on trios (constitutional, diagnose and relapse DNA) from eight patients using Illumina Hi-seq2500 leading to paired-ends (PE) 90x90 for 6 of them and 100x100 for two. Expected coverage for sample NB0175 100x100bp was 30X for tumor and constitutional samples. For the seven other patients expected coverage was 80X for tumor samples with PE 100x100, 100X in the other tumor samples and 50X for all constitutional samples (see table 1). Following alignment with BWA (Li et al., Oxford J, 2009 Jul) allowing up to 4% of mismatches, bam files were cleaned up according to the Genome Analysis Toolkit (GATK) recommendations (Van der Auwera et al., Current Protocols in Bioinformatics, 2013, picard-1.45, GenomeAnalysisTK-2.2-16). Variant calling was performed in parallel using 3 variant callers: GenomeAnalysisTK-2.2-16, Samtools-0.1.18 and MuTect-1.1.4 (McKenna et al., Genome Res, 2010; Li et al., Oxford J, 2009 Aug; Cibulskis et al., Nature, 2013). Annovar-v2012-10-23 with cosmic-v64 and dbsnp-v137 were used for the annotation and RefSeq for the structural annotation. For GATK and Samtools, single nucleotide variants (SNVs) with a quality under 30, a depth of coverage under 6 or with less than 2 reads supporting the variant were filter out. MuTect with parameters following GATK and Samtools thresholds have been used to filter our irrelevant variants. .SNVs within and around exons of coding genes overlapping splice sites.. Then,variants reported in more than 1% of the population in the 1000 genomes (1000gAprl_2012) or Exome Sequencing Project (ESP6500) have been discarded in order to filter polymorphisms. Finally, synonymous variants were filtered out. MuTect focuses on somatic by filtering with constitutional sample. Mpileup comparison between constitutional and somatic DNAs allowed us to focus also on tumor specific SNVs with GATK and Samtools. Finally, every SNV called by our pipeline and also supported in any constitutional samples were filtered our in order to prevent putative constitutional DNA coverage deficiency. Then we analyzed CNVs (copy number variants) with HMMcopy-v0.1.1 (Gavin et al., Genome Res, 2012) and control-FREEC-v6.7 (Boeva et al., Bioinformatics 2011) with a respective window of 2000bp and 1000 bp, and auto-correction of normal contamination of tumor samples for Control-FREEC. Finally we explored Structural variants (SVs) including deletions, inversions, tandem duplications and translocations using DELLY-v0.5.5 with standard parameters (Rausch et al., Oxford J, 2012). In tumors, at least 10 supporting reads were required to make a call and 5 supporting reads for the sample NB0175 with a coverage of only 40X (see table 2). To predict SVs in constitutional samples for subsequent somatic filtering, only 2 supporting reads were required in order not to miss one. To identify somatic events, all the SVs in each normal sample were first flanked by 500 bp in both directions and any SVs called in a tumor sample which was in the combined flanked regions of respective normal sample was removed (see graph 1). Deletions with more than 5 genes impacted or larger than 1Mb and inversions or tandem duplications covering more than 4 genes, were removed. We focused on exonic and splicing events for deletions, inversions, and tandem duplications. For translocation, we keep all SVs that occurred in intronic, exonic, 5'UTR, upstream or splicing regions. Bioinformatics detection of variations with Deep sequencing approach Once PE reads merged and adaptors trimmed by SeqPrep with default parameters, merged reads were aligned via the BWA (Li H. and Durbin R. 2009 PMID 19451168) allowing up to 1 differences in the 22-base-long seeds and reporting only unique alignments. Only reads having a mapping quality 20 or more have been further analysed. Variant calling software was not used, since we aimed to predict variations at low frequencies, observed in less than 1% of reads. Such variants require a custom approach. Using DepthOfCoverage functions of the Genome Analysis Toolkit (GATK) v2.13.2 (McKenna A, et al., 2010 Genome Research PMID: 20644199), we focused on high quality coverage of bases A, C, G and T at the targeted variant position. Depth of coverage of each base following a mapping quality higher than 20 and a base quality higher than 10 have been taken into account in order to focus only on high quality data. Aiming to determine the background level of variability at the studied regions, 10 control samples were included in the analysis. The same approach and filtering criteria have been applied as introduced above over the entire amplicons. In order to highlight variants, for each sample the frequencies of each bases at each amplicon position were then compared to those observed in the set of controls. Statistical analyses were performed with the R statistical software (http://www.R-project.org). Fisher’s exact two-sided tests with a Bonferroni correction were performed to compare percentages of bases between the data sets, i.e. for a given base between a case and the controls. Finally, significant variations were filtered-in once (i) a significant increase in the percentage of avariant base and (ii) a significant decrease in the percentage of it's reference base following our p.values criteria was observed (p.val < 0.05).
Coronary artery disease (CAD) is the number one cause of death and mortality world-wide. High levels of serum triglycerides (TGs) and low levels of serum high-density lipoprotein cholesterol (HDL-C) are major risk factors for CAD. Previous epidemiological studies have shown that these two lipid disturbances are highly common dyslipidemias in Mexicans. However, the genetic factors underlying high serum TGs and low HDL-C are underinvestigated and poorly identified in Mexicans. As the Mexican-American and the genetically related Latin-American populations represent the fastest growing minority in the United States, elucidation of the unknown genetic factors influencing the increased susceptibility of Mexicans to these common dyslipidemias is of great relevance to these U.S. minorities and the American healthcare system. This study is a research collaboration between investigators at the University of California, Los Angeles (UCLA) and Instituto Nacional de Ciencias Médicas y Nutrición (INCMN), Mexico City, to perform a two-stage genome-wide association study (GWAS) for hypertriglyceridemia and low high-density lipoprotein cholesterol (HDL-C) in Mexicans (Weissglas-Volkov et al. 2013, PMID: 23505323). This study was funded by an RO1 grant from NIH/NHLBI (PI Paivi Pajukanta; PI of the subaward Teresa Tusie-Luna). In stage 1 of the GWAS, we tested all GWAS SNPs from the Illumina Infinium Human610-Quad platform that passed quality control for association in ~2,200 hypertriglyceridemia cases and controls. In stage 2, we genotyped the ~1,200 positive signals of stage 1 in additional ~2,200 hypertriglyceridemia cases and controls, and performed a combined analysis of the two stages to identify genome-wide significant variants. The subjects were recruited at the INCMN. The DNA isolation and clinical laboratory measurements of the case-control study samples were performed at INCMN using standardized procedures. Genotyping and statistical analyses of the data were performed at UCLA. A summary of this GWAS can be found in Weissglas-Volkov et al. 2013 Journal of Medical Genetics, PMID: 23505323. Briefly, using the two-stage GWAS design, we identified a novel Mexican-specific genome-wide significant locus for serum TG levels close to the Niemann-Pick type C1 protein (NPC1) gene (Weissglas-Volkov et al. 2013, PMID: 23505323). Of the European lipid GWAS loci, three TG loci (APOA5, GCKR, and LPL) and four HDL-C loci (ABCA1, CETP, LIPC and LOC55908) resulted in genome-wide significant signals in Mexicans. Furthermore, our cross-ethnic mapping narrowed three European TG GWAS loci, APOA5, MLXIPL, and CILP2 that exhibited long range LD and a large number of candidate SNPs in the European GWAS scan. In the apolipoprotein A1C3A4A5 gene cluster region, the cross-ethnic fine mapping and LD comparisons reduced the number of candidate variants to one SNP, rs964184. We also found that although 52% of the associations from European lipid GWAS meta-analysis could be generalized to Mexicans, in 82% of the European GWAS loci, a different variant other than the European lead SNP provided the statistically strongest evidence of association in the Mexican scan (Weissglas-Volkov et al. 2013, PMID: 23505323). Taken together, our Mexican GWAS for lipids identified a novel Mexican-specific locus for high serum TGs; restricted three European GWAS loci; and investigated which European lipid GWAS variants extend to the Mexican population (Weissglas-Volkov et al. 2013, PMID: 23505323). This NHLBI sponsored RO1 project aimed to identify genetic variants contributing to serum lipid levels in Mexicans. In addition to de-identified genome wide genotypic data, a selected list of de-identified phenotypic data related to hypertriglyceridemia and related metabolic traits are also available in dbGaP.
GenADA is a multi-site collaborative study, involving GlaxoSmithKline Inc and nine medical centres in Canada, to develop a dataset containing 1000 Alzheimer's disease patients and 1000 ethnically-matched controls in order to associate DNA sequence (allelic) variations in candidate genes with Alzheimer's disease phenotypes. The study consists of both retrospective and prospective components, that is, patients with an existing diagnosis of Alzheimer's disease as well as newly diagnosed patients were enrolled in the study. Thus, clinical data was retrospectively or prospectively obtained on Day 1 of entry into GenADA. Where possible, biological relatives with Alzheimer's (up to third degree relationship such as cousins) and unaffected siblings of AD cases were also recruited. Note that recruitment numbers for biological relatives were lower than expected and genotypic data has not been submitted to dbGap for these subjects. The purpose of this study is: To identify DNA sequence variations (genotype) in candidate genes that are associated with the clinical symptoms and behavioural features of Alzheimer's disease (phenotype), which differ between study participants with and without the disease. To identify other genotype-phenotype associations in cognitively impaired study participants such as age of onset, family history, rate of cognitive decline, patterns of behavioural/psychiatric non-cognitive symptoms factors, response to treatment co-morbid conditions, and risks/exposure. The final subject recruitment for this study included 875 Alzheimer's disease patients, 850 ethnically-matched controls, and 37 family members. GenADA LONG is a longitudinal assessment to the original GenADA study. Eligible subjects were recruited from five of the nine memory clinics that participated in the GenADA study. Mild to moderate AD participants, a matched subset of controls, and biological related siblings (both affected and unaffected) or other blood relatives affected with AD, were initially examined a minimum of 12 months from recruitment into the original GenADA study, then at two further intervals of 12 and 18 months after time of entry into GenADA LONG. This enables an evaluation of the disease progression in AD patients and a determination of whether controls show evidence of cognitive decline. The overall goal of this extension study is to identify genetic differences and environmental influences that modulate the age of onset of the disease, the course of the disease, and/or biomarkers for neurodegenerative processes. Three of the five memory clinics that participated in the GenADA LONG study recruited eligible patients into GenADA Imaging. A concurrent neuroimaging sub-study was conducted at three of the five memory clinics participating in GenADA LONG. Eligible AD cases with mild to moderate AD, who were recruited into the original GenADA study and participated in the GenADA LONG extension study, were enrolled. Additionally, controls that showed signs of cognitive decline, as part of the assessment in GenADA LONG, were imaged at baseline, 12 and 18 month scanning intervals. The objective of this study is to find genes that: affect changes in AD brain volume measure by magnetic resonance in order to investigate how well change in brain volume predicts other key clinical measures in AD, such as neurodegenerative scales; that correlate changes in brain volume for other genotype-phenotype associations in cognitively impaired study participants; and that correlate with other clinically applicable magnetic resonance measures of pathology that can be conducted at the same time as structural volume measures, and are complementary to the volume measures. The ultimate aim of this research is to obtain a better understanding and definition of Alzheimer's Disease in order to develop new improved medicines.
For the Clinical Trials Sequencing Project (CTSP), National Cancer Institute (NCI) will utilize whole genome sequencing and/or whole exome sequencing in conjunction with transcriptome sequencing to try to identify recurrent genetic alterations (mutations, deletions, amplifications, rearrangements) and/or gene expression signatures that would be important to the hypothesis(es) submitted by the investigators. The samples will be processed and submitted for genomic characterization using pipelines and procedures established within The Cancer Genome Analysis (TCGA) project. Data analysis will be performed as a collaboration between the National Clinical Trials Network (NCTN) Group and its investigators submitting the proposal. The investigators at the NCI-sponsored Genomic Data Analysis Center (GDAC) will characterize the samples. The NCTN Group will be responsible for providing the clinical data needed for the proposal to the open clinical system maintained by NCI CCG's Biospecimen Core Resource (BCR) at Nationwide Children's Hospital in Columbus, Ohio. The project team (Network Group/investigators and GDAC) will analyze the data together. Additionally, clinical and genomic data related to the analyses will also need to be registered by NCI and will be made available to qualified researchers via a controlled-access database (e.g., dbGaP) upon publication of the primary analysis described in the study proposal. A substudy description and its molecular data information are provided under its own page: phs001184 CTSP Diffuse Large B-Cell Lymphoma (DLBCL) CALGB 50303
The goal of our studies is to identify genetic modifiers of neurodegeneration in Huntington's disease (HD). HD is caused by expansion of CAG repeats in the huntingtin (Htt) gene, with longer stretches often leading to more rapid disease onset and progression. Yet, for a given number of repeats, the age of symptom onset can be variable, differing by up to decades. Thus, the age of onset of motor symptoms in HD is only partly explained by the length of the CAG expansion. Available evidence suggests that genetic modifiers contribute to the variation in HD onset. Identifying genetic modifiers is important because they may provide critical insights into HD pathogenesis and reveal key pathways that could be targeted by novel HD therapeutics. This is important since there are no disease-modifying therapies for HD, and mHtt is an unattractive small-molecule drug target. We recruited 21 HD families with varying characteristics of disease progression and age of onset and obtained medical histories, clinical records and DNA samples that were subjected to whole-genome sequencing (WGS). These WGS data describe families of 104 subjects, including HD patients and their unaffected family members. These individuals were selected based on individual clinical histories and family structures that best fit our criteria for expressing potential genetic modifiers. We are testing the hypothesis that novel, rare genetic variants contribute to HD and those genetic modifiers can be identified by WGS.
Hepatocellular Carcinoma (HCC) is a leading cause of cancer-related death and can be considered a prototype of inflammation-derived cancer arising from chronic liver injury. The cell composition of the HCC tumor immune microenvironment (TiME) has a major impact on cancer biology as the TiME can have divergent capacities on tumor initiation, progress, and response to therapy. Recent development of multi-omics and single-cell technologies help us to comprehensively quantify the cellular heterogeneity and spatial organization of the TiME and to further our understanding of antitumor immunity. We investigated the cellular composition of liver cancer patient samples (n=8) using single-cell RNA-seq. For this purpose, mononuclear cells were isolated from human liver cancer samples. Three locations within the tumor-bearing liver were used: adjacent liver, rim and tumor core. Then, cells were FACS sorted (either CD45+ cells or MAIT cells) and subjected to 10X Chromium-based scRNA-seq. We investigated immune cell changes and the cellular heterogeneity of the HCC TiME with a focus on MAIT cells. Annotated H5 files are provided. Clinical metadata including TMN stage, sex, gender, ethnicity, pretreatment, and histopathological reports are available for all patient samples. Further details on the study can be obtained in our paper once it’s published.A complementary dataset derived from ultrahighplex CO-Detection by indEXing (CODEX) from paired patient samples can be accessed through The Cancer Imaging Archives (TCIA) under DOI: https://doi.org/10.7937/bh0r-y074.
The genetic makeup of an individual strongly influences the risk of developing systemic lupus erythematosus (SLE). The identification of genes that predispose an individual to SLE will lead to earlier and better diagnosis, better treatments, and possibly prevention. To this end, the International Consortium on the Genetics of Systemic Lupus Erythematosus (SLEGEN) was formed in 2005 and is composed of lupus researchers who agreed to pool their knowledge and resources to search for genes that predispose to lupus. Eight laboratories contributed DNA samples for genotyping at the Broad Institute and association with SLE was performed by the Data Coordinating Center (Wake Forest University), as part of a four stage study design. Stages one and two of this design were graciously funded by the Alliance for Lupus Research (www.lupusresearch.org). In this stage of the study, approximately 767 SLE patients (cases) were compared to approximately 383 non-SLE patients (controls) for differences among the Illumina HumanHap300. The affected individuals are all females of European decent. 82% of the cases are the index case from multiplex pedigrees for SLE and the remaining 18% have self-reported first degree relatives with SLE. A detailed summary of the methods and results can be found in the manuscript in Nature Genetics February 2008 by SLEGEN "Genome-wide association scan in women with systemic lupus erythematosus identifies susceptibility variants ITGAM, PXK, KIAA1542 and other loci". (Please see also Study Accession: phs000202.v1.p1)
This research is a non-NIH effort to discover the pathogenesis of non-syndromic microtia and to detect potential candidate genes. Microtia (OMIM 600674, 612290, 251800) is a spectrum of congenital anomalies and facial abnormalities, 90% of the cases followed by hearing loss. Two types of the disease are described in terms of connection to the affected syndromes: anomalies (syndromic) and only clinical manifestation (non-syndromic) associated with other symptoms. Though the exact pathogenesis of microtia remains unknown, pathogenic genes of majority of the microtia syndromic types have been identified. However, the genetic mutation of the non-syndromic microtia has not been confirmed. In this study, whole genome sequencing (WGS) was utilized to screen mutations in monozygotic discordant twins of congenital microtia-atresia and their family members. One of the twins, along with her grandfather, was discordant for microtia-atresia. By bioinformatic analysis of the WGS data from monozygotic twins with microtia and their non-microtia relatives, we proposed that among 28 identified potential genes, 4 of them (CNOT1, ODAD4, TBX15, and GIPC3) could be more responsible for non-syndromic microtia development and phenotypic features. Interested investigators may request to access the DNA genomic sequencing data of the study participants via dbGaP authorized access. In addition to raw genomic sequencing data, datasets of germline mutations and candidate gene mutations may also be available for investigators to request.
After conception, postzygotic mutations continuously occur throughout life in humans, causing somatic mosaicism in an individual. The variant type, time of origination, and locations of the mosaic mutations result in unique mosaic patterns in a combinatorial manner and further affect phenotypes, including various noncancerous diseases. Several efforts have, thus, been made to identify the mutational landscape and mechanisms underlying the mosaic mutations. From a technical aspect, the accurate detection of mutations is at the core of the mosaicism research, and the construction of a standard reference is a critical first step to serve as the basis for analytical validation and benchmarks for germline and somatic mutation detection. Furthermore, a reference standard for mosaic mutations is urgently needed to enable more advanced research.Herein, we generated robust, large-scale, and cell line mixture-based reference standards that contain abundant mosaic SNVs and INDELs along with wild-type sites and germ line variants. Moreover, the whole reference standard set mimics the cumulative aspect of mosaic variant acquisition in the early developmental stage through a progressive cell line mixing with confirmed genotypes. It can be used for optimizing the current use of mosaic variant detection strategies, as well as developing algorithms for future improvements.We previously generated an NCBI BioProject (PRJNA758606) to release much of these data. Only the data that are subject to authorized access constraints have been included in this submission to dbGaP. The two data sources are thus, complementary to each other.
The generation and maintenance of protective immunity is a dynamic interplay between host and environment that is impacted by age. Understanding fundamental changes in the healthy immune system that occur over a lifespan is critical in developing interventions for age-related susceptibility to infections and diseases. Here, we use multi-omic profiling (scRNA-seq, proteomics, flow cytometry) to examine human peripheral immunity in over 300 healthy adults, with 96 young and older adults followed over two years with yearly vaccination. The resulting resource includes scRNA-seq datasets of >16 million PBMCs, interrogating 71 immune cell subsets from our new Immune Health Atlas. This study allows unique insights into the composition and transcriptional state of immune cells at homeostasis, with vaccine perturbation, under Cytomegalovirus (CMV) infection and across age. We find that T cells specifically accumulate age-related transcriptional changes more than other immune cells, independent from inflammation and chronic perturbation. Moreover, impaired memory B cell responses to vaccination are linked to a Th2-like state shift in older adults' memory CD4 T cells, revealing possible mechanisms of immune dysregulation during healthy human aging. This extensive resource is provided with a suite of exploration tools at https://apps.allenimmunology.org/aifi/insights/dynamics-imm-health-age/ to enhance data accessibility and further the understanding of immune health across age. Please note that the Sound Life Cohort raw data are spread across three dbGap submissions: phs003841.v1 (this study), phs003944.v1, and phs003400.v1.
The placenta serves as the interface between the mother and fetus, facilitating the exchange of gases and nutrients between their separate blood circulation systems. Trophoblasts in the placenta play a central role in this process. Our current understanding of mammalian trophoblast development relies largely on mouse models. However, given the diversification of mammalian placentas, findings from the mouse placenta cannot be readily extrapolated to other mammalian species, including humans. To fill this knowledge gap, we performed CRISPR knockout (KO) screening in human trophoblast stem cells (hTSCs). We targeted genes essential for mouse placental development and identified more than 100 genes as critical regulators in both human hTSCs and mouse placentas. Among them, we further characterized in detail two transcription factors, DLX3 and GCM1, and revealed their essential roles in hTSC differentiation. Moreover, a gene function-based comparison between human and mouse trophoblast subtypes suggests that their relationship may differ significantly from previous assumptions based on tissue localization or cellular function. Notably, our data reveal that hTSCs may not be analogous to mouse TSCs or the extraembryonic ectoderm (ExE) in which in vivo TSCs reside. Instead, hTSCs may be analogous to progenitor cells in the mouse ectoplacental cone and chorion. This finding is consistent with the absence of ExE-like structures during human placental development. Our data not only deepen our understanding of human trophoblast development but also facilitate cross-species comparison of mammalian placentas.
Loss-of-function (LoF) variants in the FLG gene are associated with ichthyosis vulgaris (IV) and atopic dermatitis (AD). IV and AD patients with two FLG LoF variants are reported to have a more severe disease course. However, subsets of compound heterozygous patients display phenotypic heterogeneity despite a prediction of a severe phenotype from their genotype. The underlying genetic mechanism of this phenotypic variability is unknown. One possibility is that two LoF FLG variants could be located in cis (on the same allele), thus altering the predicted gene dosage. To investigate the allelic contribution of compound heterozygous FLG LoF mutations to IV phenotypic variability, we employ long reads from Oxford Nanopore Technologies, utilising two methodologies to phase the entire coding region of FLG into parental alleles: an amplicon-sequencing approach and an adaptive-sampling strategy. We investigated the allelic phasing of FLG LoF variants and intragenic-repeat copy-number variations (CNV) of 21 subjects with known compound heterozygous genotypes and mixed IV severity. Our results suggest that IV severity is not linked to the presence of cis variants in our IV patient cohort. Nanopore reads enabled detection and accurate phasing of CNVs and LoF variants into parental alleles, providing a framework for future in-depth genetic analysis. These results highlight the usefulness of utilising long reads for interrogating the FLG gene and other highly repetitive genes for genetic stratification related to clinical phenotypes.
Emerging evidence suggests a correlation between CD8+ T cell-tumor cell proximity and anti-tumor immune response. However, it remains unknown whether these cells exist as functional clusters that can be isolated from clinical samples. By conventional and imaging flow cytometry, we show here that from 21/21 human melanoma metastases, we could isolate heterotypic clusters, comprising CD8+ T cells interacting with one or more tumor cells and/or antigen-presenting cells (APCs). Single-cell RNA-sequencing revealed that T cells from clusters were enriched for gene signatures associated with tumor reactivity and exhaustion. Clustered T cells exhibited increased TCR clonality indicative of expansion, while TCR-matched T cells showed more exhaustion and co-modulation when conjugated to APCs than to tumor cells. T cells that were expanded from clusters ex vivo exerted on average 9-fold increased killing activity towards autologous melanomas, which was accompanied by enhanced cytokine production. Upon adoptive cell transfer into mice, T cells from clusters showed improved patient-derived melanoma control, which was associated with increased T cell infiltration and activation. Together, these results demonstrate that tumor-reactive CD8+ T cells are enriched in functional clusters with tumor cells and/or APCs and that they can be isolated and expanded from clinical samples. Typically excluded by single cell gating in flow cytometry, these distinct heterotypic T cell clusters serve as a valuable source to decipher functional tumor-immune cell interactions, while they may also be therapeutically explored.
Aim of study: Exploring single cell transcriptomics in expanded Tregs of with autoimmune polyendocrine syndrome type 1 (APS-1) patients (global gene expression, immune panel, TCR). N= 9 APS-1 patients and 9 healthy controls Protocol 1. Regulatory T cells (Tregs) were isolated from EDTA-blood, activated with anti-CD3/CD28 and IL2, expanded in cell culture for 14 days and stored at -150℃ in AB serum or FBS + 5% DMSO until use. 2. Cells were thawed, run through a Live/Dead column and carefully counted. 10000 Tregs were used as input for the 10x protocol. 3. cDNA was generated and amplified using Single Cell VDJ 5’ Gel Beads, Chromium Next GEM Chip K Single cell Kit, Next GEM Single Cell 5’ GEM Kit v2 and 5’ Feature Barcode Kit (all from 10x genomics). 4. Gene expression (GEX) libraries were generated by using the Library Construction Kit from 10x genomics. 5. T-cell receptor (TCR) libraries were generated using the Single Cell Human TCR Amplification Kit and the Library Construction Kit from 10x genomics. 6. 10x Genomics Target Hybridization Kit and Human Immunology Panel was used with GEX libraries. Only 8 patient samples and 8 control samples were used for the immune panel. 7. All libraries have a unique sample index (Dual Index TT Set A). 8. All samples were sequenced on Illumina NovaSeq 6000 on a NovaSeq S2 flow cell.
Technological advances have enabled transcriptome characterization of cell types at the single-cell level providing new biological insights. New methods that enable simple yet high-throughput single-cell expression profiling are highly desirable. Here we report a novel nanowell-based single-cell RNA sequencing system, ICELL8, which enables processing of thousands of cells per sample. The system employs a 5184-nanowell-containing microchip to capture ~1300 single cells and process them. Each nanowell contains preprinted oligonucleotides encoding poly-d(T), a unique well barcode, and a unique molecular identifier. The ICELL8 system uses imaging software to identify nanowells containing viable single cells and only wells with single cells are processed into sequencing libraries. Here, we report the performance and utility of ICELL8 using samples of increasing complexity from cultured cells to mouse solid tissue samples. Our assessment of the system to discriminate between mixed human and mouse cells showed that ICELL8 has a low cell multiplet rate (< 3%) and low cross-cell contamination. We characterized single-cell transcriptomes of more than a thousand cultured human and mouse cells as well as 468 mouse pancreatic islets cells. We were able to identify distinct cell types in pancreatic islets, including alpha, beta, delta and gamma cells. Overall, ICELL8 provides efficient and cost-effective single-cell expression profiling of thousands of cells, allowing researchers to decipher single-cell transcriptomes within complex biological samples.
The Genome of the Netherlands (GoNL) Project characterizes DNA sequence variation, common and rare, for SNVs, short insertions and deletions (indels) and larger deletions in 769 individuals of Dutch ancestry selected from five biobanks under the auspices of the Dutch hub of the Biobanking and Biomolecular Research Infrastructure (BBMRI-NL). The samples come from a representative sample of 250 trio-families from all provinces in the Netherlands. The parent-offspring trios include adult individuals ranging in age from 19 to 87 years (mean=53 years; SD=16 years) from birth cohorts 1910-1994. Sequencing was done on blood-derived DNA from uncultured cells and accomplished coverage was 14-15x. Samples where contributed by LifeLines (http://lifelines.nl/lifelines-research/general), The Leiden Longevity Study (http://www.healthy-ageing.nl; http://www.langleven.net), The Netherlands Twin Registry (NTR: http://www.tweelingenregister. org), The Rotterdam studies, (http://www.erasmus-epidemiology.nl/rotterdamstudy) and the Genetic Research in Isolated Populations program (http://www.epib.nl/research/geneticepi/research.html#gip). The sequencing was carried out in collaboration with the Beijing Institute for Genomics (BGI). The analysis was done by a consortium lead by UMCG, LUMC, Erasmus MC, VU university and UMCU, see http://www.nlgenome.nl. Funding for the project was provided by the Netherlands Organization for Scientific Research under award number 184021007, dated July 9, 2009 and made available as a Rainbow Project of the Biobanking and Biomolecular Research Infrastructure Netherlands (BBMRI-NL).
In the UK10K project we propose a series of complementary genetic approaches to find new low frequency/rare variants contributing to disease phenotypes. These will be based on obtaining the genome wide sequence of 4000 samples from the TwinsUK and ALSPAC cohorts (at 6x sequence coverage), and the exome sequence (protein coding regions and related conserved sequence) of 6000 samples selected for extreme phenotypes. Our studies will focus primarily on cardiovascular-related quantitative traits, obesity and related metabolic traits, neurodevelopmental disorders and a limited number of extreme clinical phenotypes that will provide proof-of-concept for future familial trait sequencing. We will analyse directly quantitative traits in the cohorts and the selected traits in the extreme samples, and also use imputation down to 0.1% allele frequency to extend the analyses to further sample sets with genome wide genotype data. In each case we will investigate indels and larger structural variants as well as SNPs, and use statistical methods that combine rare variants in a locus or pathway as well as single-variant approaches. The BioNED (Biomarkers for Childhood onset neuropsychiatric disorders) study has been carrying out detailed phenotypic assessments evaluating children with an autism spectrum disorder. These assessments included ADI-R, ADOS, neuropsychology, EEG etc. There are 56 DNA samples from this study (25 extracted from blood). For further information with regard to this cohort please contact Patrick Bolton (patrick.bolton@kcl.ac.uk).
In the UK10K project we propose a series of complementary genetic approaches to find new low frequency/rare variants contributing to disease phenotypes. These will be based on obtaining the genome wide sequence of 4000 samples from the TwinsUK and ALSPAC cohorts (at 6x sequence coverage), and the exome sequence (protein coding regions and related conserved sequence) of 6000 samples selected for extreme phenotypes. Our studies will focus primarily on cardiovascular-related quantitative traits, obesity and related metabolic traits, neurodevelopmental disorders and a limited number of extreme clinical phenotypes that will provide proof-of-concept for future familial trait sequencing. We will analyse directly quantitative traits in the cohorts and the selected traits in the extreme samples, and also use imputation down to 0.1% allele frequency to extend the analyses to further sample sets with genome wide genotype data. In each case we will investigate indels and larger structural variants as well as SNPs, and use statistical methods that combine rare variants in a locus or pathway as well as single-variant approaches. This is an Irish sample set of individuals with ASD (approximately 50% with comorbid intellectual disability). Individuals have been diagnosed with ADI/ ADOS, measures of cognition/ adaptive function. They represent a more severe, narrowly defined cohort of ASD subjects. Family histories are available for some with measures of broader phenotype. For further information on this cohort please contact Nadia Bolshakova (bolshakn@tcd.ie).