The Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) is a collaborative effort comprised of a coordinating center and scientific researchers from well-characterized cohort and case-control studies conducted in North America and Europe. This international consortium aims to accelerate the discovery of common and rare genetic risk variants for colorectal cancer by conducting large-scale meta-analyses of existing and newly generated genome-wide association study (GWAS) data, replicating and fine-mapping of GWAS discoveries, and investigating how genetic risk variants are modified by environmental risk factors. To expand these efforts, we assembled case-control sets or nested case-control sets from 20 different North American or European studies. Summary descriptions and study participant inclusions/exclusion criteria for each of these studies are detailed below. The Black Women's Health Study (BWHS): Is the largest follow-up study of the health of African-American women (Cozier et al., 2004; Rosenberg et al., 1995) [PMID: 15018884; PMID: 7722208]. The purpose is to identify and evaluate causes and preventives of cancers and other serious illnesses in African-American women. Among the diseases being studied are breast cancer, colorectal cancer, type 2 diabetes, uterine fibroids, systemic lupus erythematosus, and cardiovascular disease. The study began in 1995, when 59,000 black women from all parts of the United States enrolled through postal questionnaires. The women provided demographic and health data on the 1995 baseline questionnaire, including information on weight, height, smoking, drinking, contraceptive use, use of other selected medications, illnesses, reproductive history, physical activity, diet, use of health care, and other factors. The participants are followed through biennial questionnaires to determine the occurrence of cancers and other illnesses and to update information on risk factors. Self-reports of cancer are confirmed through medical records and state cancer registry records. Mouthwash-swish samples, as a source of DNA, were obtained from ~26,000 BWHS participants in 2002-2007. DNA was isolated from the mouthwash-swish samples at the Boston University Molecular Core Genetics Laboratory using the QIAAMP DNA Mini Kit (Qiagen). All incident colorectal cancer cases with a DNA sample were included in the present analysis. Two controls per case, selected from among BWHS participants free of colorectal cancer at end of follow-up, were matched to cases on year of birth (+/- 2 years) and geographical region of residence (Northeast, South, Midwest, and West). A total 209 colorectal cancer cases and 423 controls were sent for genotyping. Campaign Against Cancer and Heart Disease (CLUE II): The Campaign Against Cancer and Heart Disease, is a prospective cohort designed to identify biomarkers and other factors associated with risk of cancer, heart disease, and other conditions (Kakourou et al., 2015) [PMID: 26220152]. 32,894 participants were recruited from May through October 1989 from Washington County, Maryland and surrounding communities. Colorectal cancer cases (n = 297) and matched controls (n = 296) were identified between 1989 and 2000 among participants in the CLUE II cohort of Washington County, Maryland. Colorectal Cancer Study of Austria (CORSA): In the ongoing colorectal cancer study of Austria (CORSA), more than 13,000 Caucasian participants have been recruited within the province-wide screening project "Burgenland Prevention Trial of Colorectal Disease with Immunological Testing" (B-PREDICT) since 2003 (Hofer et al., 2011) [PMID: 21422235]. All inhabitants of the Austrian province Burgenland aged between 40 and 80 years are annually invited to participate in fecal immunochemical testing and haemoccult positive screening participants are invited for colonoscopy. CORSA includes genomic DNA and plasma of colorectal cancer cases, low-risk and high-risk adenomas, and colonoscopy-negative controls. Controls received a complete colonoscopy and were free of colorectal cancer or polyps. CORSA participants have been recruited in the four KRAGES hospitals in Burgenland, Austria, and additionally, at the Medical University of Vienna (Department of Surgery), the Viennese hospitals "Rudolfstiftung" and the "Sozialmedizinisches Zentrum Sud", and at the Medical University of Graz (Department of Internal Medicine). 1403 colorectal cancer and advanced colorectal adenoma cases, and 1404 matched controls were selected for the study. Distribution of factors sex and age (5 year strata) were evenly matched between cases and controls. Cancer Prevention Study II (CPS II): The CPS II Nutrition cohort is a prospective study of cancer incidence and mortality in the United States, established in 1992 and described in detail elsewhere (Calle et al., 2002; Campbell et al., 2014) [PMID: 12015775; PMID: 25472679]. At enrollment, participants completed a mailed self-administered questionnaire including information on demographic, medical, diet, and lifestyle factors. Follow-up questionnaires to update exposure information and to ascertain newly diagnosed cancers were sent biennially starting in 1997. Reported cancers were verified through medical records, state cancer registry linkage, or death certificates. The Emory University Institutional Review Board approves all aspects of the CPS II Nutrition Cohort. A total of 360 cases and 359 controls were selected for this study. Czech Republic Colorectal Cancer Study (Czech Republic CCS): Cases with positive colonoscopy results for malignancy, confirmed by histology as colon or rectal carcinomas, were recruited between September 2003 and May 2012 in several oncological departments in the Czech Republic (Prague, Pilsen, Benesov, Brno, Liberec, Ples, Pribram, Usti and Labem, and Zlin). Two control groups, sampled at the same time of cases recruitment, were included in the study. The first group consisted of hospital-based individuals with a negative colonoscopy result for malignancy or idiopathic bowel diseases. The reasons for the colonoscopy were: i) positive fecal occult blood test, ii) hemorrhoids, iii) abdominal pain of unknown origin, and iv) macroscopic bleeding. The second control group consisted of healthy blood donor volunteers from a blood donor center in Prague. All individuals were subjected to standard examinations to verify the health status for blood donation and were cancer-free at the time of the sampling. Details of CRC cases and controls have been reported previously (Vymetalkova et al., 2014; Naccarati et al., 2016; Vymetalkova et al., 2016) [PMID: 24755277; PMID: 26735576; PMID: 27803053]. All subjects were informed and provided written consent to participate in the study. They approved the use of their biological samples for genetic analyses, according to the Declaration of Helsinki. The design of the study was approved by the Ethics Committee of the Institute of Experimental Medicine, Prague, Czech Republic. All subjects included in the study were Caucasians and comprised 1792 cases and 1764 matched controls. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age and sex. Age was matched on +-5 years, whereas sex was matched exactly. For the cases without matched controls, matching was done only on sex. Early Detection Research Network (EDRN): The aim of the EDRN initiative is to develop and sustain a biorepository for support of translational research (Amin et al., 2010) [PMID: 21031013]. High-quality biospecimens were accrued and annotated with pertinent clinical, epidemiologic, molecular and genomic information. A user-friendly annotation tool and query tool was developed for this purpose. The various components of this annotation tool include: CDEs are developed from the College of American Pathologists (CAP) Cancer Checklists and North American Association of Central Cancer Registries (NAACR) standards. The CDEs provides semantic and syntactic interoperability of the data sets by describing them in the form of metadata or data descriptor. A total of 352 colorectal case samples and 399 controls were selected for this study. Controls were matched to CRC cases based on age and sex. The EPICOLON Consortium (EPICOLON): The EPICOLON Consortium comprises a prospective, multicentre and population-based epidemiology survey of the incidence and features of CRC in the Spanish population (Fernandez-Rozadilla et al., 2013) [PMID: 23350875]. Cases were selected as patients with de novo histologically confirmed diagnosis of colorectal adenocarcinoma. Patients with familial adenomatous polyposis, Lynch syndrome or inflammatory bowel disease-related CRC, and cases where patients or family refused to participate in the study were excluded. Hospital-based controls were recruited through the blood collection unit of each hospital, together with cases. All of the controls were confirmed to have no history of cancer or other neoplasm and no reported family history of CRC. Controls were randomly selected and matched with cases for hospital, sex and age (+- 5 years). A total of 370 cases and 370 controls were selected for genotyping. Hawaii Adenoma Study: For this adenoma study, two flexible-sigmoidoscopy screening clinics were first used to recruit participants on Oahu, Hawaii. Adenoma cases were identified either from the baseline examination at the Hawaii site of the Prostate Lung Colorectal and Ovarian cancer screening trial during 1996-2000 or at the Kaiser Permanente Hawaii's Gastroenterology Screening Clinic during 1995-2007. In addition, starting in 2002 and up to 2007, we also approached for recruitment all eligible patients who underwent a colonoscopy in the Kaiser Permanente Hawaii Gastroenterology Department. Cases were patients with histologically confirmed first-time adenoma(s) of the colorectum and were of Japanese, Caucasian or Hawaiian race/ethnicity. Controls were selected among patients with a normal colorectum and were individually matched to the cases on age at exam, sex, race/ethnicity, screening date (+-3 months) and clinic and type of examination (colonoscopy or flexible sigmoidoscopy). We recruited 1016 adenoma cases (67.8% of all eligible) and 1355 controls (69.2% of all eligible); 889 cases and 1169 controls agreed to give a blood and 29 cases and 34 controls, a mouthwash sample. A total of 989 cases and 1185 controls were genotyped for this study. Columbus-area HNPCC Study (HNPCC, OSUMC): Patients with colorectal adenocarcinoma diagnosed at six participating hospitals were eligible for this study, regardless of age at diagnosis or family history of cancer. Patients with a clinical diagnosis of familial adenomatous polyposis were not eligible for this study. These six hospitals perform the vast majority of all operations for CRC in the Columbus metropolitan area (population 1.7 million). The institutional review board at all participating hospitals approved the research protocol and consent form in accordance with assurances filed with and approved by the United States Department of Health and Human Services. Briefly, during the period of January 1999 through August 2004, 1,566 eligible patients with CRC were accrued to the study (Hampel et al., 2008) [PMID 18809606]. A total of 1472 colorectal cancer samples had enough blood DNA remaining to be sent for genotyping. Control samples were provided by the Ohio State University Medical Center%#39;s (OSUMC) Human Genetics Sample Bank. The Columbus Area Controls Sample Bank is a collection of control samples for use in human genetics research that includes both donors' anonymized biological specimens and linked phenotypic data. The data and samples are collected under the protocol "Collection and Storage of Controls for Genetics Research Studies", which is approved by the Biomedical Sciences Institutional Review Board at OSUMC. Recruitment takes place in OSUMC primary care and internal medicine clinics. If individuals agree to participate, they provide written informed consent, complete a questionnaire that includes demographic, medical and family history information, and donate a blood sample. 4-7 ml of blood is drawn into each of 3 ACD Solution A tubes and is used for genomic DNA extraction and the establishment of an EBV-transformed lymphoblastoid cell culture, cell pellet in Trizol, and plasma. Controls were matched to CRC cases as 1:1. Matching was done on age at reference time (age_ref), race, and sex. Age_ref was matched on +-5 years. Sex and race were matched exactly. For the cases without matched controls, matching was done only on sex and race with 1:1 ratio. Since controls are fewer than cases, one control is matched on 2 cases at most. Health Professionals Follow-up Study (HPFS): A parallel prospective study to the NHS (Nurses' Health Study). The HPFS cohort comprised 51,529 men aged 40-75 who, in 1986, responded to a mailed questionnaire (Rimm et al., 1990) [PMID: 2090285]. Participants provided information on health related exposures, including current and past smoking history, age, weight, height, diet, physical activity, aspirin use, and family history of colorectal cancer. Colorectal cancer and other outcomes were reported by participants or next-of-kin and were followed up through review of the medical and pathology record by physicians. Overall, more than 97% of self-reported colorectal cancers were confirmed by medical record review. Information was abstracted on histology and primary location. Incident cases were defined as those occurring after the subject provided the blood sample. Prevalent cases were defined as those occurring after enrollment in the study but before the subject provided the blood sample. Follow-up evaluation has been excellent, with 94% of the men responding to date. Colorectal cancer cases were ascertained through January 1, 2008. In 1993-1995, 18,825 men in the HPFS mailed blood samples by overnight courier, which were aliquoted into buffy coat and stored in liquid nitrogen. In 2001-2004, 13,956 men in the HPFS who had not provided a blood sample previously mailed in a swish-and-spit sample of buccal cells. Incident cases were defined as those occurring after the subject provided a blood or buccal sample. Prevalent cases were defined as those occurring after enrollment in the study in 1986, but before the subject provided either a blood or buccal sample. After excluding participants with histories of cancer (except nonmelanoma skin cancer), ulcerative colitis, or familial polyposis, case-control sets were previously constructed. In addition to colorectal cancer cases and controls, a set of adenoma cases and matched controls with available DNA from buffy coat were selected for genotyping. Over the follow-up period, data were collected on endoscopic screening practices and, if individuals had been diagnosed with a polyp, the polyps were confirmed to be adenomatous by medical record review. Adenoma cases were ascertained through January 1, 2008. A separate case-control set was constructed of participants diagnosed with advanced adenoma matched to control participants who underwent a lower endoscopy in the same time period and did not have an adenoma. Advanced adenoma was defined as an adenoma 1 cm or larger in diameter and/or with tubulovillous, villous, or highgrade dysplasia/carcinoma-in-situ histology. Matching criteria included year of birth (within 1 year) and month/ year of blood sampling (within 6 months), the reason for their lower endoscopy (screening, family history, or symptoms), and the time period of any prior endoscopy (within 2 years). Controls matched to cases with a distal adenoma either had a negative sigmoidoscopy or colonoscopy examination, and controls matched to cases with proximal adenoma all had a negative colonoscopy. In total, 159 advanced adenoma cases and 109 controls were selected for genotyping. Leeds Colorectal Cancer Study (LCCS): Following local ethical approval, colorectal cancer cases were recruited from 1997 until 2012 in Leeds, UK through surgical clinics. Initially, funding was provided by the UK Ministry of Agriculture, Farming and Fisheries (subsequently the Food Standards Agency) and Imperial Cancer Research Fund (subsequently Cancer Research UK). Recruitment also occurred similarly in Dundee, Perth and York between the periods of 1997 and 2001 using the same protocol and the data and samples were combined. Pathologically confirmed cases were consented at outpatient clinics, providing information on known and postulated risk factors for colorectal cancer (diet, lifestyle and family history) as well as providing a blood sample for DNA. Exclusion criteria included pre-existing diverticular disease and an inability to complete the questionnaire. The General Practitioners of cases (all UK residents have a nominated General Practitioner to whom to refer initial medical queries) and these GPs were asked to send letters to other persons on their patient list of the same gender and born within 5 years of the case. Subsequently to enhance the number of controls, we systematically invited patients from selected GP practices. Diet was assessed in cases and controls using an extensive dietary and lifestyle questionnaire modified by that produced by the European Prospective Investigation in Cancer (EPIC). The frequency that each specific food items were eaten was recorded and we also obtained average fruit and vegetable consumption as a cross-check. In total, 1591 cases and 739 controls provided a DNA sample. The North Carolina Colon Cancer Studies (NCCCS I/II): The North Carolina Colon Cancer Studies (NCCCS I- colon and NCCCS II-rectal) were population-based case-control studies conducted in 33 counties of North Carolina. Cases were identified using the rapid case ascertainment system of the North Carolina Central Cancer Registry. Patients with a first diagnosis of histologically confirmed invasive adenocarcinoma of the colon (cecum through sigmoid colon) between October 1996 and September 2000 were classified as potential cases in the NCCCS I. The NCCCS II included patients with a first diagnosis of histologically confirmed invasive adenocarcinoma of the sigmoid colon, rectosigmoid, or rectum (hereafter collectively referred to as rectal cancer) between May 2001 and September 2006. Additional eligibility requirements were: aged 40-80 years, residence in one of the 33 counties, ability to give informed consent and complete an interview, had a driver's license or identification card issued by the North Carolina Department of Motor Vehicles (if under the age of 65), and had no objections from the primary physician in regards to contacting the individual. Controls, identified and sampled during the respective study dates, were selected from two sources. Potential controls under the age of 65 were identified using the North Carolina Department of Motor Vehicles records. For those 65 years and older, records from the Center for Medicare and Medicaid Services were used. Controls were matched to cases using randomized recruitment strategies. Recruitment probabilities were done using strata of 5-year age, sex, and race groups. Dietary information was collected using a modified version of the semiquantitative food frequency questionnaire developed at the National Cancer Institute. In addition, participants were asked about vitamin and mineral supplementation, special diets, restaurant eating, sodium use, and fats used in cooking. In NCCCS I, 515 colorectal cases and 687 matched controls were sent for genotyping. In NCCCS II, 796 colorectal cases and 823 controls were sent from the NCCCS II for genotyping. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age, race, and sex. Age was matched on +-5 years. Race and sex was matched exactly. For the cases without matched controls, matching was done only on sex and race. Nurses Health Study (NHS): The NHS cohort began in 1976 when 121,700 married female registered nurses age 30-55 years returned the initial questionnaire that ascertained a variety of important health-related exposures (Belanger et al., 1978) [PMID: 248266]. Since 1976, follow-up questionnaires have been mailed every 2 years. Colorectal cancer and other outcomes were reported by participants or next-of-kin and followed up through review of the medical and pathology record by physicians. Overall, more than 97% of self-reported colorectal cancers were confirmed by medical-record review. Information was abstracted on histology and primary location. The rate of follow-up evaluation has been high: as a proportion of the total possible follow-up time, follow-up evaluation has been more than 92%. Colorectal cancer cases were ascertained through June 1, 2008. In 1989 -1990, 32,826 women in NHS I mailed blood samples by overnight courier, which were aliquoted into buffy coat and stored in liquid nitrogen. In 2001-2004, 29,684 women in NHS I who did not previously provide a blood sample mailed a swish-and-spit sample of buccal cells. Incident cases were defined as those occurring after the subject provided a blood or buccal sample. Prevalent cases were defined as those occurring after enrollment in the study in 1976 but before the subject provided either a blood or buccal sample. After excluding participants with histories of cancer (except nonmelanoma skin cancer), ulcerative colitis, or familial polyposis, case-control sets were previously constructed from which DNA was isolated from either buffy coat or buccal cells for genotyping. In addition to colorectal cancer cases and controls, a set of advanced adenoma cases and matched controls with available DNA from buffy coat were selected for genotyping. Over the follow-up period, data were collected on endoscopic screening practices and, if individuals had been diagnosed with a polyp, the polyps were confirmed to be adenomatous by medical record review. Adenoma cases were ascertained through June 1, 2011. A separate case-control set was constructed of participants diagnosed with advanced adenoma matched to control participants who underwent a lower endoscopy in the same time period and did not have an adenoma. Advanced adenoma was defined as an adenoma more than 1 cm in diameter and/or with tubulovillous, villous, or high-grade dysplasia/carcinoma-in-situ histology. Matching criteria included year of birth (within 1 year) and month/year of blood sampling (within 6 months), the reason for their lower endoscopy (screening, family history, or symptoms), and the time period of any prior endoscopy (within 2 years). Controls matched to cases with a distal adenoma either had a negative sigmoidoscopy or colonoscopy examination, and controls matched to cases with proximal adenoma all had a negative colonoscopy. A total of 272 cases and 236 matched controls were sent to CIDR for the advanced adenoma case-control set. Northern Swedish Health and Disease Study (NSHDS): Comprises over 110,000 participants, including approximately one third with repeated sampling occasions, from three population-based cohorts (Dahlin et al., 2010; Myte et al., 2016) [PMID: 20197478; PMID: 27367522]. The largest is the ongoing Vasterbotten Intervention Programme, in which all residents of Vasterbotten County are invited to a health examination upon turning 30 (some years), 40, 50 and 60 years of age. Extensive measured and self-reported health and lifestyle data, as well as blood samples for central biobanking in Umea, Sweden, are collected at the health exam. Leucocyte DNA samples for 1:1-matched CRC case-control sets from the NSHDS, of which 878 samples are included in this study, have been selected for genotyping. This is in addition to 354 samples from the NSHDS previously analyzed as part of the multicenter EPIC cohort. Cancer-specific and overall survival data are available for all patients. For at least 425 patients, archival tumor tissue has been analyzed for the BRAF V600E mutation and by sequencing codon 12 and 13 for KRAS mutations, as well as for MSI screening status by immunohistochemistry and for an eight-gene CIMP panel using quantitative real-time PCR (MethyLight). Ohio Colorectal Cancer Prevention Initiative (OCCPI, OSUMC): OCCPI (ClinicalTrials.gov identifier: NCT01850654) is a population-based study of colorectal cancer patients diagnosed in one of 51 hospitals throughout the state of Ohio from January 1, 2013 through December 31, 2016. The OCCPI was created to decrease CRC incidence in Ohio by identifying patients with hereditary predisposition (statewide universal tumor screening for newly diagnosed CRC patients), increase colonoscopy compliance for first-degree relatives of CRC patients, and encourage future research through the creation of a biorepository. The 51 Ohio hospitals participating in the OCCPI were selected to represent a cross-section of clinical centers in the state based on high reported volume of CRC patients, affiliation with a high volume hospital, or interest in participation. Institutional Review Board (IRB) approval was obtained by the individual hospitals, Community Oncology Programs, or by ceding review to the OSU IRB. Written informed consent was obtained. A total of 2139 colorectal cases were genotyped. Patients were considered eligible for this study if they were age 18 or older at the time of enrollment, if they had a surgical resection (or biopsy if unresectable) in the state of Ohio demonstrating an adenocarcinoma of the colorectum from 1/1/13 - 12/31/16. Matched control samples were selected from the Ohio State University Medical Center's (OSUMC) Human Genetics Sample Bank in an identical way to the selection for the Columbus-area HNPCC Study (please refer to the description for the Columbus-area HNPCC Study). Prostate, Lung, Colorectal and Ovarian Cancer Screening Trail (PLCO): PLCO enrolled 154,934 participants (men and women, aged between 55 and 74 years) at ten centers into a large, randomized, two-arm trial to determine the effectiveness of screening to reduce cancer mortality. Sequential blood samples were collected from participants assigned to the screening arm. Participation was 93% at the baseline blood draw. In the observational (control) arm, buccal cells were collected via mail using the "swish-and-spit" protocol and participation rate was 65%. Details of this study have been previously described (Huang et al., 2016) [PMID: 27673363] and are available online (http://dcp.cancer.gov/plco). For this study 1651 advanced adenoma cases and 1392 controls were selected for genotyping. Selenium and Vitamin E Prevention Trial (SELECT): The Selenium and Vitamin E Cancer Prevention Trial (SELECT) was a double-blind, placebo controlled clinical trial which explored using selenium and vitamin E alone and in combination to prevent prostate cancer in healthy men (Lippman et al., 2009) [PMID: 19066370]. Secondary endpoints included the prevention of colorectal and lung cancers. SELECT was conducted at 427 sites and centers in the United States, Canada and Puerto Rico; 35,533 men 55 years and older (50 or older if African American) were randomized beginning August 22, 2001. Supplementation was discontinued on October 23, 2008 due to futility. 308 colorectal cancer cases and 308 matched controls were selected from the SELECT population and sent for genotyping. Screening Markers For Colorectal Disease Study and Colonoscopy and Health Study (SMS-REACH): Details on this study population were previously reported (Burnett-Hartman et al., 2014) [PMID: 24875374]. Participants were enrollees in an integrated health-care delivery system in western Washington State (Group Health Cooperative, Seattle, Washington) aged 24-79 years who underwent an index colonoscopy for any indication between 1998 and 2007 and donated a buccal-cell or blood sample for genotyping analysis. Study recruitment took place in 2 phases, with phase 1 occurring in 1998-2003 and phase 2 occurring in 2004-2007. Persons who had undergone a colonoscopy less than 1 year prior to the index colonoscopy, persons with inadequate bowel preparation for the index colonoscopy, and persons with a prior or new diagnosis of colorectal cancer, a familial colorectal cancer syndrome (such as familial adenomatous polyposis), or another colorectal disease were ineligible. Patients diagnosed with adenomas or serrated polyps and persons who were polyp-free at the index colonoscopy (controls) were systematically recruited during both phases of recruitment. Approximately 75% agreed to participate and provided written informed consent. Based on medical records, persons who agreed to participate and those who refused study participation were similar with respect to age, sex, and colorectal polyp status. Study protocols were approved by the institutional review boards of the Group Health Cooperative and the Fred Hutchinson Cancer Research Center (Seattle, Washington). A total of 575 cases and 508 matched were selected for the study. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age_ref, race, and sex. Age_ref was matched on +-5 years. The Women's Health Initiative (WHI): WHI is a long-term national health study that has focused on strategies for preventing heart disease, breast and colorectal cancer, and osteoporotic fractures in postmenopausal women. The original WHI study included 161,808 postmenopausal women enrolled between 1993 and 1998. The Fred Hutchinson Cancer Research Center in Seattle, WA serves as the WHI Clinical Coordinating Center for data collection, management, and analysis of the WHI. The WHI has two major parts: a partial factorial randomized Clinical Trial (CT) and an Observational Study (OS); both were conducted at 40 Clinical Centers nationwide. The CT enrolled 68,132 postmenopausal women between the ages of 50-79 into trials testing three prevention strategies. If eligible, women could choose to enroll in one, two, or all three of the trial components. The components are: Hormone Therapy Trials (HT): This double-blind component examined the effects of combined hormones or estrogen alone on the prevention of coronary heart disease and osteoporotic fractures, and associated risk for breast cancer. Women participating in this component with an intact uterus were randomized to estrogen plus progestin (conjugated equine estrogens [CEE], 0.625 mg/d plus medroxyprogesterone acetate [MPA] 2.5 mg/d] or a matching placebo. Women with prior hysterectomy were randomized to CEE or placebo. Both trials were stopped early, in July 2002 and March 2004, respectively, based on adverse effects. All HT participants continued to be followed without intervention until close-out. Dietary Modification Trial (DM): The Dietary Modification component evaluated the effect of a low-fat and high fruit, vegetable and grain diet on the prevention of breast and colorectal cancers and coronary heart disease. Study participants were randomized to either their usual eating pattern or a low-fat dietary pattern. Calcium/Vitamin D Trial (CaD): This double-blind component began 1 to 2 years after a woman joined one or both of the other clinical trial components. It evaluated the effect of calcium and vitamin D supplementation on the prevention of osteoporotic fractures and colorectal cancer. Women in this component were randomized to calcium (1000 mg/d) and vitamin D (400 IU/d) supplements or a matching placebo. The Observational Study (OS)examines the relationship between lifestyle, environmental, medical and molecular risk factors and specific measures of health or disease outcomes. This component involves tracking the medical history and health habits of 93,676 women not participating in the CT. Recruitment for the observational study was completed in 1998 and participants were followed annually for 8 to 12 years. All centrally confirmed cases of invasive colorectal cancers, or deaths from colorectal cancer were selected as potential cases from September 30, 2015 database. Controls were participants free of colorectal cancer (invasive or in situ) as of September 30, 2015. Potential cases and controls were excluded if they (1) were non-White; (2) had history of colorectal cancers at baseline; (3) lost to follow-up after enrollment; (4) DbGAP ineligible; (5) had <1.25ug of DNA; (6) selected for WHI study M26 Phase I or II; (7) selected for WHI study AS224 and also included in the imputation project. A total of 578 cases and 104,429 controls met the eligibility criteria. Each case was matched with 1 control (1:1) that exactly met the following matching criteria: age (+-5 years), 40 randomization centers (exact), WHI date (+-3 years), CaD date (+-3 years), OS flag (exact), HRT assignments (exact), DM assignments (exact), and CaD assignments (exact). Control selection was done in a time-forward manner, selecting one control for each case from the risk set at the time of the case's event. The matching algorithm was allowed to select the closest match based on a criteria to minimize an overall distance measure (Bergstralh EJ, Kosanke JL. Computerized matching of cases to controls. Technical Report #56, Department of Health Sciences Research, Mayo Clinic, Rochester MN. April 1995). Each matching factor was given the same weight. When exact matches could not be found, the matching criteria were gradually relaxed among unmatched cases and controls until all cases had found matched controls. Using the matching criteria specified above, 559 of the 578 eligible cases found exact matches. The matching criteria was then relaxed to : Age+-5, randomization centers, WHI date +- 3 years, CaD date +- 3 years, OS flag, HRT flag, DM flag, CaD flag. 17 of the remaining 19 unmatched cases found matched controls. By matching on Age+-5, randomization centers, WHI date +- 3 years, CaD date +- 3 years, OS flag, HRT flag, the remaining 2 unmatched cases found their matches.
The aim of this study is to ascertain whether leukaemic mutations exist within the blood of people with otherwise normal haematopoeisis. To satisfy this aim we plan to look for 7 known leukaemic mutations in the whole blood DNA of a large cohort of blood donors who have normal haematopoesis. Genomic regions around mutational sites have been amplified using a 2 step PCR process which involves barcoding of individual patients
INternational VErapamil SR-Trandolapril STudy (INVEST) was a prospective, multicenter, randomized, open-label, parallel-clinical trial that evaluated blood pressure response and adverse outcomes in patients randomized to either an atenolol-based or a verapamil SR-based hypertension treatment strategy in 22,576 patients with documented coronary artery disease and hypertension (PMID: 9809930,14657064). Hydrochlorothiazide and trandolapril were added as needed to achieve BP control or end organ protection in a protocol-defined manner. Briefly, the protocol required patients to be followed up at bassline, 6, 12, 18 and 24 weeks and then every 6 months thereafter until 2 years after the last patient was enrolled. At each visit, patients had BP and heart rate measured, clinical assessment performed, and additional antihypertensive medications added as needed to meet JNC VI blood pressure goals. The primary outcome was the first occurrence of all-cause death, nonfatal myocardial infarction (MI), or nonfatal stroke (clinicaltrials.gov identifier: NCT00133692). The genetic substudy of INVEST, INVEST-GENES, collected genomic DNA samples from 5,979 INVEST study participants. INVEST-GENES samples were genotyped on the Illumina OmniExpressExome GWAS chip. This data release includes 1529 samples that passed quality control including 657 whites, 537 Hispanics and 155 blacks. Two case-control subsets were created: one for the primary outcome of all-cause death, nonfatal MI or nonfatal stroke; the other one is for new-onset diabetes. Resistant hypertension was also evaluated in 1349 participants based on the medication use and blood pressure measurements at the visit prior to experiencing study outcomes or censoring (PMID:30237584).
Background: - There are many types of immune disorders. These range from rare immune deficiencies to allergies to autoimmune disease like rheumatoid arthritis. Genes are the instructions our body uses to work and develop. A new technology called whole exome sequencing may help find the cause of these disorders. Whole exome sequencing is a way to look at many genes at once for errors. Researchers hope to find new gene changes that lead to immune disorders. Additionally, researchers are interested in finding the best way to manage unexpected but important findings by whole exome sequencing. Objectives: - To better understand genetic causes of immune system disorders. Also, to better understand people s thoughts and feelings about immune system disorders and new genomic testing. Eligibility: - People ages 0 100 with an immune disorder or a relative with an immune disorder. People must be at least 2 to be evaluated at the NIH clinical center. People must be at least 12 to do the survey/interview portion of the study. Design: Participants will have their genes sequenced. They may be asked for a new sample of blood. If participants cannot come for a study visit, they can have a blood sample collected by their local lab or doctor and sent by mail. Researchers may or may not find the cause of the participant s immune disorder. Participants will learn that information. Some participants may be asked to return to NIH to get results and have more tests. Researchers may share information with other studies. The data will be anonymous. For the survey part of the study, participants will answer questions about their or their relative s immune disorder. They will also answer about their thoughts and feelings about genomic testing. Some participants will be asked for a brief interview to ask more about the survey topics. There may be more follow-up after several months. From clinicaltrials.gov
The goal of the project is to complete a 600,000 tag SNP genome-wide association scan of 958 parent-child trios from the International Multisite ADHD Genetics (IMAGE) project, in order to assess the association of SNP markers with ADHD, analyze quantitative ADHD phenotypes, complete copy number analyses, assess parent of origin effects and season of birth effects, and test for epistasis among apparently uncorrelated genes. Acquiring DNA Samples All consent forms stipulate that the samples can only be used by researchers who have been approved by the National Institute of Mental Health (NIMH), National Institutes of Health. All consent forms, except those used at the Zürich site (N=141 subjects), explicitly indicate that the samples may be used by researchers from commercial enterprises seeking to benefit financially from the analysis of the samples. The Zürich consent does not prohibit such use. The Zürich consent form also included an "opt out" that allowed the subjects to indicate that they did not want their samples stored at the NIMH repository or used by researchers external to the project. No subjects enrolled in the project opted out. Consent groups and participant set ADHD (ADHD): 2758 (924 trios)
Microsatellite stable (MSS) colorectal cancers (CRC) are largely unresponsive to immune checkpoint inhibition (ICI), prompting investigation into strategies to enhance sensitivity. The MAYA trial, which utilized temozolomide (TMZ) in MGMT-silenced MSS mCRC, hypothesized that TMZ-induced hypermutation could sensitize tumors to ICI. This phase II trial met its primary endpoint, demonstrating durable clinical responses with TMZ combined with ipilimumab and nivolumab. To elucidate factors in uencing response heterogeneity, we conducted multi-omic spatial pro ling of samples from patients who participated in the MAYA trial, including baseline and on-treatment tissue and blood specimens. While increased neoantigen load following TMZ exposure did not consistently predict for deep responses, spatial profiling revealed key determinants. Lymphocyte proportions, particularly CD8+KI67+ cells, within stromal and tumor compartments, along with macrophage composition (CD68+CD163+ cells) at the tumor-stromal interface, were predictive of response. Treatment pressures dynamically altered the tumor microenvironment composition and activated peripheral immune cells. This study is the first to identify spatial predictors of response to this promising novel treatment approach for MSS CRC.
Understanding the pathology of COVID-19 is a global research priority. Early evidence suggests that the microbiome may be playing a role in disease progression, yet current studies report contradictory results. Here, we examine potential confounders in COVID-19 microbiome studies by analyzing the upper respiratory tract microbiome in well-phenotyped COVID-19 patients and controls combining microbiome sequencing, viral load determination, and immunoprofiling. We found that time in the intensive care unit and the type of oxygen support explained the most variation within the upper respiratory tract microbiome, dwarfing (non-significant) effects from viral load, disease severity, and immune status. Specifically, mechanical ventilation was linked to altered community structure, lower species- and higher strain-level diversity, and significant shifts in oral taxa previously associated with COVID-19.
The Colon Cancer Family Registry Cohort (CCFRC www.coloncfr.org) is a cohort of families recruited through six study sites located in the USA, Canada, Australia, and New Zealand. The CCFRC was formed as a resource to support studies on the etiology, prevention, and clinical management of colorectal cancer. Recruitment protocols fall broadly into two main categories: population-based and clinic-based ascertainment. Between 1998 and 2013, the CCFR recruited and interviewed probands who were either recently diagnosed with colorectal cancer that was reported to state or regional population cancer registries in the United States (Washington, California, Arizona, Minnesota, Colorado, New Hampshire, North Carolina, and Hawaii), Australia (Victoria), and Canada (Ontario) or were from multiple-case families referred to family-cancer clinics in the United States (Mayo Clinic in Rochester, Minnesota, and Cleveland Clinic in Cleveland, Ohio), Australia (Melbourne, Adelaide, Perth, Brisbane, and Sydney), New Zealand (Auckland), and Canada. Cases with known familial adenomatous polyposis were excluded. Once recruited, probands were asked for permission to contact their relatives for recruitment. Collectively, we have enrolled 42,49874 participants in 15,048 families who participated in standardized protocols used to collect information regarding family cancer history and colorectal cancer risk factors and biospecimens. All participants of population-based case families (excluding control-families) and clinic-based families are asked to provide updates on their personal and family history of cancer as well as their history of surgery, cancer screening, and some risk factors every 4-5 years either by telephone interview or by self-completed questionnaire via mail or online (PMID: 17982118).
Besides the predominant neurological involvement of PMM2-CDG patients, around 20% of patient mortality has been linked to severe infections. However, the immunological dysfunction in PMM2-CDG is not well understood. In this study, we aimed to address this gap and study the molecular mechanisms affected behind immunological issues in PMM2-CDG. Primary skin fibroblasts from three PMM2-CDG patients and three healthy controls were stimulated or not with TNF-alpha and their gene expression was assessed by RNA sequencing. The response to TNF-alpha was characterized based on the differential expressed genes and functional enrichment and and compared between PMM2-CDG and healthy individuais. Gene expression alterations and functional enrichment were further validated by several experimental assays by ELISA, Flow Cytometry and Western Blotting.
Platelets are anucleate cells produced in the bone marrow and derived from large progenitor cells called megakaryocytes (MKs). Platelets receive RNA transcripts from their progenitorial MKs during thrombopoiesis. However, the correspondence between platelet and MK transcriptomes is poorly understood, particularly in the context of germline mutations that cause platelet formation defects or thrombocytopenia. We have studied the effects of two such mutations on MK and platelet transcriptomes. We generated immortalized MK cell line (imMKCL)-based models of Bernard-Soulier syndrome and IKZF5-related thrombocytopenia. MKs derived from imMKCLs with either a homozygous deletion of GP9 (GP9-/-) or a heterozygous Y121F variant in IKZF5 (IKZF5WT/Y121F) exhibited reduced proplatelet formation (reductions of 96% and 57%, respectively). Platelets from patients with either GP9-/- or IKZF5WT/Y121F genotypes had broad transcriptomic dysregulation, suggesting that (pro)platelet formation defects due to mutations in glycoprotein receptor and transcription factor genes such as GP9 and IKZF5 affect the trafficking of transcripts from MKs into platelets. RNA-seq data from MKs at four stages of differentiation revealed widespread but distinct changes in expression over time between the GP9-/- and the IKZF5WT/Y121F genotypes. Dysregulated genes in GP9-/- MKs were enriched for RNA metabolism and actin/tubulin folding pathways while those in IKZF5WT/Y121F MKs were enriched for cell cycle pathways. Most of these genes were also dysregulated in the platelets of patients with the corresponding diseases. Our results suggest that patients with inherited forms of thrombocytopenia present with a transcript trafficking defect during platelet formation.
The Electronic Medical Record Phenotypes and Community-engaged Genomic Association Study aims to identify genetic variants mediating susceptibility to peripheral arterial disease (PAD). The study leverages a biorepository of blood samples of 1688 PAD cases and 1649 controls, and the electronic medical record (EMR) to annotate the biorepository. PAD cases were identified from the vascular laboratory database as having an ankle brachial index (ABI) <0.9 at rest or after exercise or having non-compressible vessels. Controls were without prior history of atherosclerotic vascular disease and when tested, no evidence of ischemia on a stress test. Phenotypes and environmental exposures including age, ethnicity, demographic and anthropometric data are derived from the Mayo Electronic Medical Record (EMR). Comorbidities were determined using algorithms for diabetes and hypertension based on ICD-9 codes and medication use. Relevant laboratory data, including lipid levels, fasting blood sugar and serum creatinine at index date or within a 1-year window of the index date were extracted. Medication classes at index date were identified using Mayo's Natural Language Processing-based system with RxNorm codification and NDF-RT terminologies mapping. Smoking status was confirmed by natural language processing of clinical notes. Genotyping of ~600,000 SNPs across the genome is being conducted at the Broad Institute using the Illumina 660W platform. Statistical analyses will be conducted to identify genetic variants associated with susceptibility to PAD.
This study consists of genomic sequencing data obtained through targeted sequencing of regulatory regions (85,394 human cCREs from ENCODE v2, version 2, https://screenv2.wenglab.org/) in 200 ASD (Autism Spectrum Disorder) trios (father and mother unaffected and proband affected). The aim of the project is to detect genomic variants (rare and inherited variants) in regulatory regions that contribute to autism.
Bulk RNA-sequencing of peripheral blood mononuclear cells collected from mild COVID-19 (n = 4), critical COVID-19 (n = 12) and control (n = 8) patients. Sequencing data is provided as FASTQ files.
Three SpCas9-ABE (R785X/R785X) and three xCas9-ABE-repaired organoid clones (F508del/R553X) and their respective unrepaired control organoids were paired-end whole genome sequenced using Illumina Novaseq 6000 system. The reads were mapped to hg19 genome assembly and data is provided as BAM files.
Colorectal cancer (CRC) develops from normal epithelium, through a benign precursor lesion called adenoma, by accumulation of genetic alterations affecting oncogenes and tumour suppressor genes. About 5% of colorectal adenomas are estimated to progress to CRC. However, it is important to identify which adenomas actually are at high-risk of progression, because these should serve as intermediate endpoints for e.g. CRC screening programs. In clinical practice, adenomas with a size of ≥10 mm, villous component and/or high-grade dysplasia, called advanced adenomas, are considered high-risk, although this classification lacks solid evidence. Specific DNA copy number changes are associated with adenoma-to-carcinoma progression.For this tissue dataset, we applied low-pass whole genome sequencing to 98 non-advanced and advanced adenomas. These adenomas were classified as lesions with low-risk or high-risk of progression, according to the presence of specific DNA copy number changes (Carvalho et al, CancerPrevRes, 2018).
Overall the outcomes of patients with metastatic melanoma have improved dramatically over the last decade due to an improved understanding of the molecular drivers of this disease. In particular, multiple targeted therapy regimens have been approved for patients with a BRAFV600E/K mutation, which are present in ~50% of cutaneous melanomas. These treatments achieve clinical responses in ~80% of patients with a BRAFV600E/K mutation, thus providing proof-of-concept of the therapeutic potential for personalized therapeutic strategies. However, most of the patients will progress within 2 years of starting those therapies. Further, currently there are no targeted therapies that have been shown to be effective in patients with a wild-type BRAF. Thus, there are unmet clinical needs to develop treatments that prevent or overcome resistance to existing therapies for patients with a BRAFV600E/K mutation, and that are effective in patients without a BRAF mutation. In order to facilitate the development of new therapeutic strategies, over the last 5 years we have led a major effort to develop a broad collection of PDX models to reflect the clinical, histological, and genetic heterogeneity of this disease. Our collection of PDX models represents one of the largest collections for any human malignancy, and our initial testing demonstrates that the collection accurately recapitulates the oncogenic drivers and molecular heterogeneity that is observed in patients. This collection also includes a subset of PDX established from patients with acquired resistance to targeted therapies that have been maintained on those agents in vivo to sustain their resistant phenotype. Together these efforts have generated a robust resource to develop, refine, and prioritize new personalized combinatorial therapies for patients. Thus, we propose to establish a multi-disciplinary and multi-institutional PDTC Program focused on the use and continued expansion of our robust melanoma PDX collection to identify new therapeutic approaches that fill important clinical gaps in this disease.
SCANDARE (NCT03017573) is a multicentric biobanking study, enrolling adult patients with newly diagnosed head and neck squamous cell carcinoma (HNSCC), triple negative breast cancer (TNBC), ovarian and cervical cancer. Tumor tissue and blood samples are collected at several time points during patient's journey, including at diagnosis, post-neoadjuvant chemotherapy in case of neoadjuvant treatment, at surgery, at recurrence and at disease progression following treatment initiated at recurrence. Since its launch in 2017 at Institut Curie, SCANDARE has enabled the longitudinal collection and preservation of samples for in-depth analyses. Here, we describe molecular alterations in 264 FFPE samples at baseline from 90 HNSCC patients using 3'-tag RNA-seq technology, with FASTQ files deposited in the dataset.
Little is known about the transcriptomic profile of individuals who are exposed to SARS-CoV-2 yet resist becoming PCR positive. To investigate this, longitudinal whole-blood samples were taken (0, 7, 14, and 28 days after enrolment) from PCR positive and PCR negative SARS-CoV-2-naïve household contacts who were recently exposed to a COVID-19 index. Samples were also taken from pre- and post-pandemic unexposed controls. Total RNA was extracted from PAXgene tubes before undergoing poly(A) selection followed by globin and rRNA depletion. DNA libraries were constructed using the NEBNext® Ultra™ II Directional RNA Library Prep Kit for Illumina. All samples were then sequenced across 2 flowcells of an Illumina HiSeq 4000.
The aim of this project is to assess differences in intratumoral immune composition in pregnant melanoma patients versus non pregnant melanoma controls. Samples (N=25) were obtained from a local patient database. From archived FFPE we isolated RNA and performed NGS and bio-informatica data-analysis.
Immune checkpoint inhibitors are effective cancer treatments, but molecular determinants of clinical benefit are unknown. Ipilimumab and tremelimumab are antibodies against cytotoxic T-lymphocyte antigen 4 (CTLA-4). Anti-CTLA-4 treatment prolongs overall survival in patients with melanoma. CTLA-4 blockade activates T cells and enables them to destroy tumor cells. We obtained tumor tissue from patients with melanoma who were treated with ipilimumab or tremelimumab. Whole-exome sequencing was performed on tumors and matched blood samples. Somatic mutations and candidate neoantigens generated from these mutations were characterized. Neoantigen peptides were tested for the ability to activate lymphocytes from ipilimumab-treated patients. Malignant melanoma exomes from 64 patients treated with CTLA-4 blockade were characterized with the use of massively parallel sequencing. A discovery set consisted of 11 patients who derived a long-term clinical benefit and 14 patients who derived a minimal benefit or no benefit. Mutational load was associated with the degree of clinical benefit (P = 0.01) but alone was not sufficient to predict benefit. Using genomewide somatic neoepitope analysis and patient-specific HLA typing, we identified candidate tumor neoantigens for each patient. We elucidated a neo-antigen landscape that is specifically present in tumors with a strong response to CTLA-4 blockade. We validated this signature in a second set of 39 patients with melanoma who were treated with anti-CTLA-4 antibodies. Predicted neoantigens activated T cells from the patients treated with ipilimumab.
There are 66 pairs of LAML cases(complete genomics) in this project which belongs to LAML-CN..The library is constructed by the Completes Genomics protocol.
There are 22 pairs of LAML cases in this project which belongs to LAML-CN.The library is constructed by the Illumina protocol.
Mixture of 5 unrelated individuals sequenced by 10x as a scATAC-seq. The dataset was then processed by Cell Ranger and deconvoluted to yield each individuals genetic profile using the De-goulash pipeline. The separation file is submitted as the processed file. The bam are submitted as unprocessed files.
Genome-wide association studies (GWAS) of colorectal cancer (CRC) have been instrumental in identifying a number of common susceptibility loci in Non Hispanic (NH)-White populations, and a NCI priority is to extend GWAS findings to other populations to address racial/ethnic disparities in cancer susceptibility. Currently, GWA studies of CRC in NH-Whites, Japanese and African-Americans are ongoing. We propose a complementary study to address this critical research area in Hispanics. Hispanics represent the fastest growing ethnic population in the U.S. and have been largely understudied in terms of genetic susceptibility to cancer. There are noted differences in incidence, survival and mortality in CRC by ethnic/racial groups. Hispanics often present with CRC at a younger age and have a significantly greater incidence of stage IV tumors or metastatic disease compared to NH-Whites. We propose to conduct a large, cost-efficient, population-based GWAS in Hispanics by building upon existing NIH-funded resources, the Colon Cancer Family Registry (Colon CFR) and the Multiethnic Cohort Study (MEC). We plan to recruit 2,500 Hispanic men and women diagnosed with CRC between 01/2008 to present using cancer registries in California, physican referrals and familial referrals. Risk factor/diet questionnaires, pathology reports, Oragene saliva samples (for genotyping), optional blood samples (for genotyping and biometric analysis) and tumor blocks (for MSI testing) will be collected using methodologies developed in the Colon CFR/MEC. Cases of CRC in the MEC (currently 473; anticipated 600 at end) will also be included. Population-based Hispanic individuals without a diagnosis of CRC participating in other GWA studies in the MEC (n=3,900, U01HG004726, Haiman) will be used as controls. We will genotype all 3,100 cases using the Illumina 1M array and use available genotype and epidemiologic data collected on 3,900 controls. Our statistical analyses will include: single-SNP and haplotype effects, gene-environment interactions and heterogeneity by MSI, tumor subtype and family history of CRC. We will replicate findings in a second-stage using CRC cases and controls from Mexico (1,000 cases and 1,000 controls, EU FP7 funding, CHIBCHA, Carvajal-Carmona/Tomlinson). We will also examine heterogeneity of the risk estimates by ethnicity/race by leveraging GWA data on NH-Whites (2,142 cases, 1,909 controls, U01 CA122839, Casey), (4,000 cases, 6,000 NH-White controls, UK-CHIBCHA, Tomlinson), Colombians (2,000 cases and 2,000 controls, CHIBCHA), Japanese (1,000 cases and 1,000 controls) and African-Americans (1,500 cases and 1,500 controls, R01CA126895, Le Marchand). We will genotype replicated significant SNPs in our main and combined analysis in several Hispanic populations (note: studies funded by EU or NIH for data collection but not GWAS), including 800 Puerto Ricans, 2,000 Brazilians, 2,000 Argentineans and 3,000 Spanish/Portuguese, to assess generalizability of findings. We will examine the differences in inflammatory gene transcription dynamics in leukocytes (from blood sample collection) by fatigue level (as assessed from study questionnaire data). This study will have a high impact by addressing the key question of racial/ethnic disparities related to genetic susceptibility to CRC, will provide translational guidelines on biological mechanisms during the cancer survivorship period to increase quality of life among cancer survivors, and will enable further growth and investment into research among Hispanics by providing a resource of genetic data and biospecimens, which is lacking.
Multiple myeloma is a disease characterized by the expansion of cancerous plasma cells in the bone marrow. The bone marrow microenvironment plays a pivotal role in supporting myeloma growth and modulating tumor immunity. As the disease progresses, myeloma cells may become independent of the bone marrow environment, leading to extramedullary disease associated with poor prognosis. However, the early processes associated with bone marrow independence and its implications for disease and immune control remain poorly understood. Here, we employed comprehensive single-cell and spatial mapping to identify the disruption of the cortical bone and subsequent expansion of myeloma cells in breakout lesions as a key event in multiple myeloma pathogenesis and tumor immunity. Breakout lesions harbor an adapted niche and a unique immune microenvironment conferred by sustained immune-tumor interactions. In particular, tumor-reactive T cells with a highly exhausted phenotype, alongside unique NK cell and macrophage subtypes, specifically expanded in breakout lesions. Spatially-resolved genomic, transcriptomic and cellular analyses uncovered extensive intra-lesion heterogeneity, suggesting a divergent co-evolution of genomic variation and locally confined T cell responses in distinct subregions. Ultra-high plex imaging revealed that tumor-reactive T cells, together with other immune-regulatory cells and dendritic cells, co-expand and exhaust upon sustained tumor interactions in locally confined immune islands. In contrast, immune-regulatory macrophage and NK cell subsets operate spatially separated within myeloma cell deserts. Jointly, our analyses uncover bone marrow breakout lesions as a hotspot for tumor-immune cell interactions, diversification and exhaustion, representing a key event in myeloma pathogenesis with significant therapeutic implications.