This study comprises prospectively accrued, microdissected fresh frozen samples of multifocal lobular carcinoma in situ (LCIS), ductal carcinoma in situ (DCIS), invasive lobular carcinoma and invasive ductal carcinoma from patients undergoing prophylactic or therapeutic mastectomies after a diagnosis on LCIS diagnosed and managed at Memorial Sloan Kettering Cancer Center (MSKCC). Microdissected samples were subjected to paired-end whole exome sequencing on an Illumina HiSeq 2000. The data generated were used to define the landscape of somatic genetic alterations of LCIS, DCIS, invasive lobular carcinoma and invasive ductal carcinoma, to define clonal relatedness of these lesions and to investigate the clonal shifts in the progression from in situ to invasive breast cancer.
Mucopolysaccharidosis (MPS) types I, II, IV, VI and VII, are inherited diseases that result from the bodies inability to break down large sugar molecules that are by-products of metabolism. MPS affects most organs of the body and causes abnormalities in the liver, spleen, bones, and brain. We are studying the central nervous system (primarily the brain) so we can better understand the nature of the problems individuals with MPS I, II, IV, VI and VII have so we can find ways of better treating these problems. We would like to find out about the changes in the brain of individuals with MPS I, II, IV, VI and VII using data from MRI and neuropsychological tests. This is a longitudinal study so patients who enroll will need to be seen 3 times over 5 years. The longitudinal nature of the study allows us to make conclusions about how changes in some structures of the brain and changes in cognitive ability are related. The research objectives are: To identify abnormalities of brain (and central nervous system) structure and function in patients with MPS I, II, IV, VI and VII, regardless of how they are being treated or have been treated in the past; and to track disease progression over time. To examine the degree to which independent variables (e.g., age at first treatment, severity of disease, types of medical abnormalities, mutation, medical events, and sensory abnormalities) have an impact on both functional and structural outcomes as well as on quality-of-life. To identify, through longitudinal study, those measures that most accurately reflect the current disease state. This is a longitudinal study of 75-100 individuals with either MPS I, II, IV, VI and VII. Those participating in the study will be evaluated 3 times over 5 years. The primary site for this study is the University of Minnesota but there are an additional 6 centers in the United States and Canada that are also participating and will provide data for analysis. You will need to be able to travel to Minnesota or one of the participating centers in order to be a part of this study.
The Southern African Human Genome Programme (SAHGP) is a national and regional initiative that aims to unlock the unique genetic character of southern African populations. Its vision is to improve quality of life through understanding human genetic diversity. It is timely to advance the SAHGP initiative since global genomic initiatives are increasingly pointing to populations in Africa, more specifically the San and Khoe from southern Africa, as harbouring the most ancient genetic signatures among the living populations of the world. Studies on African populations have already generated a wealth of information upon which the SAHGP can build. This information will be used to promote and support genomic research programmes both nationally and regionally to address critical questions that would benefit the people of our region. The SAHGP aims to make a significant contribution to the understanding of DNA variation among southern Africans and how this impacts on the health of the people of this region. Aims of the SAHGP: To develop capacity for genomic research in southern Africa; To establish sustainable resources for genomic research in the region; and To translate knowledge and information into improvements in human health in the region. The objective is to derive the maximum benefit from the SAHGP, and that the data should be widely shared with the scientific community for the benefit of the people of the sub-continent and beyond.
Sickle cell disease (SCD) is characterized by the presence of sickle hemoglobin (HbS) within circulating erythrocytes resulting in hemolytic anemia, vascular occlusion, and end organ damage due to alterations in the shape and deformability of the cell membrane. The disease is inherited in an autosomal recessive pattern, and is most commonly caused by a single nucleotide substitution in the hemoglobin subunit beta (HBB) gene located on chromosome 11. Participants in this study include children with SCD treated with hydroxyurea to pharmacologically increase fetal hemoglobin (HbF) levels and reduce disease severity. Therefore, the primary phenotype of interest in this study is the change in HbF levels in response to hydroxyurea treatment. Genetic factors have been shown to influence inter-individual variation in drug response, and identification of novel genes and variants associated with clinical outcomes in SCD will be achieved through collaboration between Baylor College of Medicine, Augusta University, Columbia University Medical Center, Emory University School of Medicine and Children's Healthcare of Atlanta, and St. Jude Children's Research Hospital. The NHLBI TOPMed Program is designed to generate scientific resources to enhance understanding of fundamental biological processes that underlie heart, lung, blood and sleep disorders (HLBS). It is part of a broader Precision Medicine Initiative, which aims to provide disease treatments that are tailored to an individual's unique genes and environment.
Huntington's disease (HD) is a neurodegenerative disorder typically diagnosed in mid-life that is caused by expansion of an otherwise polymorphic CAG trinucleotide repeat. The mutation causes a gain-of-function of the huntingtin protein to trigger a pathogenic process that produces detectable phenotypic differences many years before the traditional diagnosis, which is based upon characteristic motor symptoms. The prediagnosis phase of pathogenesis is now the subject of intense scrutiny by an NINDS-funded longitudinal study, PREDICT-HD, in which undiagnosed gene carriers are followed longitudinally and subjected to detailed phenotyping (PREDICT-HD Huntington Disease Study -- dbGaP Study Accession: phs000222). This powerful approach, which offers the potential for moving the focus of therapeutic development to the decades prior to neurological disease diagnosis, is enabled by the fact that all individuals with HD have the same type of mutation, which can be determined at any time in life by a single HD CAG repeat PCR amplification assay. The precise length of the HD CAG repeat differs between individuals. There is a strong negative correlation between the number of CAG repeats and the age at onset of diagnostic neurological abnormalities in HD, such that the CAG repeat accounts for ~50% of the variation in age at diagnosis. Analysis of the remaining variance not explained by the length of the CAG repeat has shown that it is highly heritable, being due to genetic variation, elsewhere in the genome. The intent of the Genetic Modifiers of Huntington's Disease study is to identify genetic modifiers of HD pathogenesis by using genomewide association techniques in diagnosed HD individuals to identify genetic factors associated with the residual variance in age at onset not explained by the CAG repeat, and to extend these analyses to pre-diagnosis phenotypes, for example, those defined in the PREDICT-HD study. Identification of modifier genes is a top priority for HD research (and an example of an approach that can be applied in other late-onset genetic disorders), as it could provide clues to developing rational treatments that delay or prevent the pathogenic process from causing the ravages of the disease that ensue in the ~15 years of inexorable decline to ultimate death that now follows clinical diagnosis. To further that goal, the release of study version 2 makes available whole exome sequencing data of n=221 study participants.
Data Access NOTE: Please refer to the “Authorized Access” section below for information about how access to the data from this accession differs from many other dbGaP accessions. Objectives: To evaluate the hypothesis that high frequency oscillatory ventilation in preterm infants would reduce the incidence of mortality and pulmonary complications compared to conventional mechanical ventilation.Background: With the introduction of mechanical ventilation in preterm infants, mortality and morbidity significantly improved but remained high. The improvement in survival that accompanied the use of mechanical ventilation also brought about an increase in the incidence of pulmonary complications. The principal complication occurs in the form of bronchopulmonary dysplasia. Barotrauma and oxygen toxicity are considered to be in the pathogenesis for this disorder. Considerable interest in high frequency ventilation for preterm infants was generated when animal studies indicated high frequency ventilation to be effective in promoting gas exchange without apparent adverse effects. High Frequency Ventilation (HFV) delivers small tidal volumes at high frequencies of 4 to 15 Hz, and animal studies had indicated that HFV was associated with effective gas exchange, less barotrauma, and lower mean airway pressure. However, the efficacy and safety of HFV in preterm infants had not been studied. The HIFI Planning Phase was initiated in August 1984, and recruitment and intervention began in February, 1986. Follow-up studies continued thru September, 1988.Participants: A total of 673 infants were enrolled.Conclusions: Bronchopulmonary dysplasia incidence was similar in the two groups as was mortality and the need for ventilatory support during the first 28 days. There was a significantly greater rate of pneumoperitoneum of pulmonary origin in the high frequency group as was a greater incidence rate of intracranial hemorrhage. (HIFI Study Group, 1989, PMID: 2643039)
This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/We performed exome sequencing on serial samples from a patient with CMML who progressed to AML. The exome sequencing suggests that NPM1, TET2 and DNMT3a mutations were present in the dominant clone in the CMML sample and that NRAS is a new subclonal mutation in the AML sample. Diagnostic data shows the presence of a FLT3-ITD mutation in the AML sample, which is likely to have driven progression. Here we are performing re-sequencing of the putative driver and some passenger mutations which appear to be in the same clone to validate these mutations and to verify the relative quantification of these abnormalities .
High resolution analysis of DNA copy number abnormalities and loss-of-heterozygosity on acute myeloblastic leukemia samples utilizing SNP arrays has demonstrated that in contrast to pediatric ALL, de novo AML is characterized by a very low burden of genomic alterations (Radtke, et al., PNAS, 2009). Samples for this study represented a cross-section of the different subtypes of pediatric AML. The only AML subtype that was an outlier from the above observations was acute megakaryocytic leukemia (AML FAB-M7), with the majority of these cases being characterized by complex chromosomal rearrangements and a high number of copy number alterations. To more fully define the genomic landscape of this subtype, we performed transcriptome sequence analysis on 14 pediatric FAB-M7 cases and mutation recurrence screening in a panel of 62 adult and pediatric AML FAB-M7 samples using the Illumina platform. Our results identified chromosomal rearrangements resulting in the expression of novel fusion transcripts in 11/14 cases. Remarkably, in 7/14 cases we detected an inversion on chromosome 16 that results in the juxtaposition of the CBFA2T3 gene next to the GLIS2 gene resulting in a CBFA2T3-GLIS2 chimeric gene that encoded an in frame fusion protein. This fusion led to the acquisition or preservation of self-renewal in colony forming assays, providing functional evidence for a role in leukemogenesis. In addition to novel chimeric transcripts, we found mutations in genes previously identified to play a role in megakaryoblastic leukemia that carry a proliferative advantage to the cell, such as JAK2 and MPL. These data demonstrate that AML FAB-M7 is characterized by cooperating Class I and Class II mutations leading to leukemogenesis.
This individual-level collection of datasets contains all the RADx Digital Health Technologies (RADx-DHT) studies in dbGaP for General Research Use (GRU). Access to studies in this collection is on the RAPIDS platform. To request access to this study collection, select phs003666 in the dbGaP when submitting a data access request.
Data Access NOTE: Please refer to the "Authorized Access" section below for information about how access to the data from this accession differs from many other dbGaP accessions.Objectives: The initial objective of the PANTHER-IPF study was to evaluate the effectiveness of a drug combination of prednisone, azathioprine, and N-acetylcysteine in the treatment of mild-to-moderate idiopathic pulmonary fibrosis (IPF) compared to N-acetylcysteine alone or placebo. After interim analysis presented safety concerns regarding the three drug regimen, the protocol was modified to evaluate only N-acetylcysteine effectiveness. Background: IPF is a chronic, progressive lung disease of unknown cause that is characterized by the histopathological or radiologic patterns of usual interstitial pneumonia in a typical clinical setting. The median survival of participants with idiopathic pulmonary fibrosis after diagnosis is 2 to 5 years. To date, no pharmacologic therapies have been shown to improve survival. Acetylcysteine has been suggested as a beneficial treatment for idiopathic pulmonary fibrosis, although data from placebo-controlled studies are lacking.Design: Patients with mild-to-moderate impairment in pulmonary function were randomly assigned to receive a three-drug regimen (prednisone, azathioprine, and acetylcysteine), acetylcysteine alone (plus matched placebos for prednisone and azathioprine), or matched placebos for each of the active therapies. After the planned midpoint interim analysis, the data and safety monitoring board recommended discontinuation of the three-drug regimen because of an excess in the number of deaths, hospitalizations, and serious adverse events among patients in the combination-therapy group, as compared with the placebo group. After NHLBI accepted this recommendation on October 14, 2011, the three-drug regimen was discontinued and patients were recruited for only the acetylcysteine group and the placebo group. Patients were seen at the clinical centers for screening, at baseline, and at 4, 15, 30, 45, and 60 weeks. The primary outcome measure was change in FVC over 60 weeks.Conclusions: As compared with placebo, acetylcysteine offered no significant benefit with respect to the preservation of FVC in patients with idiopathic pulmonary fibrosis with mild-to-moderate impairment in lung function. In addition, there were no significant differences between the acetylcysteine group and the placebo group in the rates of death or acute exacerbation.
We have studied the tumor genomic evolution in a patient with metastatic breast cancer bearing an activating PIK3CA mutation. The patient was treated with the PI3Kα inhibitor BYL719 and achieved a lasting clinical response on BYL719, but eventually progressed to treatment and died shortly thereafter. A rapid autopsy was performed and a total of 14 metastatic lesions were collected for further analysis. In order to identify possible genetic determinants of acquired resistance to PI3Kα inhibition, we took a three-step approach. First, we examined both the primary tumor (before BYL719 treatment) and the new lung metastasis by whole genome sequencing (DNA from the spleen was used as a normal control). Then, we analyzed the primary tumor, lung metastasis, and the peri-aortic lesion that remained stable (responding) at the time of progression to BYL719 therapy by whole exome sequencing. Finally, to confirm and expand our findings, we sequenced the primary tumor and all the metastatic lesions to >500-fold coverage using a custom targeted deep-sequencing assay, MSK-IMPACT. Using this 3-step approach, we have demonstrated that patient’s lesion underwent parallel genetic evolution at multiple metastatic sites with different PTEN genomic alterations leading to a convergent PTEN- null phenotype resistant to PI3Kα inhibition. In order to expand our observations, we analyzed paired samples (pre-treatment and at progression) from six additional patients enrolled in the BYL719 trial. Acquired bi-allelic loss of PTEN was found in one additional patient treated with BYL719 whereas in two patients PIK3CA mutations present in the primary tumor were no longer detected at the time of progression. To functionally characterize our findings, inducible PTEN knockdown in sensitive cells resulted in resistance to BYL719, while simultaneous PI3K-p110β blockade reverted this resistance phenotype, both in cell lines and in PTEN-null xenografts derived from our patient. We conclude that parallel genetic evolution of separate sites with different PTEN genomic alterations leads to a convergent PTEN-null phenotype resistant to PI3Kα inhibition.
The goal of this international collaborative project is to identify genes responsible for Schizophrenia in the Xhosa population of South Africa. The vast majority of the genetic basis for Schizophrenia has yet to be explained. We hypothesize that genes and pathways important to Schizophrenia will harbor different, severe disease-causing mutations in different affected individuals. Given the genetic diversity of African populations, we expect to find genes for Schizophrenia that have not yet emerged from studies of other populations. This project will also foster the development of gene discovery research for neuropsychiatric disorders in Africa. Our approach will identify genes important for the disorder in populations worldwide. These genes will stimulate future efforts to develop more effective treatment and prevention strategies.
BEAGESS is a genome-wide association study (GWAS) which takes advantage of the extensive data collected by investigators in the Barrett's and Esophageal Adenocarcinoma Consortium (BEACON, http://beacon.tlvnet.net), representing many of the high-quality population-based and other epidemiologic studies of BE and EA in the world. The overall goal of our research is to evaluate the influence of genetic susceptibility on risk of EA and its main precancerous condition, BE. The primary aims of the study are to identify tagSNPs most strongly associated with risk of esophageal adenocarcinoma (EA) and Barrett's esophagus (BE). In the secondary aims we will evaluate the extent to which the most significant associations vary according to key environmental and host factors for these conditions, including gender, obesity and cigarette smoking. These results will guide additional exploratory analyses examining ways in which individual SNP results might be aggregated to shed light in the importance of specific pathways relevant to the etiology of BE and EA, and to identify possible epistatic effects.
Osteoporosis is a condition of excessive skeletal fragility which results in high risk to low trauma fractures. It is the most prevalent metabolic bone disease and is a major public health problem which may result in devastating morbidity and mortality. The most powerful, measurable determinant of fracture risk is bone mineral density (BMD). More than 60% of BMD variation is attributable to genetic factors. There are gender differences in BMD that contribute to a substantially higher fracture risk among women than men. Genetic studies demonstrate that some osteoporosis risk genes/genomic regions are gender specific. However, specific such genes contributing to female BMD and to the sex differences of BMD are largely unknown. Recent rapid progresses in SNP genotyping technology, in our knowledge about human genome diversity and linkage disequilibrium (LD) patterns in the human genome as revealed have made it feasible and timely to pursue a powerful whole genome-wide association study (GWAS) to identify genes for BMD. The major goal of this project is to perform a powerful GWAS study in a large sample of US Caucasian subjects. Gender specific effects of the genetic variants will be examined. The significant genetic variants discovered will be used to design diagnostic DNA chips for prognosis for potential health problems of osteoporosis later in life.
We investigate the naturally occurring variation of small RNA expression in conjunction with genetic and gene expression variation. We shall assess the variation in subabdominal fat tissue small RNA levels in 168 unrelated MuTHER individuals. Using gene expression data from the same samples, we can survey how much the small RNA expression contributes to the variance in gene expression profiles. We can also take advantage of the high density genotyping data on these samples to identify genetic associations of small RNA expression levels, and the downstream effect of the genetic variants in small RNA sequence. Previous small RNA sequencing efforts have also extended the repertoire of small RNAs, and we hope to potentially find additional small RNAs active in adipose tissue biology. Altogether, the project should provide an exciting view into small RNA biology in humans, and lead to development of new methodology for analysis of molecular phenotype data.
We are developing a protocol to differentiate mouse and human induced pluripotent stem (IPS) and embryonic stem (ES) cells towards the haematopoietic pathway to generate erythrocytes in vitro. This system has many applications such as the study of the role of specific genes and human polymorphisms in infectious diseases such as malaria, as well as haematological diseases such as myelodysplastic syndrome. The nature of the in vitro differentiation process means that a heterogeneous population of cells is generated. In order to understand the types of cells produced with our protocol, we have performed a single cell analysis, which has the power to reveal the different populations of cells and their characteristics. For this, a cDNA library has been made that needs to be sequenced to obtain the gene expression profiles of the different cells. With this information we will be able to assess the quality of the differentiation protocol and improve it in order to produce better cells for the downstream applications.This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
This study is currently hosted by the European Nucleotide Archive. To access the data contained within the Study please follow the link below: https://www.ebi.ac.uk/ena/browser/view/PRJEB39323 Oligodendrocyte (OL) pathology is increasingly implicated in neurodegenerative diseases, as they are involved in metabolic support of axons and functional cross-talk with other brain cells. Rodent OLs are heterogeneous, with developmental and biological differences, but the extent of heterogeneity in the normal human brain and its contribution to any changes to disease remains unknown. Here we performed single nuclei RNA-sequencing (snRNA-seq) from white matter (WM) areas of post mortem human brain both in control (Ctr) and multiple sclerosis (MS) patients. We identified several sub-clusters of oligodendroglia in the Ctr human WM, some similar to those in mouse, and defined new markers for these cell states. Strikingly, some of these sub-clusters were under-represented in MS tissue, while others were more prevalent than in controls. We found a lack of OL precursor cells (OPCs) and an OL subcluster in an intermediate stage of differentiation in MS lesions and in normal appearing white matter (NAWM), suggesting either depletion by the disease or by a regenerative response. The differences in mature OL sub-clusters indicate different functional states of OLs in MS tissue and, as this is similar in NAWM to lesions, that MS is a more diffuse brain disease than the focal demyelinating lesions suggest. We were also able to identify new putative markers of different MS lesion subtypes. Our findings of an altered heterogeneity of oligodendroglia in MS may have an important contribution to our understanding of disease progression and may alter therapeutic approaches to MS.
Background:Acquired resistance to targeted drugs is largely due to intra-tumor molecular heterogeneity and clonal selection. Objective:The objective of this study was to determine molecular alterations and mediating resistant mechanisms and to suggest additional treatment options.Design, Setting, and Participants:Four patients were enrolled into the MORE (Molecular Renal Cancer Evolution) trial. We investigated the mutational spectra of metastatic clear cell renal cell carcinoma (ccRCC) at baseline and compared them to those upon clonal evolution at progression on systemic therapy with tyrosine kinase inhibitors.Outcome Measurements:DNA was isolated from tumor tissues after surgery, metastasis biopsies after progression on sunitinib or axitinib, and subjected to whole-exome sequencing (WES). In addition, the exomes of circulating tumor DNA (ctDNA) from two patients were sequenced at baseline and at progression.Results and Limitations:Our data provide evidence for clonal evolution and various acquired resistance mechanisms of ccRCC with subclonal mutations in FLT4, MTOR, ITGA3/5, SETD2, and VHL on metastatic progression. The limited number of shared mutations in tissue and plasma biopsies indicates that both sources provide complementary information, possibly due to mutations present in metastatic lesions which were not amenable to tissue sampling but are represented in the ctDNA. Conclusion:Acquired resistance to targeted therapies in metastatic ccRCC is due to intra-tumor heterogeneity and clonal evolution. Larger patient cohorts have to be studied to estimate the benefit of WES from biopsies and ctDNA for therapy decision in metastatic ccRCC.Patient Summary:We analyzed the molecular profiles of ccRCC patients in primary tissues and metastatic tissues upon progression on tyrosine kinase inhibitor therapy. Clinically relevant mutations upon progression were detected in the metastatic sites of all of four patients, suggesting that it might be possible to derive therapies based on sequencing the metastases.?HIPO-045
Using the CRISPR-Cas9 genome editing system, various oncogenic mutations were introduced in the genome of normal adult stem cells derived from the colon. The resulting clonal cultures were subjected to whole genome sequencing. In addition, the cultures were transplanted in a mouse and the resulting primary and metastatic tumors were also subjected to whole genome sequencing. The initial bulk culture, which was used to generate the mutations, was also whole genome sequenced and served as a reference sample to filter germline variants.
The Inuit ancestors of the Greenlandic people arrived in Greenland close to 1000 years ago. Since then, Europeans from many different countries have been present in Greenland. Consequently, the present-day Greenlandic population has ~25% of its genetic ancestry from Europe. In this study, we investigated to what extent different European countries have contributed to this genetic ancestry.
The Gabriella Miller Kids First Pediatric Research Program (Kids First) is a trans-NIH effort initiated in response to the 2014 Gabriella Miller Kids First Research Act and supported by the NIH Common Fund. This program focuses on gene discovery in pediatric cancers and structural birth defects and the development of the Gabriella Miller Kids First Pediatric Data Resource (Kids First Data Resource). All of the genomic and phenotypic data from this study are accessible through dbGaP. The data is also available at the Kids First Portal, where other Kids First datasets can also be accessed in the cloud for data analysis, data visualization, collaboration and interoperability, open to all researchers and developers.Pediatric malignant germ cell tumors (GCTs) represent approximately 6% of childhood cancers, including 3% of tumors in children aged 0-14 years and 15% of tumors in adolescents. GCTs are heterogeneous and grouped together due to the presumed common cell of origin, the primordial germ cell (PGC). GCTs typically occur in the testes or ovaries; however, extragonadal GCTs can occur and are likely a result of abnormal germ cell migration during development. Evidence suggests that GCTs, including those diagnosed in adults, are initiated in utero. Thus, alterations in normal embryonic development are likely to be especially relevant to GCT etiology. Germline susceptibility has not been evaluated in an agnostic fashion in pediatric GCT, mainly due to a lack of an adequate number of samples; however, emerging evidence supports a genetic etiology.In the two Kids First GCT projects, we will evaluate genetic susceptibility to intracranial and extracranial GCT by sequencing probands and their unaffected parents. The goals of the project are to: 1) evaluate the contribution of rare genetic variants in GCT through the use of aggregate burden tests, focusing on genes and established regulatory regions; 2) identify de novo SNVs and CNVs in pediatric GCT using a case-parent triad design; and 3) identify molecular signatures in GCT tumor specimens, overall and by age group and tumor characteristics. Whole Genome Sequencing (WGS) data generated through the Gabriella Miller Kids First Pediatric Research Program will provide an opportunity to investigate the genetic origins of GCT in a diverse set of samples. Given the limited knowledge of GCT etiology and biology, the results of the proposed analyses are likely to have a big impact on the field.
ost adults with intellectual disabilities (ID) do not undergo genetic diagnostic investigation as part of their clinical care and have 'missed the boat' with regard to the WES and WGS genetic testing that is now being provided for children with ID. There is a dramatically increased risk of psychistric disorders in adults with ID, e.g. the risk of psychoses is 10X higher than in the general population. It remains an open question as to how much of adult ID is genetic in origin and how similar the genetic forms of adult ID are to those being diagnosed in children, in part due to survivor bias. There is also the opportunity to identify adults with treatable forms of ID, of which over 80 have been described, thus improving their clinical management. Furthermore, analysis of medical records of adults with genetic forms of ID can help to characterise the 'natural history' of individual disorders, resulting in more accurate prognoses for diagnosed children and identifying opportunities for improved management and possibly therapeutic intervention (e.g. optimal anti-epileptic therapy). Here we propose to exome sequence (to ~50X coverage) 200 adults with ID and co-morbid psychiatric disorders. This cohort has previously been assayed with chromosomal microarrays (Wolfe et al 2017 EJHG, 25, 66–72) identifying a diagnostic yield of ~11% which is comparable to the CNV diagnostic yield in various child ID cohorts (10-15%). The authors observed no substantive biases in diagnostic yield between different psychiatric diagnostic classes. The WES data will be analysed using the diagnostic workflows developed in the DDD study to ensure comparability between child and adult ID datasets. This study is intended as a pilot study to demonstrate the value of WES in adults with ID. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Background: Metastatic castration-resistant prostate cancer (mCRPC) is a heterogeneous disease in which molecular stratification is needed to improve clinical outcomes. The identification of predictive biomarkers can have a major impact on the care of these patients, but the availability of metastatic tissue samples for research in this setting is limited. Objective: To study the prevalence of immune biomarkers of potential clinical utility to immunotherapy in mCRPC by tumour RNA sequencing, and to determine their association with overall survival (OS).
This study's purpose is to use data generated in Renal whole genome and exome Cancer sequencing to target specific areas in multiple other Renal Cancer samples. These areas will then be subjected to bespoke pull downs, tagged and then pool the resultant amplicons. These amplicons will then be sequenced. This is hoped to show the prevalence of previous findings in multiple individuals in a high throughput method.
The Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) is a collaborative effort comprised of a coordinating center and scientific researchers from well-characterized cohort and case-control studies conducted in North America and Europe. This international consortium aims to accelerate the discovery of common and rare genetic risk variants for colorectal cancer by conducting large-scale meta-analyses of existing and newly generated genome-wide association study (GWAS) data, replicating and fine-mapping of GWAS discoveries, and investigating how genetic risk variants are modified by environmental risk factors. To expand these efforts, we assembled case-control sets or nested case-control sets from 20 different North American or European studies. Summary descriptions and study participant inclusions/exclusion criteria for each of these studies are detailed below. The Black Women's Health Study (BWHS): Is the largest follow-up study of the health of African-American women (Cozier et al., 2004; Rosenberg et al., 1995) [PMID: 15018884; PMID: 7722208]. The purpose is to identify and evaluate causes and preventives of cancers and other serious illnesses in African-American women. Among the diseases being studied are breast cancer, colorectal cancer, type 2 diabetes, uterine fibroids, systemic lupus erythematosus, and cardiovascular disease. The study began in 1995, when 59,000 black women from all parts of the United States enrolled through postal questionnaires. The women provided demographic and health data on the 1995 baseline questionnaire, including information on weight, height, smoking, drinking, contraceptive use, use of other selected medications, illnesses, reproductive history, physical activity, diet, use of health care, and other factors. The participants are followed through biennial questionnaires to determine the occurrence of cancers and other illnesses and to update information on risk factors. Self-reports of cancer are confirmed through medical records and state cancer registry records. Mouthwash-swish samples, as a source of DNA, were obtained from ~26,000 BWHS participants in 2002-2007. DNA was isolated from the mouthwash-swish samples at the Boston University Molecular Core Genetics Laboratory using the QIAAMP DNA Mini Kit (Qiagen). All incident colorectal cancer cases with a DNA sample were included in the present analysis. Two controls per case, selected from among BWHS participants free of colorectal cancer at end of follow-up, were matched to cases on year of birth (+/- 2 years) and geographical region of residence (Northeast, South, Midwest, and West). A total 209 colorectal cancer cases and 423 controls were sent for genotyping. Campaign Against Cancer and Heart Disease (CLUE II): The Campaign Against Cancer and Heart Disease, is a prospective cohort designed to identify biomarkers and other factors associated with risk of cancer, heart disease, and other conditions (Kakourou et al., 2015) [PMID: 26220152]. 32,894 participants were recruited from May through October 1989 from Washington County, Maryland and surrounding communities. Colorectal cancer cases (n = 297) and matched controls (n = 296) were identified between 1989 and 2000 among participants in the CLUE II cohort of Washington County, Maryland. Colorectal Cancer Study of Austria (CORSA): In the ongoing colorectal cancer study of Austria (CORSA), more than 13,000 Caucasian participants have been recruited within the province-wide screening project "Burgenland Prevention Trial of Colorectal Disease with Immunological Testing" (B-PREDICT) since 2003 (Hofer et al., 2011) [PMID: 21422235]. All inhabitants of the Austrian province Burgenland aged between 40 and 80 years are annually invited to participate in fecal immunochemical testing and haemoccult positive screening participants are invited for colonoscopy. CORSA includes genomic DNA and plasma of colorectal cancer cases, low-risk and high-risk adenomas, and colonoscopy-negative controls. Controls received a complete colonoscopy and were free of colorectal cancer or polyps. CORSA participants have been recruited in the four KRAGES hospitals in Burgenland, Austria, and additionally, at the Medical University of Vienna (Department of Surgery), the Viennese hospitals "Rudolfstiftung" and the "Sozialmedizinisches Zentrum Sud", and at the Medical University of Graz (Department of Internal Medicine). 1403 colorectal cancer and advanced colorectal adenoma cases, and 1404 matched controls were selected for the study. Distribution of factors sex and age (5 year strata) were evenly matched between cases and controls. Cancer Prevention Study II (CPS II): The CPS II Nutrition cohort is a prospective study of cancer incidence and mortality in the United States, established in 1992 and described in detail elsewhere (Calle et al., 2002; Campbell et al., 2014) [PMID: 12015775; PMID: 25472679]. At enrollment, participants completed a mailed self-administered questionnaire including information on demographic, medical, diet, and lifestyle factors. Follow-up questionnaires to update exposure information and to ascertain newly diagnosed cancers were sent biennially starting in 1997. Reported cancers were verified through medical records, state cancer registry linkage, or death certificates. The Emory University Institutional Review Board approves all aspects of the CPS II Nutrition Cohort. A total of 360 cases and 359 controls were selected for this study. Czech Republic Colorectal Cancer Study (Czech Republic CCS): Cases with positive colonoscopy results for malignancy, confirmed by histology as colon or rectal carcinomas, were recruited between September 2003 and May 2012 in several oncological departments in the Czech Republic (Prague, Pilsen, Benesov, Brno, Liberec, Ples, Pribram, Usti and Labem, and Zlin). Two control groups, sampled at the same time of cases recruitment, were included in the study. The first group consisted of hospital-based individuals with a negative colonoscopy result for malignancy or idiopathic bowel diseases. The reasons for the colonoscopy were: i) positive fecal occult blood test, ii) hemorrhoids, iii) abdominal pain of unknown origin, and iv) macroscopic bleeding. The second control group consisted of healthy blood donor volunteers from a blood donor center in Prague. All individuals were subjected to standard examinations to verify the health status for blood donation and were cancer-free at the time of the sampling. Details of CRC cases and controls have been reported previously (Vymetalkova et al., 2014; Naccarati et al., 2016; Vymetalkova et al., 2016) [PMID: 24755277; PMID: 26735576; PMID: 27803053]. All subjects were informed and provided written consent to participate in the study. They approved the use of their biological samples for genetic analyses, according to the Declaration of Helsinki. The design of the study was approved by the Ethics Committee of the Institute of Experimental Medicine, Prague, Czech Republic. All subjects included in the study were Caucasians and comprised 1792 cases and 1764 matched controls. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age and sex. Age was matched on +-5 years, whereas sex was matched exactly. For the cases without matched controls, matching was done only on sex. Early Detection Research Network (EDRN): The aim of the EDRN initiative is to develop and sustain a biorepository for support of translational research (Amin et al., 2010) [PMID: 21031013]. High-quality biospecimens were accrued and annotated with pertinent clinical, epidemiologic, molecular and genomic information. A user-friendly annotation tool and query tool was developed for this purpose. The various components of this annotation tool include: CDEs are developed from the College of American Pathologists (CAP) Cancer Checklists and North American Association of Central Cancer Registries (NAACR) standards. The CDEs provides semantic and syntactic interoperability of the data sets by describing them in the form of metadata or data descriptor. A total of 352 colorectal case samples and 399 controls were selected for this study. Controls were matched to CRC cases based on age and sex. The EPICOLON Consortium (EPICOLON): The EPICOLON Consortium comprises a prospective, multicentre and population-based epidemiology survey of the incidence and features of CRC in the Spanish population (Fernandez-Rozadilla et al., 2013) [PMID: 23350875]. Cases were selected as patients with de novo histologically confirmed diagnosis of colorectal adenocarcinoma. Patients with familial adenomatous polyposis, Lynch syndrome or inflammatory bowel disease-related CRC, and cases where patients or family refused to participate in the study were excluded. Hospital-based controls were recruited through the blood collection unit of each hospital, together with cases. All of the controls were confirmed to have no history of cancer or other neoplasm and no reported family history of CRC. Controls were randomly selected and matched with cases for hospital, sex and age (+- 5 years). A total of 370 cases and 370 controls were selected for genotyping. Hawaii Adenoma Study: For this adenoma study, two flexible-sigmoidoscopy screening clinics were first used to recruit participants on Oahu, Hawaii. Adenoma cases were identified either from the baseline examination at the Hawaii site of the Prostate Lung Colorectal and Ovarian cancer screening trial during 1996-2000 or at the Kaiser Permanente Hawaii's Gastroenterology Screening Clinic during 1995-2007. In addition, starting in 2002 and up to 2007, we also approached for recruitment all eligible patients who underwent a colonoscopy in the Kaiser Permanente Hawaii Gastroenterology Department. Cases were patients with histologically confirmed first-time adenoma(s) of the colorectum and were of Japanese, Caucasian or Hawaiian race/ethnicity. Controls were selected among patients with a normal colorectum and were individually matched to the cases on age at exam, sex, race/ethnicity, screening date (+-3 months) and clinic and type of examination (colonoscopy or flexible sigmoidoscopy). We recruited 1016 adenoma cases (67.8% of all eligible) and 1355 controls (69.2% of all eligible); 889 cases and 1169 controls agreed to give a blood and 29 cases and 34 controls, a mouthwash sample. A total of 989 cases and 1185 controls were genotyped for this study. Columbus-area HNPCC Study (HNPCC, OSUMC): Patients with colorectal adenocarcinoma diagnosed at six participating hospitals were eligible for this study, regardless of age at diagnosis or family history of cancer. Patients with a clinical diagnosis of familial adenomatous polyposis were not eligible for this study. These six hospitals perform the vast majority of all operations for CRC in the Columbus metropolitan area (population 1.7 million). The institutional review board at all participating hospitals approved the research protocol and consent form in accordance with assurances filed with and approved by the United States Department of Health and Human Services. Briefly, during the period of January 1999 through August 2004, 1,566 eligible patients with CRC were accrued to the study (Hampel et al., 2008) [PMID 18809606]. A total of 1472 colorectal cancer samples had enough blood DNA remaining to be sent for genotyping. Control samples were provided by the Ohio State University Medical Center%#39;s (OSUMC) Human Genetics Sample Bank. The Columbus Area Controls Sample Bank is a collection of control samples for use in human genetics research that includes both donors' anonymized biological specimens and linked phenotypic data. The data and samples are collected under the protocol "Collection and Storage of Controls for Genetics Research Studies", which is approved by the Biomedical Sciences Institutional Review Board at OSUMC. Recruitment takes place in OSUMC primary care and internal medicine clinics. If individuals agree to participate, they provide written informed consent, complete a questionnaire that includes demographic, medical and family history information, and donate a blood sample. 4-7 ml of blood is drawn into each of 3 ACD Solution A tubes and is used for genomic DNA extraction and the establishment of an EBV-transformed lymphoblastoid cell culture, cell pellet in Trizol, and plasma. Controls were matched to CRC cases as 1:1. Matching was done on age at reference time (age_ref), race, and sex. Age_ref was matched on +-5 years. Sex and race were matched exactly. For the cases without matched controls, matching was done only on sex and race with 1:1 ratio. Since controls are fewer than cases, one control is matched on 2 cases at most. Health Professionals Follow-up Study (HPFS): A parallel prospective study to the NHS (Nurses' Health Study). The HPFS cohort comprised 51,529 men aged 40-75 who, in 1986, responded to a mailed questionnaire (Rimm et al., 1990) [PMID: 2090285]. Participants provided information on health related exposures, including current and past smoking history, age, weight, height, diet, physical activity, aspirin use, and family history of colorectal cancer. Colorectal cancer and other outcomes were reported by participants or next-of-kin and were followed up through review of the medical and pathology record by physicians. Overall, more than 97% of self-reported colorectal cancers were confirmed by medical record review. Information was abstracted on histology and primary location. Incident cases were defined as those occurring after the subject provided the blood sample. Prevalent cases were defined as those occurring after enrollment in the study but before the subject provided the blood sample. Follow-up evaluation has been excellent, with 94% of the men responding to date. Colorectal cancer cases were ascertained through January 1, 2008. In 1993-1995, 18,825 men in the HPFS mailed blood samples by overnight courier, which were aliquoted into buffy coat and stored in liquid nitrogen. In 2001-2004, 13,956 men in the HPFS who had not provided a blood sample previously mailed in a swish-and-spit sample of buccal cells. Incident cases were defined as those occurring after the subject provided a blood or buccal sample. Prevalent cases were defined as those occurring after enrollment in the study in 1986, but before the subject provided either a blood or buccal sample. After excluding participants with histories of cancer (except nonmelanoma skin cancer), ulcerative colitis, or familial polyposis, case-control sets were previously constructed. In addition to colorectal cancer cases and controls, a set of adenoma cases and matched controls with available DNA from buffy coat were selected for genotyping. Over the follow-up period, data were collected on endoscopic screening practices and, if individuals had been diagnosed with a polyp, the polyps were confirmed to be adenomatous by medical record review. Adenoma cases were ascertained through January 1, 2008. A separate case-control set was constructed of participants diagnosed with advanced adenoma matched to control participants who underwent a lower endoscopy in the same time period and did not have an adenoma. Advanced adenoma was defined as an adenoma 1 cm or larger in diameter and/or with tubulovillous, villous, or highgrade dysplasia/carcinoma-in-situ histology. Matching criteria included year of birth (within 1 year) and month/ year of blood sampling (within 6 months), the reason for their lower endoscopy (screening, family history, or symptoms), and the time period of any prior endoscopy (within 2 years). Controls matched to cases with a distal adenoma either had a negative sigmoidoscopy or colonoscopy examination, and controls matched to cases with proximal adenoma all had a negative colonoscopy. In total, 159 advanced adenoma cases and 109 controls were selected for genotyping. Leeds Colorectal Cancer Study (LCCS): Following local ethical approval, colorectal cancer cases were recruited from 1997 until 2012 in Leeds, UK through surgical clinics. Initially, funding was provided by the UK Ministry of Agriculture, Farming and Fisheries (subsequently the Food Standards Agency) and Imperial Cancer Research Fund (subsequently Cancer Research UK). Recruitment also occurred similarly in Dundee, Perth and York between the periods of 1997 and 2001 using the same protocol and the data and samples were combined. Pathologically confirmed cases were consented at outpatient clinics, providing information on known and postulated risk factors for colorectal cancer (diet, lifestyle and family history) as well as providing a blood sample for DNA. Exclusion criteria included pre-existing diverticular disease and an inability to complete the questionnaire. The General Practitioners of cases (all UK residents have a nominated General Practitioner to whom to refer initial medical queries) and these GPs were asked to send letters to other persons on their patient list of the same gender and born within 5 years of the case. Subsequently to enhance the number of controls, we systematically invited patients from selected GP practices. Diet was assessed in cases and controls using an extensive dietary and lifestyle questionnaire modified by that produced by the European Prospective Investigation in Cancer (EPIC). The frequency that each specific food items were eaten was recorded and we also obtained average fruit and vegetable consumption as a cross-check. In total, 1591 cases and 739 controls provided a DNA sample. The North Carolina Colon Cancer Studies (NCCCS I/II): The North Carolina Colon Cancer Studies (NCCCS I- colon and NCCCS II-rectal) were population-based case-control studies conducted in 33 counties of North Carolina. Cases were identified using the rapid case ascertainment system of the North Carolina Central Cancer Registry. Patients with a first diagnosis of histologically confirmed invasive adenocarcinoma of the colon (cecum through sigmoid colon) between October 1996 and September 2000 were classified as potential cases in the NCCCS I. The NCCCS II included patients with a first diagnosis of histologically confirmed invasive adenocarcinoma of the sigmoid colon, rectosigmoid, or rectum (hereafter collectively referred to as rectal cancer) between May 2001 and September 2006. Additional eligibility requirements were: aged 40-80 years, residence in one of the 33 counties, ability to give informed consent and complete an interview, had a driver's license or identification card issued by the North Carolina Department of Motor Vehicles (if under the age of 65), and had no objections from the primary physician in regards to contacting the individual. Controls, identified and sampled during the respective study dates, were selected from two sources. Potential controls under the age of 65 were identified using the North Carolina Department of Motor Vehicles records. For those 65 years and older, records from the Center for Medicare and Medicaid Services were used. Controls were matched to cases using randomized recruitment strategies. Recruitment probabilities were done using strata of 5-year age, sex, and race groups. Dietary information was collected using a modified version of the semiquantitative food frequency questionnaire developed at the National Cancer Institute. In addition, participants were asked about vitamin and mineral supplementation, special diets, restaurant eating, sodium use, and fats used in cooking. In NCCCS I, 515 colorectal cases and 687 matched controls were sent for genotyping. In NCCCS II, 796 colorectal cases and 823 controls were sent from the NCCCS II for genotyping. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age, race, and sex. Age was matched on +-5 years. Race and sex was matched exactly. For the cases without matched controls, matching was done only on sex and race. Nurses Health Study (NHS): The NHS cohort began in 1976 when 121,700 married female registered nurses age 30-55 years returned the initial questionnaire that ascertained a variety of important health-related exposures (Belanger et al., 1978) [PMID: 248266]. Since 1976, follow-up questionnaires have been mailed every 2 years. Colorectal cancer and other outcomes were reported by participants or next-of-kin and followed up through review of the medical and pathology record by physicians. Overall, more than 97% of self-reported colorectal cancers were confirmed by medical-record review. Information was abstracted on histology and primary location. The rate of follow-up evaluation has been high: as a proportion of the total possible follow-up time, follow-up evaluation has been more than 92%. Colorectal cancer cases were ascertained through June 1, 2008. In 1989 -1990, 32,826 women in NHS I mailed blood samples by overnight courier, which were aliquoted into buffy coat and stored in liquid nitrogen. In 2001-2004, 29,684 women in NHS I who did not previously provide a blood sample mailed a swish-and-spit sample of buccal cells. Incident cases were defined as those occurring after the subject provided a blood or buccal sample. Prevalent cases were defined as those occurring after enrollment in the study in 1976 but before the subject provided either a blood or buccal sample. After excluding participants with histories of cancer (except nonmelanoma skin cancer), ulcerative colitis, or familial polyposis, case-control sets were previously constructed from which DNA was isolated from either buffy coat or buccal cells for genotyping. In addition to colorectal cancer cases and controls, a set of advanced adenoma cases and matched controls with available DNA from buffy coat were selected for genotyping. Over the follow-up period, data were collected on endoscopic screening practices and, if individuals had been diagnosed with a polyp, the polyps were confirmed to be adenomatous by medical record review. Adenoma cases were ascertained through June 1, 2011. A separate case-control set was constructed of participants diagnosed with advanced adenoma matched to control participants who underwent a lower endoscopy in the same time period and did not have an adenoma. Advanced adenoma was defined as an adenoma more than 1 cm in diameter and/or with tubulovillous, villous, or high-grade dysplasia/carcinoma-in-situ histology. Matching criteria included year of birth (within 1 year) and month/year of blood sampling (within 6 months), the reason for their lower endoscopy (screening, family history, or symptoms), and the time period of any prior endoscopy (within 2 years). Controls matched to cases with a distal adenoma either had a negative sigmoidoscopy or colonoscopy examination, and controls matched to cases with proximal adenoma all had a negative colonoscopy. A total of 272 cases and 236 matched controls were sent to CIDR for the advanced adenoma case-control set. Northern Swedish Health and Disease Study (NSHDS): Comprises over 110,000 participants, including approximately one third with repeated sampling occasions, from three population-based cohorts (Dahlin et al., 2010; Myte et al., 2016) [PMID: 20197478; PMID: 27367522]. The largest is the ongoing Vasterbotten Intervention Programme, in which all residents of Vasterbotten County are invited to a health examination upon turning 30 (some years), 40, 50 and 60 years of age. Extensive measured and self-reported health and lifestyle data, as well as blood samples for central biobanking in Umea, Sweden, are collected at the health exam. Leucocyte DNA samples for 1:1-matched CRC case-control sets from the NSHDS, of which 878 samples are included in this study, have been selected for genotyping. This is in addition to 354 samples from the NSHDS previously analyzed as part of the multicenter EPIC cohort. Cancer-specific and overall survival data are available for all patients. For at least 425 patients, archival tumor tissue has been analyzed for the BRAF V600E mutation and by sequencing codon 12 and 13 for KRAS mutations, as well as for MSI screening status by immunohistochemistry and for an eight-gene CIMP panel using quantitative real-time PCR (MethyLight). Ohio Colorectal Cancer Prevention Initiative (OCCPI, OSUMC): OCCPI (ClinicalTrials.gov identifier: NCT01850654) is a population-based study of colorectal cancer patients diagnosed in one of 51 hospitals throughout the state of Ohio from January 1, 2013 through December 31, 2016. The OCCPI was created to decrease CRC incidence in Ohio by identifying patients with hereditary predisposition (statewide universal tumor screening for newly diagnosed CRC patients), increase colonoscopy compliance for first-degree relatives of CRC patients, and encourage future research through the creation of a biorepository. The 51 Ohio hospitals participating in the OCCPI were selected to represent a cross-section of clinical centers in the state based on high reported volume of CRC patients, affiliation with a high volume hospital, or interest in participation. Institutional Review Board (IRB) approval was obtained by the individual hospitals, Community Oncology Programs, or by ceding review to the OSU IRB. Written informed consent was obtained. A total of 2139 colorectal cases were genotyped. Patients were considered eligible for this study if they were age 18 or older at the time of enrollment, if they had a surgical resection (or biopsy if unresectable) in the state of Ohio demonstrating an adenocarcinoma of the colorectum from 1/1/13 - 12/31/16. Matched control samples were selected from the Ohio State University Medical Center's (OSUMC) Human Genetics Sample Bank in an identical way to the selection for the Columbus-area HNPCC Study (please refer to the description for the Columbus-area HNPCC Study). Prostate, Lung, Colorectal and Ovarian Cancer Screening Trail (PLCO): PLCO enrolled 154,934 participants (men and women, aged between 55 and 74 years) at ten centers into a large, randomized, two-arm trial to determine the effectiveness of screening to reduce cancer mortality. Sequential blood samples were collected from participants assigned to the screening arm. Participation was 93% at the baseline blood draw. In the observational (control) arm, buccal cells were collected via mail using the "swish-and-spit" protocol and participation rate was 65%. Details of this study have been previously described (Huang et al., 2016) [PMID: 27673363] and are available online (http://dcp.cancer.gov/plco). For this study 1651 advanced adenoma cases and 1392 controls were selected for genotyping. Selenium and Vitamin E Prevention Trial (SELECT): The Selenium and Vitamin E Cancer Prevention Trial (SELECT) was a double-blind, placebo controlled clinical trial which explored using selenium and vitamin E alone and in combination to prevent prostate cancer in healthy men (Lippman et al., 2009) [PMID: 19066370]. Secondary endpoints included the prevention of colorectal and lung cancers. SELECT was conducted at 427 sites and centers in the United States, Canada and Puerto Rico; 35,533 men 55 years and older (50 or older if African American) were randomized beginning August 22, 2001. Supplementation was discontinued on October 23, 2008 due to futility. 308 colorectal cancer cases and 308 matched controls were selected from the SELECT population and sent for genotyping. Screening Markers For Colorectal Disease Study and Colonoscopy and Health Study (SMS-REACH): Details on this study population were previously reported (Burnett-Hartman et al., 2014) [PMID: 24875374]. Participants were enrollees in an integrated health-care delivery system in western Washington State (Group Health Cooperative, Seattle, Washington) aged 24-79 years who underwent an index colonoscopy for any indication between 1998 and 2007 and donated a buccal-cell or blood sample for genotyping analysis. Study recruitment took place in 2 phases, with phase 1 occurring in 1998-2003 and phase 2 occurring in 2004-2007. Persons who had undergone a colonoscopy less than 1 year prior to the index colonoscopy, persons with inadequate bowel preparation for the index colonoscopy, and persons with a prior or new diagnosis of colorectal cancer, a familial colorectal cancer syndrome (such as familial adenomatous polyposis), or another colorectal disease were ineligible. Patients diagnosed with adenomas or serrated polyps and persons who were polyp-free at the index colonoscopy (controls) were systematically recruited during both phases of recruitment. Approximately 75% agreed to participate and provided written informed consent. Based on medical records, persons who agreed to participate and those who refused study participation were similar with respect to age, sex, and colorectal polyp status. Study protocols were approved by the institutional review boards of the Group Health Cooperative and the Fred Hutchinson Cancer Research Center (Seattle, Washington). A total of 575 cases and 508 matched were selected for the study. Controls were matched to CRC cases as 1:1 ratio. Matching was done on age_ref, race, and sex. Age_ref was matched on +-5 years. The Women's Health Initiative (WHI): WHI is a long-term national health study that has focused on strategies for preventing heart disease, breast and colorectal cancer, and osteoporotic fractures in postmenopausal women. The original WHI study included 161,808 postmenopausal women enrolled between 1993 and 1998. The Fred Hutchinson Cancer Research Center in Seattle, WA serves as the WHI Clinical Coordinating Center for data collection, management, and analysis of the WHI. The WHI has two major parts: a partial factorial randomized Clinical Trial (CT) and an Observational Study (OS); both were conducted at 40 Clinical Centers nationwide. The CT enrolled 68,132 postmenopausal women between the ages of 50-79 into trials testing three prevention strategies. If eligible, women could choose to enroll in one, two, or all three of the trial components. The components are: Hormone Therapy Trials (HT): This double-blind component examined the effects of combined hormones or estrogen alone on the prevention of coronary heart disease and osteoporotic fractures, and associated risk for breast cancer. Women participating in this component with an intact uterus were randomized to estrogen plus progestin (conjugated equine estrogens [CEE], 0.625 mg/d plus medroxyprogesterone acetate [MPA] 2.5 mg/d] or a matching placebo. Women with prior hysterectomy were randomized to CEE or placebo. Both trials were stopped early, in July 2002 and March 2004, respectively, based on adverse effects. All HT participants continued to be followed without intervention until close-out. Dietary Modification Trial (DM): The Dietary Modification component evaluated the effect of a low-fat and high fruit, vegetable and grain diet on the prevention of breast and colorectal cancers and coronary heart disease. Study participants were randomized to either their usual eating pattern or a low-fat dietary pattern. Calcium/Vitamin D Trial (CaD): This double-blind component began 1 to 2 years after a woman joined one or both of the other clinical trial components. It evaluated the effect of calcium and vitamin D supplementation on the prevention of osteoporotic fractures and colorectal cancer. Women in this component were randomized to calcium (1000 mg/d) and vitamin D (400 IU/d) supplements or a matching placebo. The Observational Study (OS)examines the relationship between lifestyle, environmental, medical and molecular risk factors and specific measures of health or disease outcomes. This component involves tracking the medical history and health habits of 93,676 women not participating in the CT. Recruitment for the observational study was completed in 1998 and participants were followed annually for 8 to 12 years. All centrally confirmed cases of invasive colorectal cancers, or deaths from colorectal cancer were selected as potential cases from September 30, 2015 database. Controls were participants free of colorectal cancer (invasive or in situ) as of September 30, 2015. Potential cases and controls were excluded if they (1) were non-White; (2) had history of colorectal cancers at baseline; (3) lost to follow-up after enrollment; (4) DbGAP ineligible; (5) had <1.25ug of DNA; (6) selected for WHI study M26 Phase I or II; (7) selected for WHI study AS224 and also included in the imputation project. A total of 578 cases and 104,429 controls met the eligibility criteria. Each case was matched with 1 control (1:1) that exactly met the following matching criteria: age (+-5 years), 40 randomization centers (exact), WHI date (+-3 years), CaD date (+-3 years), OS flag (exact), HRT assignments (exact), DM assignments (exact), and CaD assignments (exact). Control selection was done in a time-forward manner, selecting one control for each case from the risk set at the time of the case's event. The matching algorithm was allowed to select the closest match based on a criteria to minimize an overall distance measure (Bergstralh EJ, Kosanke JL. Computerized matching of cases to controls. Technical Report #56, Department of Health Sciences Research, Mayo Clinic, Rochester MN. April 1995). Each matching factor was given the same weight. When exact matches could not be found, the matching criteria were gradually relaxed among unmatched cases and controls until all cases had found matched controls. Using the matching criteria specified above, 559 of the 578 eligible cases found exact matches. The matching criteria was then relaxed to : Age+-5, randomization centers, WHI date +- 3 years, CaD date +- 3 years, OS flag, HRT flag, DM flag, CaD flag. 17 of the remaining 19 unmatched cases found matched controls. By matching on Age+-5, randomization centers, WHI date +- 3 years, CaD date +- 3 years, OS flag, HRT flag, the remaining 2 unmatched cases found their matches.