This dataset included cfMethyl-Seq data of 15 plasma cfDNA samples from 15 lung cancer patients and RRBS data of 58 lung tumor tissue samples from 58 lung cancer patients. The data were generated following the standard protocols.
This dataset contains RNA Seq, 10x scRNA Seq and Exome sequencing of glioblastoma samples. Sequencing was performed on a Illumina NovaSeq 6000 and Illumina HiSeq 4000. The sequencing was always paired.
Creation of living organoid biobank for ewing and ewing-like sarcomas
Sequencing of independent unrelated cases of alopecia areata (total/ universal) and vitiligo (>10%, affected family member). DNA obtained from blood
Whole genome sequencing on HAP1 clones wild-type or knockout for MMR genes PMS1, PMS2, MLH1 or MSH6. Libraries were prepared using the Kapa HyperPrep kit (Roche, 07962363001). Paired-end libraries (2 x 150 bp) were sequenced on a NovaSeq 6000 instrument (Illumina).
The Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) is a collaborative effort comprised of a coordinating center and scientific researchers from well-characterized cohort and case-control studies. This international consortium aims to accelerate the discovery of common and rare genetic risk variants for colorectal cancer by conducting large-scale meta-analyses of existing and newly generated genome-wide association study (GWAS) data, whole genome sequencing, replicating and fine-mapping of genetic discoveries, and investigating how genetic risk variants are modified by environmental risk factors. To expand these efforts, we assembled case-control sets or nested case-control sets from 6 different North American or European studies. Summary descriptions and study participant inclusions/exclusion criteria for each of these studies are detailed below. Cancer Prevention Study II (CPS II): The CPS II Nutrition cohort is a prospective study of cancer incidence and mortality in the United States, established in 1992 and described in detail elsewhere (Calle et al., 2002 PMID:12015775; Campbell et al., 2014 PMID:25472679). At enrollment, participants completed a mailed self-administered questionnaire including information on demographic, medical, diet, and lifestyle factors. Follow-up questionnaires to update exposure information and to ascertain newly diagnosed cancers were sent biennially starting in 1997. Reported cancers were verified through medical records, state cancer registry linkage, or death certificates. The Emory University Institutional Review Board approves all aspects of the CPS II Nutrition Cohort. We restricted to samples that had blood DNA source. Controls were matched to cases in a case/control ratio of 2:1 on reference year and sex. Darmkrebs: Chancen der Verhütung durch Screening (DACHS): This German study was initiated as a large population-based case-control study in 2003 in the Rhine-Neckar-Odenwald region (southwest region of Germany) to assess the potential of endoscopic screening for reduction of colorectal cancer risk and to investigate etiologic determinants of disease, particularly lifestyle/environmental factors and genetic factors. Cases with a first diagnosis of invasive colorectal cancer (International Classification of Diseases 10 codes C18-C20) who were at least 30 years of age (no upper age limit), German speaking, a resident in the study region, and mentally and physically able to participate in a one-hour interview, were recruited by their treating physicians either in the hospital a few days after surgery, or by mail after discharge from the hospital. Cases were confirmed based on histologic reports and hospital discharge letters following diagnosis of colorectal cancer. All hospitals treating colorectal cancer patients in the study region participated. Based on estimates from population-based cancer registries, more than 50% of all potentially eligible patients with incident colorectal cancer in the study region were included. Community-based controls were randomly selected from population registries, employing frequency matching with respect to age (5-year groups), sex, and county of residence. Controls with a history of colorectal cancer were excluded. Controls were contacted by mail and follow-up calls. The participation rate was 51%. During an in-person interview, data were collected on demographics, medical history, family history of CRC, and various life-style factors, as were blood and mouthwash samples. Routine formalin-fixed, paraffin-embedded (FFPE) tumor samples from the patients enrolled were requested from the pathology institutes and used for tumor tissue analyses. This analysis includes participants with blood source DNA that were recruited up to 2010 in this ongoing study. Controls were matched to cases on reference age and sex in a case/control ratio of 2:1. Health Professionals Follow-up Study (HPFS): A parallel prospective study to the NHS (Nurses' Health Study). The HPFS cohort comprised 51,529 men aged 40-75 who, in 1986, responded to a mailed questionnaire (Rimm et al., 1990 PMID:2090285). Participants provided information on health related exposures, including current and past smoking history, age, weight, height, diet, physical activity, aspirin use, and family history of colorectal cancer. Colorectal cancer and other outcomes were reported by participants or next-of-kin and were followed up through review of the medical and pathology record by physicians. Overall, more than 97% of self-reported colorectal cancers were confirmed by medical record review. Information was abstracted on histology and primary location. Incident cases were defined as those occurring after the subject provided the blood sample. Prevalent cases were defined as those occurring after enrollment in the study but before the subject provided the blood sample. Follow-up evaluation has been excellent, with 94% of the men responding to date. Colorectal cancer cases were ascertained through January 1, 2008. In 1993-1995, 18,825 men in the HPFS mailed blood samples by overnight courier, which were aliquoted into buffy coat and stored in liquid nitrogen. In 2001-2004, 13,956 men in the HPFS who had not provided a blood sample previously mailed in a swish-and-spit sample of buccal cells. Incident cases were defined as those occurring after the subject provided a blood or buccal sample. Prevalent cases were defined as those occurring after enrollment in the study in 1986, but before the subject provided either a blood or buccal sample. Participants with histories of cancer (except nonmelanoma skin cancer), ulcerative colitis, or familial polyposis, case-control sets were excluded. Control participants were required to be free of invasive colorectal cancer and non-invasive (stage 0 in situ) colorectal cancer. For this study, only European ancestry participants with blood source DNA and incident colorectal cancer cases were eligible for selection. Since enrollment year and sex matched exactly, controls were randomly selected in a case/control ratio of 2:1. Nurses Health Study (NHS): The NHS cohort began in 1976 when 121,700 married female registered nurses age 30-55 years returned the initial questionnaire that ascertained a variety of important health-related exposures (Belanger et al., 1978 PMID:248266). Since 1976, follow-up questionnaires have been mailed every 2 years. Colorectal cancer and other outcomes were reported by participants or next-of-kin and followed up through review of the medical and pathology record by physicians. Overall, more than 97% of self-reported colorectal cancers were confirmed by medical-record review. Information was abstracted on histology and primary location. The rate of follow-up evaluation has been high: as a proportion of the total possible follow-up time, follow-up evaluation has been more than 92%. Colorectal cancer cases were ascertained through June 1, 2008. In 1989-1990, 32,826 women in NHS I mailed blood samples by overnight courier, which were aliquoted into buffy coat and stored in liquid nitrogen. In 2001-2004, 29,684 women in NHS I who did not previously provide a blood sample mailed a swish-and-spit sample of buccal cells. Incident cases were defined as those occurring after the subject provided a blood or buccal sample. Prevalent cases were defined as those occurring after enrollment in the study in 1976 but before the subject provided either a blood or buccal sample. Participants with histories of cancer (except nonmelanoma skin cancer), ulcerative colitis, or familial polyposis, case-control sets were excluded. For this study, only European ancestry participants with blood source DNA and incident colorectal cancer cases were eligible for selection. Since enrollment year and sex matched exactly, controls were randomly selected in a case/control ratio of 2:1. Prostate, Lung, Colorectal and Ovarian Cancer Screening Trail (PLCO): PLCO enrolled 154,934 participants (men and women, aged between 55 and 74 years) at ten centers into a large, randomized, two-arm trial to determine the effectiveness of screening to reduce cancer mortality. Sequential blood samples were collected from participants assigned to the screening arm. Participation was 93% at the baseline blood draw. White colorectal cancer cases with a family history of colorectal cancer (no history of ulcerative colitis, Crohn's Disease, diverticulitis, Gardner's syndrome, Familial Polyposis) and successful genotyping from previous Peters GWAS were selected for this project. Controls were matched to cases on reference age and sex in a case/control ratio of 2:1. Women's Health Initiative (WHI): WHI is a long-term national health study that has focused on strategies for preventing heart disease, breast and colorectal cancer, and osteoporotic fractures in postmenopausal women. The original WHI study included 161,808 postmenopausal women enrolled between 1993 and 1998. The Fred Hutchinson Cancer Research Center in Seattle, WA serves as the WHI Clinical Coordinating Center for data collection, management, and analysis of the WHI. The WHI has two major parts: a partial factorial randomized Clinical Trial (CT) and an Observational Study (OS); both were conducted at 40 Clinical Centers nationwide. The CT enrolled 68,132 postmenopausal women between the ages of 50-79 into trials testing three prevention strategies. If eligible, women could choose to enroll in one, two, or all three of the trial components. The components are: Hormone Therapy Trials (HT): This double-blind component examined the effects of combined hormones or estrogen alone on the prevention of coronary heart disease and osteoporotic fractures, and associated risk for breast cancer. Women participating in this component with an intact uterus were randomized to estrogen plus progestin (conjugated equine estrogens [CEE], 0.625 mg/d plus medroxyprogesterone acetate [MPA] 2.5 mg/d) or a matching placebo. Women with prior hysterectomy were randomized to CEE or placebo. Both trials were stopped early, in July 2002 and March 2004, respectively, based on adverse effects. All HT participants continued to be followed without intervention until close-out. Dietary Modification Trial (DM): The Dietary Modification component evaluated the effect of a low-fat and high fruit, vegetable and grain diet on the prevention of breast and colorectal cancers and coronary heart disease. Study participants were randomized to either their usual eating pattern or a low-fat dietary pattern. Calcium/Vitamin D Trial (CaD): This double-blind component began 1 to 2 years after a woman joined one or both of the other clinical trial components. It evaluated the effect of calcium and vitamin D supplementation on the prevention of osteoporotic fractures and colorectal cancer. Women in this component were randomized to calcium (1000 mg/d) and vitamin D (400 IU/d) supplements or a matching placebo. The Observational Study (OS) examines the relationship between lifestyle, environmental, medical and molecular risk factors and specific measures of health or disease outcomes. This component involves tracking the medical history and health habits of 93,676 women not participating in the CT. Recruitment for the observational study was completed in 1998 and participants were followed annually for 8 to 12 years. All centrally confirmed White cases of invasive colorectal cancer, or death from colorectal cancer were selected as potential cases from the March, 2011 database. Case priory lists are: 1) have positive family history of colorectal cancer; 2) randomly select cases until we get a total of n=800 cases. Control participants were required to be White, free of invasive colorectal cancer and non-invasive (stage 0 in situ) colorectal cancer. Centrally denied cases of colorectal cancer were not allowed into the control pool. Case and control participants were subject to the following exclusion criteria: (1) had prior history of colorectal cancer at baseline; (2) had no available DNA (DNA searching as Nov 15, 2012); (3) cannot be deposited to dbGaP; (4) lost to follow-up after enrollment; (5) selected for WHI study M26 Phase II. Controls were matched to cases in a case/control ratio of 2:1. In order to get 2 cases with 1 control, cases were grouped by enrollment year (a total of 5 groups). For each year group, around 50% cases were selected to match controls. In total, 401 cases were selected to match controls. Matching was done on enrollment year, which was matched exactly. For additional information, see dbGaP: phs000200 and ClinicalTrials: NCT00000611.
CHD is the leading cause of death in the United States. One of the most common ways to prevent CHD is to take an anti-platelet agent, which lessens platelet aggregation. Two of the most common anti-platelet agents are aspirin and clopidogrel. However, up to 25% to 30% of people do not respond to these medications. Evidence indicates that treatment response may be related to genetics. The purpose of this study is to determine specific gene variants that predict response to aspirin and clopidogrel therapy. This study is part of a larger group of studies called the Pharmacogenomics Research Network (PGRN). Participants are from the Old Order Amish of Lancaster, Pennsylvania. They are well suited for genetic studies because they are a homogenous, closed, founder population. Participants received 300 mg of clopidogrel on the first day, then 75 mg of clopidogrel per day for the next 6 days. On the last day of clopidogrel treatment, participants took a single dose of 324 mg aspirin. Participants underwent platelet function tests before and after clopidogrel alone, and then again after taking clopidogrel plus aspirin. Using the gene variation profiles across the genome, researchers analyzed which variants correspond to treatment response.
Rare individuals are highly resistant to infection with human immunodeficiency virus (HIV). Studies of candidate genes resulted in the discovery of a 32bp deletion in the CC-chemokine receptor 5 gene (CCR5Δ32), which rendered this critical co-receptor for primary HIV infection to be non-functional. Pharmacologic and vaccine-induced blockade of CCR5 is being pursued to treat and prevent HIV infection and other conditions. The allele frequency of CCR5Δ32 among persons of European ancestry is approximately 10%. CCR5Δ32/Δ32 homozygotes are almost totally resistant to HIV infection. People with severe hemophilia A require frequent replacement with clotting Factor VIII (FVIII) to control hemorrhage. Prior to the discovery of HIV in 1984 and licensure of recombinant FVIII in the late 1980s, people with severe hemophilia A were treated with plasma-derived FVIII and thus were intensively exposed to HIV. Only 5% of such patients were not infected with HIV. Of these, approximately 1/3 were CCR5Δ32/Δ32 homozygotes. The remaining 2/3 of these people who were highly resistant to HIV remain unexplained. This project seeks to discover genome variations among people who are highly resistant to HIV infection. Such variation is likely to serve as a target for reducing the morbidity and incidence of HIV.
Genetic mutations causing disease may be inherited, newly acquired in parental gametes and present in the zygote or acquired as somatic events at some point in development after fertilization. The burden and localization of an acquired mutation depend on when the mutation arises. As is the case for inherited genetic variation, there is accumulating evidence that somatic mutations can lead to severe tissue-specific disease. Malformations of cortical development (MCD) represent a group of disorders characterized by a range of morphological and structural abnormalities of the cerebral cortex reflecting errors in embryonic cortical development. MCD are associated with intellectual disability, as well as refractory epilepsy, and may require the surgical removal of the affected tissue. Inherited gene mutations involved in neuronal development explain only a fraction of MCD cases. In this project we sought to evaluate the hypothesis that somatic mutations disrupt embryonic cortical development and are responsible for a substantial fraction of MCD. Using high coverage next generation sequencing of protein-coding regions in DNA extracted from abnormal brain tissue and unaffected tissue (leukocytes) from patients with three forms of MCD, we sought to identify somatic mutations within each patient with MCD. Exome sequence data from study participants are provided to facilitate research into the genetic underpinnings of MCD.
Developmental brain malformations are at the core of significant neurological diseases affecting many families in the United States and around the world. It is known that epilepsy, specific learning deficits and intellectual disability, cerebral palsy, and abnormalities of brain volume can be attributed in many cases to pathological malformations of the cerebral cortex. Although these consequences, such as epilepsy and intellectual disability, might appear broadly in the population as due to complex traits, this study's focus on those associated with cortical malformations highlights individual developmental pathways likely represented by innumerable and rare Mendelian alleles. Research has thus far uncovered dozens of genes responsible for these conditions and dissected the mechanisms underlying early cortical development in animals. However, this progress represents only the dawn of understanding the complex genetic network and neuronal architecture of the uniquely human cerebral cortex. The overall goal of this study is to define the genetic bases of human cerebral cortical development. This is accomplished through (1) the ascertainment of families with disorders of human brain development and malformation, (2) categorizing these using medical, physical and neuroimaging data, and (3) mapping and identifying the gene causing the disorder of cortical development, which can then be investigated for its normal expression and function, and role in human disease.