In collaboration with Dr Robert Semple we have identified a family harbouring an autosomal dominant variant, which leads to severe insulin resistance (SIR), short stature and facial dysmorphism. This family is unique within the SIR cohort in having normal lipid profiles, preserved adiponectin and normal INSR expression and phosphorylation. DNA is available for 7 affected and 7 unaffected family members across 3 generations. All 14 samples have been genotyped using microsatellites and the Affymetrix 6.0 SNP chip. Linkage analysis identified an 18.8Mb haplotype on chromosome 19 as a possible location of the causative variant. However, Exome sequencing of 3 affected and 1 unaffected family members has not identified the causative variant suggesting the possibility of an intronic or intergenic variant in this region or elsewhere in the genome. We propose to conduct whole genome sequencing of 5 members of the pedigree at a depth of 20X. The chosen samples are two sets of parents plus one member of an unaffected branch of the pedigree who shares the risk haplotype on chromosome 19. Sequencing of the two sets of parents will be used along with the genome-wide SNP data to impute 4 affected children giving an effect sample size of 6 affected individuals.
This dataset is a time-series of EGFR-mutant NSCLC clinical specimens from an individual patient profiled using tumor-based whole exome sequencing and the data is in BAM format. DNA was extracted from FFPE for primary tumor and frozen tumor tissue samples and matched non-tumor tissue using the Qiagen Allprep DNA/RNA Mini Kit. The library preparation protocol was based on the Agilent SureSelect Library Prep and Capture System. DNA was resuspended in a low TE buffer and sheared (Duty Cycle 5%; Intensity 175; Cycles/Burst: 200; Time: 300s, Corvaris S2 Utrasonicator). Bar-coded exome libraries were prepared using the Agilent Sure Select V5 library kit per manfucaturer’s specifications. The libraries were run on the HiSeq2500. Raw paired end reads (100bp) in FastQ format generated by the Illumina pipeline were aligned to the full hg19 genomic assembly obtained from USCS, gencode 14, using bwa version 0.7.12. Picard tools version 1.117 was used to sort, remove duplicate reads and generate QC statistics. Tumor DNA was sequenced to median depth of 303X (range 114.39-383.41) and the matched germline DNA to average depth of 231.65.
Clear cell renal cancer is characterized by near-universal loss of the short arm of chromosome 3 (3p). This event arises through unknown mechanisms, but critically results in the loss of several tumor suppressor genes. We analyzed whole genomes from 95 biopsies across 33 patients with clear cell renal cancer (ccRCC) recruited into the Renal TRACERx study. We find novel hotspots of point mutations in the 5'-UTR of TERT, targeting a MYC-MAX repressor, that result in telomere lengthening. The most common structural abnormality generates simultaneous 3p loss and 5q gain (36% patients), typically through chromothripsis. Using molecular clocks, we estimate this occurs in childhood or adolescence, generally preceding emergence of the most recent common ancestor by years to decades. Similar genomic changes recent common ancestor by years to decades. Similar genomic changes are seen in inherited kidney cancers. Modeling differences in age-incidence between inherited and sporadic cancers suggests that the number of cells with 3p loss capable of initiating sporadic tumors is no more than a few hundred. Targeting essential genes in deleted regions of chromosome 3p could represent a potential preventative strategy for renal cancer.
Employed single cell RNA sequencing and protein surface marker profiling of serialCAR-T cell samples from patients with non-Hodgkin’s lymphoma (NHL) to reveal CAR-T cell evolution, identify biomarkers of response, and test for evidence of exhaustion inCAR-T cells of poor responders. At the transcriptional and protein levels, we note the evolution of CAR-T cells toward a non-proliferative, highly-differentiated, andexhausted state that is enriched in CAR-T cells of patients with poor response.Furthermore, we identified the checkpoint receptor TIGIT as a novel prognosticbiomarker and potential driver of CAR-T cell exhaustion.
Introduction to V2: This data release comprises data from the V1 release combined with approximately 3,000 additional samples, collected during the HRS 2010 field period. The 2010 data include samples from a random half of the new cohort enrolled in 2010 along with a significant expansion of the minority sample. Description: The University of Michigan Health and Retirement Study (HRS) is a longitudinal panel study that surveys a representative sample of approximately 20,000 people in America over the age of 50 every two years. Supported by the National Institute on Aging (NIA U01AG009740) and the Social Security Administration, the HRS explores the changes in labor force participation and the health transitions that individuals undergo toward the end of their work lives and in the years that follow. The study collects information about income, work, assets, pension plans, health insurance, disability, physical health and functioning, cognitive functioning, and health care expenditures. Through its unique and in-depth interviews, the HRS provides an invaluable and growing body of multidisciplinary data that researchers can use to address important questions about the challenges and opportunities of aging. Because of its innovation and importance, the HRS has become the model and hub for a growing network of harmonized longitudinal aging studies around the world. Origins of the HRS. As the population ages it is increasingly important to obtain reliable data about aging and topics that are relevant to a range of policy issues in aging. To address this need, the National Institutes on Aging (NIA) established a cooperative agreement with the University of Michigan Institute for Social Research to collect such data. The HRS launched data collection in 1992 and has re-interviewed the original sample of respondents every two years since then. By adding new cohorts and refreshing the sample, the HRS has grown to become the largest, most representative longitudinal panel study of Americans 50 years and older. HRS Study Design. The target population for the original HRS cohort includes all adults in the contiguous United States born during the years 1931-1941 who reside in households, with a 2:1 oversample of African-American and Hispanic populations. The original sample is refreshed with new birth cohorts (51-56 years of age) every six years. The sample has been expanded over the years to include a broader range of birth cohorts as well. The target population for the AHEAD survey consists of United States household residents who were born in 1923 or earlier. Children of the Depression (CODA) recruits households born 1924-1930, War Babies 1942-47, Early Boomers 1948-53, and Mid-Boomers 1954-59. Data collection includes a mixed mode design combining in-person, telephone, mail, and Internet. For consenting respondents, HRS data are linked at the individual level to administrative records from Social Security and Medicare claims. Genetic Research in the HRS. The HRS has genotyped 2.5 million single nucleotide polymorphisms (SNPs) on respondents using Illumina's Human Omni2.5-Quad (Omni2.5) BeadChip. The genotyping was performed by the NIH Center for Inherited Disease Research (CIDR). Saliva was collected on half of the HRS sample each wave starting in 2006. In 2006, saliva was collected using a mouthwash collection method. From 2008 onward, the data collection method switched to the Oragene kit. Saliva completion rates were 83% in 2006, 84% in 2008, and 80% in 2010 among new cohort enrollees. HRS Phenotypic data. Phenotypic data are available on a variety of dimensions. Health measures include physical/psychological self-report, various health conditions, disabilities, cognitive performance, health behaviors (smoking, drinking, exercise), physical performance and anthropomorphic measures, and biomarkers (HbA1c, Total Cholesterol, HDL, CRP, Cystatin-C). Data are also available on health services including utilization, insurance and out-of-pocket spending with linkage to Medicare records. Economic measures include employment status/history, earnings, disability, retirement, type of work, income by source, wealth by asset type, capital gains/debt, consumption, linkage to pensions, Social Security earnings/benefit histories. There is also extensive information on family structure, proximity, transfers to/from of money, time, social and psychological characteristics, as well as a wide range of demographics. Performance on a cognitive test combining immediate and delayed word recall was selected as an example trait for the dbGaP data release. In the immediate word recall task the interviewer reads a list of 10 nouns to the respondent and asks the respondent to recall as many words as possible from the list in any order. After approximately five minutes of asking other survey questions, the respondent is asked to recall the nouns previously presented as part of the immediate recall task. The total recall score is the sum of the correct answers to these two tasks, with a range of 0 to 20. Researchers who wish to link to other HRS measures not in dbGaP will be able to apply for access from HRS. A separate Data Use Agreement (DUA) will be required for linkage to the HRS data. See the HRS website (http://hrsonline.isr.umich.edu/gwas) for details.
PCR products were obtained from each target loci using genomic DNA from human iPS cells. Subsequently, PCR products are pooled and subjected to Illumina library preparation. The library will be sequenced by MiSeq. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
The dataset consist of DNA and RNA sequencing results and metadata of the samples. All sample numbers starting with 6716 are tumor samples which has been sequenced using WES (see BAM files). It concerns biopsies of metastatic lesions from patients with BRAFV600 mutated melanoma, obtained before, during and after the study treatment (see samples metadata) and in some cases blood for germline mutation analysis. Sequencing is performed using the Illumina Novaseq 6000 system.
This research project was a collaboration between Trinity College Dublin, Ireland and the Stanley Center at the Broad Institute. In this project we sequenced and analyzed the whole exomes of 191 Bipolar case/control samples from collaborators in Ireland. Genomic DNA from each sample was sequenced to a mean depth of 20x. The project used Illumina WXS sequencing of DNA and the file type is cram.
Our study describes for the first time the myeloid cell landscape in human visceral adipose tissue at single-cell level within a cohort of well-characterized patients with non-alcoholic fatty liver disease.
The iPSC line SIGi001-A-13 / SAMEA104386250 has undergone whole genome sequencing. Whole genome sequencing was done using Illumina HiSeq X Five platform with a reference genome of GRCh38. Raw data as FASTQ files and analysed data as CRAM files are available for this sample, in this dataset. The iPSC line SIGi001-A-13 is available for research use at www.EBiSC.org.