Copied to clipboard!

GCAT | Genomes for life: cohort study of the genomes of Catalonia

The GCAT Study have recruited 20 000 participants aged 40–65 years. Participants who agreed to take part in the study completed a self-administered computer-driven questionnaire, and underwent blood pressure, cardiac frequency and anthropometry measurements. For each participant, blood plasma, blood serum and white blood cells are collected at baseline. A total of 5459 genomic profiles have been characterised by comprehensive genotyping. Genome-wide genotypes have been generated using Illumina Infinium SNP-bead array technology. We chose the Multi-Ethnic Global (MEGAEX, V.2) consortium array, a multipurpose, multiethnic genotyping array with two million selected markers (including previously described germline mutations, insertions-deletions (InDels) and SNPs).We have strictly followed the standard manufacturer recommended automated protocol for the Infinium HTS Assay scanned with a HiScan confocal scanner (Illumina, San Diego, California, USA). Genome Studio V.2011.1 has been used for raw data analysis. Genotyping was performed at the Genomics and Bioinformatics Unit of the PMPPC Institute for Health Science Research Germans Trias i Pujol, in Badalona, Spain. Future plans The first follow-up study started in December 2017 and will end by March 2018. Residences of all subjects will be geocoded during the following year. Several genomic analyses are ongoing, and metabolomic and genomic integrations will be performed to identify underlying genetic variants, as well as environmental factors that influence metabolites. http://dx.doi.org/10.1136/bmjopen-2017-018324

Type: Other
Archiver: European Genome-phenome Archive (EGA)

15 Datasets 7 Publications

Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data

Dataset ID	Description	Technology	Samples
EGAD00001007729	Sex, age at recruitment (2014-2018), and birthdate of GCAT Cohort individuals.		1
EGAD00001007730	First 20 principal components of 4988 genotyped GCAT Cohort individuals with Infinium Multi-Ethnic Global (MEGAEX2) array, with data for Cr1-22. Plink files with QC and imputed (SHAPEIT+IMPUTE).		1
EGAD00001007731	Disease diagnoses of GCAT Cohort participants obtained from electronic health records (EHR), mainly including the time period from 2012 to 2017. Disease diagnoses are codified in ICD-9, and the position of diagnosis refers to primary/secondary diagnoses (up to 14 secondary diagnoses per visit). The date and origin of the visit are also specified (AP: primary care, UGR: emergency, AH: hospital care, SMA: outpatient medical service, SMH: hospital medical service).		1
EGAD00001007774	This dataset contains genotypes (35.4M of SNVs, Indels and SVs), from 785 samples, after QC filtering, from the 808 WGS GCAT cohort.		1
EGAD00001008201	This dataset include FASTQ files of 808 samples from GCAT cohort. Technology used HiSeq 4000, read length 150 bp, inner mate distance 300 bp. For each sample the paired -ends are generated in separated files. Each FASTQ is splitted in multiple LANEs and grouped by the Multiplex index.	Illumina HiSeq 4000	1
EGAD00001008202	This dataset include BAM files of 808 samples from GCAT cohort. Technology used HiSeq 4000, read length 150 bp, inner mate distance 300 bp. For each sample the paired -ends are generated in separated files. Each FASTQ is splitted in multiple LANEs and grouped by the Multiplex index.		1
EGAD00001008210	This dataset contains raw genotypes ( SNVs, Indels and SVs), from 785 samples,without applying any filter, from the 808 WGS GCAT cohort.		1
EGAD00010001664	4988 samples issued from GCAT cohort, genotyped with MEGAex-Infinium Array, with data for Cr1-22. Plink files with QC and imputed (SHAPEIT+IMPUTE).	Illumina-Genotyping Array	4988
EGAD00010001665	4988 samples issued from GCAT cohort, genotyped with MEGAex-Infinium Array, with data for Cr1-22. Plink files with QC but not imputed.	Illumina-Genotyping Array	4988
EGAD00010002152	This resource contains the SV annotations using the AnnotSV tool. The description of annotations can be found in AnnotSV web page https://lbgi.fr/AnnotSV/ or GCAT-BSC web page: http://cg.bsc.es/GCAT_BSC_iberianpanel	Illumina HiSeq 4000	785
EGAD00010002153	This dataset includes the .hap, .legend and .sample files from the GCAT\|Panel (Iberian reference panel), built from 785 samples, after QC, from the 808 WGS GCAT cohort, including 30.3M SNVs, 5M Indels and 89K SVs. This resource has been generated using Shapeit4 and WhatsHap software. Technology used HiSeq 4000, read length 150 bp, inner mate disatance 300 bp.	Illumina HiSeq 4000	785
EGAD00010002740	This dataset contains DNA methylation data from 400 individuals (200 Type 2 Diabetes cases and 200 controls) from the GCAT (Genomes for Life) cohort. The methylation profiling was performed using the Illumina Infinium MethylationEPIC v2.0 BeadChip, which offers comprehensive coverage of over 850,000 CpG sites. Data are provided in IDAT file format, enabling raw signal-level analysis and downstream processing.	Infinium MethylationEPIC v2.0	400
EGAD00010002749	4988 samples issued from GCAT cohort, genotyped with MEGAex-Infinium Array, with data for Cr1-23. Plink files with QC and imputed using TOPMed r2.	Illumina-Genotyping Array	4988
EGAD00010002758	2746 samples issued from GCAT cohort, genotyped with GSA Array, with data for Cr1-23. Plink files with QC and imputed using TOPMed.	Illumina-Genotyping Array	2746
EGAD50000001378	This dataset comprises targeted sequencing data of 52 genes previously implicated in severe COVID-19 outcomes. The study includes samples from 764 individuals with severe COVID-19 and 3,939 population-based controls from the GCAT cohort (Spain). Molecular Inversion Probes (MIPs) were utilized for cost-effective and precise sequencing of the selected genes. The targeted genes include: Inflammasome/IL-1/TNF Pathway: NLRP3, CASP1, CASP8, IL1B, TNF, RIPK1, RIPK3, MYD88, TNFRSF13B SARS-CoV-2 Entry/Replication: ACE2, TMPRSS2, FURIN, SLC6A20, DDX1, DDX58, TLR4, FYCO1, CTSB, CTSL, ADAM17 Complement System: MBL2, CFH, CFI, CFB, ADAM10, CD46 Interferon Signaling: TLR3, IFIH1, IFITM3, TBK1, TLR7, IL10RB, IFNAR1, IFNAR2, SIGLEC1, MYD88, IFNGR1 Chemokine Receptor Signaling: CCR1, CCR3, CCR2, CCR9, IL8, CXCL3, CXCL10, CXCR6, XCR1, CCL2, CCL20 Immunodeficiency Genes: CASP8, CD46, CFB, CFH, CFI, IFNAR1, IFNAR2, IFNGR1, IFIH1, MYD88, NLRP3, RIPK1, TBK1, TLR3, TLR7	Illumina NovaSeq 6000	2294

Publications	Citations
GCAT\|Genomes for life: a prospective cohort study of the genomes of Catalonia. Obón-Santacana M, Vilardell M, Carreras A, Duran X, Velasco J, Galván-Femenía I, Alonso T, Puig L, Sumoy L, Duell EJ, Perucho M, Moreno V, de Cid R. BMJ Open 8: 2018 e018324	47
GCAT\|Panel, a comprehensive structural variant haplotype map of the Iberian population from high-coverage whole-genome sequencing. Valls-Margarit J, Galván-Femenía I, Matías-Sánchez D, Blay N, Puiggròs M, Carreras A, Salvoro C, Cortés B, Amela R, Farre X, Lerga-Jaso J, Puig M, Sánchez-Herrero JF, Moreno V, Perucho M, Sumoy L, Armengol L, Delaneau O, Cáceres M, de Cid R, Torrents D. Nucleic Acids Res 50: 2022 2464-2479	9
Y-chromosome target enrichment reveals rapid expansion of haplogroup R1b-DF27 in Iberia during the Bronze Age transition. García-Fernández C, Lizano E, Telford M, Olalde Í, de Cid R, Larmuseau MHD, M de Pancorbo M, Calafell F. Sci Rep 12: 2022 20708	0
Skin Phototype and Disease: A Comprehensive Genetic Approach to Pigmentary Traits Pleiotropy Using PRS in the GCAT Cohort. Farré X, Blay N, Cortés B, Carreras A, Iraola-Guzmán S, de Cid R. Genes (Basel) 14: 2023 149	14
Identification of intergenerational epigenetic inheritance by whole genome DNA methylation analysis in trios. Díez-Villanueva A, Martín B, Moratalla-Navarro F, Morón-Duran FD, Galván-Femenía I, Obón-Santacana M, Carreras A, de Cid R, Peinado MA, Moreno V. Sci Rep 13: 2023 21266	4
Multiomic integration analysis identifies atherogenic metabolites mediating between novel immune genes and cardiovascular risk. Carreras-Torres R, Galván-Femenía I, Farré X, Cortés B, Díez-Obrero V, Carreras A, Moratalla-Navarro F, Iraola-Guzmán S, Blay N, Obón-Santacana M, Moreno V, de Cid R. Genome Med 16: 2024 122	1
Inferring past demography and genetic adaptation in Spain using the GCAT cohort. Garcia-Calleja J, Biagini SA, de Cid R, Calafell F, Bosch E. Sci Rep 15: 2025 14225	3