T2D-GENES Consortium: San Antonio Mexican American Family Studies (SAMAFS)
The Type 2 Diabetes (T2D) Genetic Exploration by Next-generation sequencing in Ethnic Samples (T2D-GENES) Consortium is a collaborative international effort to identify genes influencing susceptibility to type 2 diabetes in multiple ethnic groups using next generation sequencing. To fulfill this objective, T2D-GENES Consortium undertook two large sequencing studies, called T2D-GENES Projects 1 and 2.
Project 1 has carried out whole exome sequencing of 12,940 individuals, 6,504 with T2D and 6,436 non-diabetic controls, equally divided among five continental ancestry groups: Europeans, East Asians, South Asians, Hispanic Americans, and African Americans. The goal of Project 1 is to identify all genetic variants in the complete coding regions of the genomes (i.e., whole exome) by sequencing, including rare variants.
Project 2 (i.e., SAMAFS substudy 2) is a pedigree-based study designed to identify low frequency or rare variants influencing susceptibility to T2D, using whole genome sequence information from approximately 600 individuals in 20 Mexican American T2D-enriched pedigrees from San Antonio, Texas, augmented with family-based imputation into approximately 440 additional family members. The major objectives of Project 2 are to identify low frequency or rare variants in and around known common variant signals for T2D, as well as to find novel low frequency or rare variants influencing susceptibility to T2D.
Both T2D-GENES Projects 1 and 2 involve the San Antonio Mexican American Family Studies (SAMAFS), which are composed of four San Antonio, Texas-based family studies: the San Antonio Family Heart Study (SAFHS), San Antonio Family Diabetes/Gallbladder Study (SAFDGS), Veterans Administration Genetic Epidemiology Study (VAGES), and Family Investigation of Nephropathy and Diabetes - San Antonio (FIND-SA) Component and its extension called the Extended FIND [E-FIND]. The SAFHS and SAFDGS began in 1991 and have followed participants with extensive clinical phenotyping related to T2D for over 20 years. The VAGES was initiated in 1994 and a large battery of T2D-related phenotypic data has been obtained from its participants. The FIND-SA began in 2000, a part of the multicenter FIND study which was designed to identify genetic determinants of diabetic kidney disease; data from its participants related to T2D were used for this project.
Non-overlapping subsets of SAMAFS participants are part of the T2D-GENES Projects 1 and 2, henceforth referred to as SAMAFS substudies 1 and 2, respectively. The SAMAFS substudies 1 and 2 are part of one of the five awards funded by NIDDK under a cooperative agreement award mechanism, which is governed by the Steering Committee of the T2D-GENES Consortium. Since Project 1 relies on population based subsets of cases and controls, 491 unrelated participants are drawn from the four SAMAFS as part of the T2D-GENES Project 1 Mexican American sample (i.e., SAMAFS substudy 1). The whole exome sequencing was performed at the Broad Institute.
For Project 2, 1048 individuals are drawn from two SAMAFS (SAFHS and SAFDGS), representing 20 large families for substudy 2. The substudy 2 strategy is to sequence approximately 600 individuals at an average of 50x coverage across the entire genome, then impute genome wide genotypes for about 440 additional family members. The 600 sequenced individuals are specifically chosen for their value in imputing sequence information into other family members. By studying large pedigrees, we expect to find multiple individuals carrying each genetic variant, even if this variant is very rare in the population at large. Thus, a pedigree-based approach provides an excellent opportunity for identifying rare novel variants influencing risk of T2D and quantitative variation in T2D-related phenotypes. The whole genome sequencing has been done commercially by Complete Genomics, Inc. (CGI). The available sample of 1,048 includes 5 sequenced individuals who do not belong to any of the 20 large pedigrees. The final family data of 1,043 individuals includes whole genome sequence data for 607 individuals. After quality control, 590 sequenced individuals provide data for family based imputation using Merlin linkage analysis software into approximately 440 additional family members for whom chip based genotypes are available to indicate which parental haplotype is transmitted.
The complete SAMAFS data including phenotype, genotype, sequence, other T2D-related trait data utilized for Projects 1 and 2 are available. These data can readily be viewed by clicking on the substudy title shown below or in the box: "Substudies", located on the right hand side of this parent or top study page phs000847.v1.p1, titled T2D-GENES Consortium: San Antonio Mexican American Family Studies (SAMAFS).
- phs000849 T2D-GENES Project 1: San Antonio Mexican American Family Studies (SAMAFS), Substudy 1: Whole Exome Sequencing
- phs000462 T2D-GENES Project 2: San Antonio Mexican American Family Studies (SAMAFS), Substudy 2: Whole Genome Sequencing in Pedigrees
- Type: Family
- Archiver: The database of Genotypes and Phenotypes (dbGaP)