scRNA-seq samples from healthy and asthma donors part 1.
scRNAseq dataset containing 5 healthy donors and 4 asthmatic donors.
These are caveman, pindel, battenberg and brass calls for index patients' metastatic melanoma genomes within this study.
Plasma DNA motif analysis
The Long Life Family Study (LLFS) is an international collaborative study of the genetics and familial components of exceptional survival, longevity, and healthy aging. Families were recruited through elderly probands (generally in their 90s) who self-reported on the survival history of their parents and siblings, and on the basis of this information, families which showed clustering of exceptional survival were recruited. [Specifically, a Family Longevity Selection Score (FLOSS) ≥7 was required. The FLOSS measures the average excess Observed lifespan over that Expected based upon lifetables, while adding a bonus term for still-living individuals. Thus FLOSS is a useful tool for scoring and selecting families for inclusion in a research study of exceptional survival (Sebastiani et al., 2009, PMID: 19910380)]. Probands resided in the catchment areas of four Field Centers (Boston University, Columbia University, University of Pittsburgh, and University of Southern Denmark). Recruited family members were phenotyped through extensive in-home visits by teams of technicians who traveled all over the USA and Denmark. Blood assays were centrally processed at a Laboratory Core (University of Minnesota) and protocols were standardized, monitored and coordinated through a Data Management Coordinating Center (Washington University). We examined and extensively phenotyped in all major domains of healthy aging, 4,953 individuals in 539 families through comprehensive in-home visits. Of these, 4,815 gave dbGaP sharing permission and had sufficient quantity/quality of DNA for GWAS genotyping. This large collection of families, selected on the basis of clustering for exceptional survival, is a unique resource for the study of human longevity and healthy aging. We estimate that less than 1% of the Framingham Heart Study (FHS) families (a roughly random population family sample) would meet the minimal entrance criteria for exceptional survival required in the LLFS (Sebastiani et al., 2009, PMID: 19910380). Thus, the least exceptional LLFS families show more clustering for exceptional longevity than 99% of the FHS families. Although the LLFS pedigrees were selected on the basis of longevity per se in the upper generation (and the generation above that), the children's generation have significantly lower rates of many major diseases and have better healthy aging profiles for many disease phenotypes (Newman et al., 2011, PMID: 21258136). The participants had their first in-person visit between 2006 and 2009. After that visit, they were contacted annually by telephone to update vital status, medical history, and general health. Between 2014 and 2017, willing participants completed a second in-person visit. The second visit followed the same protocols and centralized training as the first visit. During the second visit, a portable carotid ultrasound exam was added. Again, participants were continuously contacted annually for telephone follow-up during the period of the second in-person visit and after that. Annual telephone follow-ups currently ongoing, and plans for a third in-person visit are in progress.
The Africa America Diabetes Mellitus (AADM) study is a genetic epidemiological study of type 2 diabetes in Sub-Saharan Africa. Study participants were enrolled through university medical centers in Nigeria, Ghana, and Kenya. Ethical approval for the study was obtained from the Institutional Review Board (IRB) of each participating institution. All subjects provided written informed consent for the collection of samples and subsequent analysis. The case definition of type 2 diabetes was based on the American Diabetes Association (ADA) criteria. After providing informed consent, participants underwent the same enrollment procedures, which included collection of demographic information, medical history, clinical examination and a blood draw. Genome-wide SNP genotyping was done on either the Axiom™ PanAFR SNP array (n=1,808) or the Multi-Ethnic Global Array (MEGA) (n=3,423). After appropriate quality control, in silico imputation was done using the African Genome Resources Haplotype Reference Panel (at the Sanger Imputation Service). Imputed genotypes were filtered for variants with minor allele frequency (MAF)≥ 0.01 and information score (info) ≥ 0.3 for genetic association analysis. Genome-wide association analysis between type 2 diabetes and the imputed genotype dosages was done using a generalized linear mixed model, which adjusted for age, gender, body mass index, the genetic relatedness matrix and the first three principal components (PCs) of the genotypes.Metabolomics profiling of plasma samples of type 2 diabetes (T2D) cases and controls in Nigerians (West Africa) was done in the AADM Study. Plasma metabolites were measured in a total of 580 individuals (N=310 for the discovery phase and N=270 for the replication stage) using the global/untargeted approach on the Metabolon platform and following the manufacturer's standard operation protocols. The analytic methods are described in detail in Doumatey et al. [Genome Med 2024]. The measured metabolites level represented by peak areas are relative values. The peak area data were batch-normalized to remove the instrument batch effects (batch variability) and the batch-normalized data correspond to the median-scaled raw data. For each identified metabolite, the minimum value across all batches in the batch-normalized was imputed for the missing values. The batch-normalized and imputed data are natural log-transformed and consist of 1116 metabolites for the discovery cohort and 1071 metabolites for the replication. Welch's two-sample t-test on the log-transformed data was used to identify metabolites differentially expressed between T2D cases and controls . All other statistical analyses conducted on both the replication and discovery cohorts used the log-transformed data. To merge the discovery and replication, the same quality control samples (bridge samples) were run with each batch of the experimental samples in both cohorts and used to correct for additional variability and uniformize the procedures. The resulting merged data is the QC-normalized and imputed data that contains only metabolites that were common to both cohorts and successfully bridged for all batches (n= 891 metabolites).
SOMAscan plasma proteome datasets generated from participants consuming the fiber blend snack prototype (study 2)
SOMAscan plasma proteome datasets generated from participants consuming the pea fibre snack prototype (study 1)