RNAseq data of polyA+ RNA from leukocytes of 624 individuals of the SardiNIA cohort.

We sequenced the polyA+ fraction of leukocyte RNA from 624 Sardinian individuals by RNAseq (Illumina HiSeq 2000, an average of about 60M reads per sample, 51 + 51 nt). Combining the genotype information of the same individuals (already available from the ProgeNIA project) with the transcriptome data, we identified 21,183 independent expression quantitative trait loci (eQTLs) and 6,768 independent splicing quantitative trait loci (sQTLs), including 619 novel QTLs that exhibit population-specific trait associations. Using family relationships, we identified 809 segregating expression outliers and provide a new approach to studying large-effect regulatory variants and their relevance to traits. Our results provide insight into the effects of regulatory variants and their relationship to population history and individual genetic risk.

We sequenced the polyA+ fraction of leukocyte RNA from 624 Sardinian individuals by RNAseq. Prior to library preparation we added ERCC ExFold RNA Spike-In controls. An average of 60M reads per sample was generated as 51 bp paired-end reads on a HiSeq 2000 (Illumina). Sequencing reads were then aligned using STAR-2.2.0c2 to the h37d5 reference genome supplemented with the ERCC spike-in sequences. We further provided an exon-exon junction database that we generated from the ...
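
The alignment step described above can be sketched as two STAR invocations: building an index from the reference plus the spike-in sequences, then mapping each paired-end sample. This is a minimal illustration, not the pipeline actually used for this dataset. All file names (`h37d5.fa`, `ERCC92.fa`, `junctions.tab`, the FASTQ names) and the thread count are hypothetical placeholders. Setting `--sjdbOverhang` to read length minus one (50 for 51 nt reads) follows the general STAR recommendation and is an assumption here, and option availability may differ in the older STAR release named above.

```python
from typing import List

READ_LENGTH = 51  # nt per mate, as stated in the description


def star_index_cmd(genome_dir: str, fasta_files: List[str],
                   junctions_file: str, read_length: int = READ_LENGTH,
                   threads: int = 8) -> List[str]:
    """Build a STAR genomeGenerate command for genome + spike-in FASTA files."""
    return [
        "STAR", "--runMode", "genomeGenerate",
        "--genomeDir", genome_dir,
        # Reference genome and ERCC spike-in sequences are indexed together.
        "--genomeFastaFiles", *fasta_files,
        # Exon-exon junction database (tab-separated chr/start/end/strand).
        "--sjdbFileChrStartEnd", junctions_file,
        # Assumed: overhang = read length - 1.
        "--sjdbOverhang", str(read_length - 1),
        "--runThreadN", str(threads),
    ]


def star_align_cmd(genome_dir: str, fastq_1: str, fastq_2: str,
                   out_prefix: str, threads: int = 8) -> List[str]:
    """Build a STAR mapping command for one paired-end sample."""
    return [
        "STAR", "--genomeDir", genome_dir,
        "--readFilesIn", fastq_1, fastq_2,
        "--outFileNamePrefix", out_prefix,
        "--runThreadN", str(threads),
    ]


if __name__ == "__main__":
    # Hypothetical paths; print the commands instead of running them.
    print(" ".join(star_index_cmd("star_index", ["h37d5.fa", "ERCC92.fa"],
                                  "junctions.tab")))
    print(" ".join(star_align_cmd("star_index", "sample_1.fastq",
                                  "sample_2.fastq", "sample_")))
```

The functions only assemble argument lists, so they can be inspected or logged before being passed to `subprocess.run` on a machine where STAR is installed.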
Platform: Illumina HiSeq 2000. Samples: 624.

