Need Help?


In the UK10K project we propose a series of complementary genetic approaches to find new low frequency/rare variants contributing to disease phenotypes. These will be based on obtaining the genome wide sequence of 4000 samples from the TwinsUK and ALSPAC cohorts (at 6x sequence coverage), and the exome sequence (protein coding regions and related conserved sequence) of 6000 samples selected for extreme phenotypes. Our studies will focus primarily on cardiovascular-related quantitative traits, obesity and related metabolic traits, neurodevelopmental disorders and a limited number of extreme clinical phenotypes that will provide proof-of-concept for future familial trait sequencing. We will analyse directly quantitative traits in the cohorts and the selected traits in the extreme samples, and also use imputation down to 0.1% allele frequency to extend the analyses to further sample sets with genome wide genotype data. In each case we will investigate indels and larger structural variants as well as SNPs, and use statistical methods that combine rare variants in a locus or pathway as well as single-variant approaches. The Genetics and Psychosis (GAP) set consists of samples from subjects with schizophrenia, ascertained as a new-onset sample. This set is of UK origin, with data on cognition, brain imaging and other endophenotypes. The Maudsley twin series consists of probands ascertained from the Maudsley Twin Register, defined as patients of multiple birth who had suffered psychotic symptoms. This set is of UK origin, with data on cognition, brain imaging and other endophenotypes, with DNA available from an MZ or DZ affected or unaffected co-twin. The Maudsley family study (MFS) consists of over 250 families who have a history of schizophrenia or bipolar disorder Within the Maudsley Family Study, biological markers of psychosis include neuropsychological tests, Evoked Response Potentials Tests (ERPs), MRI scans, dermatoglyphics and eye tracking. Early risk factors for psychosis and clinical symptoms are also investigated. This set is of UK origin, with DNA available from both affected and unaffected relatives in many of the probands. For further information on this study please contact David Collier (

Click on a Dataset ID in the table below to learn more, and to find out who to contact about access to these data

Dataset ID Description Technology Samples
EGAD00001000235 Illumina HiSeq 2000 170
EGAD00001000321 Illumina HiSeq 2000 158
EGAD00001000442 Illumina HiSeq 2000 172
Publications Citations
The UK10K project identifies rare variants in health and disease.
Nature 526: 2015 82-90
Assessing the role of rare genetic variants in drug-resistant, non-lesional focal epilepsy.
Ann Clin Transl Neurol 8: 2021 1376-1387