Need Help?

Insights into non-crossover recombination from long-read sperm sequencing

Meiotic recombination is a fundamental process that generates genetic diversity by creating new combinations of existing alleles. While crossovers in humans are well characterized, the more frequent non-crossovers that lead to gene conversion remain challenging to study. Here we show that single high-fidelity long sequencing reads from sperm can capture both crossovers and non-crossovers, allowing effectively arbitrary sample sizes for analysis from one male. Using fifteen sperm samples from thirteen donors we demonstrate variation between donors for the different types of recombination, even when they share the same PRDM9 genotype. Non-crossovers are less consistent with the genetic map and exhibit reduced association with meiotic double-strand break (DSB) sites identified by DMC1 binding. Where non-crossover gene conversions are close to PRDM9 sites they tend to occur upstream. We further provide evidence for two distinct non-crossover processes. One gives rise to the vast majority of non-crossovers with mean conversion tract length under 50bp, while ~1% of non-crossovers have much longer mean tract length. Applying the same analysis pipeline to public data from twelve blood samples, we also see non-crossover gene conversions, but very few crossover events. These non-meiotic events, not correlated with the genetic map, increase with age, consistent with double-strand break repair throughout life. We suggest that many of the non-crossover events seen in sperm also derive from non-meiotic repair prior to spermatogenesis.

Request Access

pfizer-exomes-policy

”Pfizer-funded TwinsUK exome sequence data produced at BGI”

Studies are experimental investigations of a particular phenomenon, e.g., case-control studies on a particular trait or cancer research projects reporting matching cancer normal genomes from patients.

Study ID Study Title Study Type
EGAS00001004958 Cancer Genomics

This table displays only public information pertaining to the files in the dataset. If you wish to access this dataset, please submit a request. If you already have access to these data files, please consult the download documentation.

ID File Type Size Quality Report
Located in
EGAF00008867552 bam 126.7 GB
EGAF00008867553 bam 103.3 GB
EGAF00008867554 bam 113.1 GB
EGAF00008867555 bam 122.2 GB
EGAF00008867556 bam 113.7 GB
EGAF00008869856 bam 77.6 GB
EGAF00008869857 bam 123.9 GB
EGAF00008869858 bam 118.6 GB
EGAF00008869859 bam 128.0 GB
EGAF00008869860 bam 17.2 GB
EGAF00008869861 bam 94.2 GB
EGAF00008869862 bam 79.5 GB
EGAF00008869863 bam 57.5 GB
EGAF00008869864 bam 53.2 GB
EGAF00008869865 bam 101.7 GB
EGAF00008869866 bam 106.6 GB
EGAF00008869867 bam 90.8 GB
EGAF00008869868 bam 82.9 GB
EGAF00008869869 bam 89.8 GB
EGAF00008869870 bam 74.2 GB
EGAF00008869871 bam 117.8 GB
EGAF00008869872 bam 67.8 GB
EGAF00008869873 bam 110.2 GB
EGAF00008869874 bam 125.0 GB
EGAF00008869875 bam 222.0 GB
EGAF00008869876 bam 200.0 GB
EGAF00008869877 bam 207.8 GB
EGAF00008869878 bam 260.9 GB
EGAF00008869879 bam 145.3 GB
EGAF00008869880 bam 254.7 GB
EGAF00008869881 bam 304.5 GB
EGAF00008869882 bam 300.8 GB
EGAF00008869883 bam 13.5 GB
EGAF00008869884 bam 223.5 GB
EGAF00008869885 bam 203.5 GB
EGAF00008869886 bam 317.4 GB
EGAF00008869887 bam 274.5 GB
EGAF00008870400 bam 312.6 GB
EGAF00008870401 bam 296.6 GB
EGAF00008870402 bam 311.8 GB
EGAF00008870403 bam 320.4 GB
EGAF00008870404 bam 338.0 GB
EGAF00008870405 bam 360.0 GB
EGAF00008870406 bam 302.5 GB
EGAF00008870407 bam 307.9 GB
EGAF00008870408 bam 340.5 GB
EGAF00008870409 bam 339.2 GB
EGAF00008870410 bam 390.5 GB
EGAF00008870981 bam 351.4 GB
EGAF00008870982 bam 351.1 GB
EGAF00008872730 bam 222.3 GB
EGAF00008872731 bam 244.3 GB
EGAF00008872732 bam 246.9 GB
EGAF00008872733 bam 108.4 GB
EGAF00008888291 bam 56.7 GB
EGAF00008888292 bam 296.4 GB
EGAF00008945914 bam 79.8 GB
EGAF00008945915 bam 83.1 GB
EGAF00008945916 bam 82.7 GB
EGAF00008947865 bam 119.4 GB
60 Files (11.1 TB)