Assessing the fraction of short-distance tandem splice sites under purifying selection
- Michael Hiller1,
- Karol Szafranski2,
- Rileen Sinha2,
- Klaus Huse2,
- Swetlana Nikolajewa3,
- Philip Rosenstiel4,
- Stefan Schreiber4,
- Rolf Backofen1, and
- Matthias Platzer2
- 1Bioinformatics Group, Albert-Ludwigs-University Freiburg, 79110 Freiburg, Germany
- 2Genome Analysis, Leibniz Institute for Age Research, Fritz Lipmann Institute, 07745 Jena, Germany
- 3Department of Bioinformatics, Friedrich-Schiller-University Jena, 07743 Jena, Germany
- 4Institute of Clinical Molecular Biology, Christian-Albrechts-University Kiel, 24105 Kiel, Germany
Abstract
Many alternative splice events result in subtle mRNA changes, and most of them occur at short-distance tandem donor and acceptor sites. The splicing mechanism of such tandem sites likely involves the stochastic selection of either splice site. While tandem splice events are frequent, it is unknown how many are functionally important. Here, we use phylogenetic conservation to address this question, focusing on tandems with a distance of 3–9 nucleotides. We show that previous contradicting results on whether alternative or constitutive tandem motifs are more conserved between species can be explained by a statistical paradox (Simpson's paradox). Applying methods that take biases into account, we found higher conservation of alternative tandems in mouse, dog, and even chicken, zebrafish, and Fugu genomes. We estimated a lower bound for the number of alternative sites that are under purifying (negative) selection. While the absolute number of conserved tandem motifs decreases with the evolutionary distance, the fraction under selection increases. Interestingly, a number of frameshifting tandems are under selection, suggesting a role in regulating mRNA and protein levels via nonsense-mediated decay (NMD). An analysis of the intronic flanks shows that purifying selection also acts on the intronic sequence. We propose that stochastic splice site selection can be an advantageous mechanism that allows constant splice variant ratios in situations where a deviation in this ratio is deleterious.
Keywords
Footnotes
-
Reprint requests to: Michael Hiller, Bioinformatics Group, Albert-Ludwigs-University Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany; e-mail: hiller{at}informatik.uni-freiburg.de; fax: 49 (761) 203-7462.
-
Abbreviations: fs, fraction of confirmed and conserved tandem splice sites estimated to be under purifying selection; CMH test, Cochran–Mantel–Haenszel test.
-
Article published online ahead of print. Article and publication date are at http://www.rnajournal.org/cgi/doi/10.1261/rna.883908.
-
- Received October 19, 2007.
- Accepted January 3, 2008.
- Copyright © 2008 RNA Society











