|
|
||||||||
BIOINFORMATICS |
1 Bioinformatics Center, Wadsworth Center, New York State Department of Health, Albany, New York 12208, USA
2 Center for Computational Molecular Biology and Division of Applied Mathematics, Brown University, Providence, Rhode Island 02912, USA
Reprint request to: Ye Ding, Bioinformatics Center, Wadsworth Center, New York State Department of Health, 150 New Scotland Avenue, Albany, NY 12208, USA; e-mail: yding{at}wadsworth.org; fax: (518) 402-4623; or Charles E. Lawrence, Center for Computational Molecular Biology and Division of Applied Mathematics, Brown University, 182 George Street, Providence, RI 02912, USA; e-mail: lawrence{at}dam.brown.edu; fax: (401) 863-1355.
Prediction of RNA secondary structure by free energy minimization has been the standard for over two decades. Here we describe a novel method that forsakes this paradigm for predictions based on Boltzmann-weighted structure ensemble. We introduce the notion of a centroid structure as a representative for a set of structures and describe a procedure for its identification. In comparison with the minimum free energy (MFE) structure using diverse types of structural RNAs, the centroid of the ensemble makes 30.0% fewer prediction errors as measured by the positive predictive value (PPV) with marginally improved sensitivity. The Boltzmann ensemble can be separated into a small number (3.2 on average) of clusters. Among the centroids of these clusters, the "best cluster centroid" as determined by comparison to the known structure simultaneously improves PPV by 46.5% and sensitivity by 21.7%. For 58% of the studied sequences for which the MFE structure is outside the cluster containing the best centroid, the improvements by the best centroid are 62.5% for PPV and 31.4% for sensitivity. These results suggest that the energy well containing the MFE structure under the current incomplete energy model is often different from the one for the unavailable complete model that presumably contains the unique native structure. Centroids are available on the Sfold server at http://sfold.wadsworth.org.
Keywords: secondary structure prediction; centroid; Boltzmann ensemble
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
A. R. Gruber, R. Lorenz, S. H. Bernhart, R. Neubock, and I. L. Hofacker The Vienna RNA Websuite Nucleic Acids Res., April 19, 2008; (2008) gkn188v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. L. Blount, J. M. Vaughan, W. W. Vale, and L. M. Bilezikjian A Smad-binding Element in Intron 1 Participates in Activin-dependent Regulation of the Follistatin Gene J. Biol. Chem., March 14, 2008; 283(11): 7016 - 7026. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. E. Carvalho and C. E. Lawrence Centroid estimation in discrete high-dimensional spaces with applications in biology PNAS, March 4, 2008; 105(9): 3209 - 3214. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Lunter, A. Rocco, N. Mimouni, A. Heger, A. Caldeira, and J. Hein Uncertainty in homology inferences: Assessing and improving genomic sequence alignment Genome Res., February 1, 2008; 18(2): 298 - 309. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. A. Malygin, N. M. Parakhnevitch, A. V. Ivanov, I. C. Eperon, and G. G. Karpova Human ribosomal protein S13 regulates expression of its own gene at the splicing step by a feedback mechanism Nucleic Acids Res., October 8, 2007; 35(19): 6414 - 6423. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Shao, C. Y. Chan, A. Maliyekkel, C. E. Lawrence, I. B. Roninson, and Y. Ding Effect of target secondary structure on RNAi efficiency RNA, October 1, 2007; 13(10): 1631 - 1640. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. A. Newberg, W. A. Thompson, S. Conlan, T. M. Smith, L. A. McCue, and C. E. Lawrence A phylogenetic Gibbs sampler that yields centroid solutions for cis-regulatory site prediction Bioinformatics, July 15, 2007; 23(14): 1718 - 1727. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. A. Thompson, L. A. Newberg, S. Conlan, L. A. McCue, and C. E. Lawrence The Gibbs Centroid Sampler Nucleic Acids Res., July 13, 2007; 35(suppl_2): W232 - W237. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Freyhult, V. Moulton, and P. Clote RNAbor: a web server for RNA structural neighbors Nucleic Acids Res., July 13, 2007; 35(suppl_2): W305 - W309. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Torarinsson, J. H. Havgaard, and J. Gorodkin Multiple structural alignment and clustering of RNA sequences Bioinformatics, April 15, 2007; 23(8): 926 - 932. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Shao, Y. Wu, C. Y. Chan, K. McDonough, and Y. Ding Rational design and rapid screening of antisense oligonucleotides for prokaryotic gene modulation Nucleic Acids Res., November 14, 2006; 34(19): 5660 - 5669. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. DING Statistical and Bayesian approaches to RNA secondary structure prediction. RNA, March 1, 2006; 12(3): 323 - 331. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Clote, J. Waldispuhl, B. Behzadi, and J.-M. Steyaert Energy landscape of k-point mutants of an RNA molecule Bioinformatics, November 15, 2005; 21(22): 4140 - 4147. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Y. Chan, C. E. Lawrence, and Y. Ding Structure clustering features on the Sfold Web server Bioinformatics, October 15, 2005; 21(20): 3926 - 3928. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |