A fast and robust iterative algorithm for prediction of RNA pseudoknotted secondary structures

Cited by: 28
Authors
Jabbari, Hosna [1 ]
Condon, Anne [1 ]
Affiliations
[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC V5Z 1M9, Canada
Source
BMC BIOINFORMATICS | 2014 / Vol. 15
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC);
Keywords
RNA; Secondary structure prediction; Pseudoknot; Hierarchical folding; Minimum free energy; DYNAMIC-PROGRAMMING ALGORITHM; PARTITION-FUNCTION; TRANSLATION; SERVER;
DOI
10.1186/1471-2105-15-147
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Improving accuracy and efficiency of computational methods that predict pseudoknotted RNA secondary structures is an ongoing challenge. Existing methods based on free energy minimization tend to be very slow and are limited in the types of pseudoknots that they can predict. Incorporating known structural information can improve prediction accuracy; however, there are not many methods for prediction of pseudoknotted structures that can incorporate structural information as input. There is even less understanding of the relative robustness of these methods with respect to partial information. Results: We present a new method, Iterative HFold, for pseudoknotted RNA secondary structure prediction. Iterative HFold takes as input a pseudoknot-free structure, and produces a possibly pseudoknotted structure whose energy is at least as low as that of any (density-2) pseudoknotted structure containing the input structure. Iterative HFold leverages strengths of earlier methods, namely the fast running time of HFold, a method that is based on the hierarchical folding hypothesis, and the energy parameters of HotKnots V2.0. Our experimental evaluation on a large data set shows that Iterative HFold is robust with respect to partial information, with average accuracy on pseudoknotted structures steadily increasing from roughly 54% to 79% as the user provides up to 40% of the input structure. Iterative HFold is much faster than HotKnots V2.0, while having comparable accuracy. Iterative HFold also has significantly better accuracy than IPknot on our HK-PK and IP-pk168 data sets. Conclusions: Iterative HFold is a robust method for prediction of pseudoknotted RNA secondary structures, whose accuracy with more than 5% information about true pseudoknot-free structures is better than that of IPknot, and with about 35% information about true pseudoknot-free structures compares well with that of HotKnots V2.0 while being significantly faster. 
Iterative HFold and all data used in this work are freely available at http://www.cs.ubc.ca/~hjabbari/software.php.
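The abstract contrasts a pseudoknot-free input structure with a possibly pseudoknotted output. The sketch below only illustrates that distinction and is not part of Iterative HFold. It assumes structures are written in dot-bracket notation with `()` and `[]` as two bracket families, and it treats a structure as pseudoknotted when two base pairs (i, j) and (k, l) cross, that is, i < k < j < l. The function names `parse_pairs` and `is_pseudoknotted` are hypothetical.

```python
from typing import List, Tuple

Pair = Tuple[int, int]


def parse_pairs(structure: str) -> List[Pair]:
    """Parse dot-bracket notation into sorted (open, close) index pairs.

    Assumption: '()' and '[]' are independent bracket families and '.'
    marks an unpaired base. Any other character is rejected.
    """
    closer_to_opener = {")": "(", "]": "["}
    stacks = {"(": [], "[": []}
    pairs: List[Pair] = []
    for pos, ch in enumerate(structure):
        if ch in stacks:
            stacks[ch].append(pos)
        elif ch in closer_to_opener:
            stack = stacks[closer_to_opener[ch]]
            if not stack:
                raise ValueError(f"unmatched {ch!r} at position {pos}")
            pairs.append((stack.pop(), pos))
        elif ch != ".":
            raise ValueError(f"unexpected character {ch!r} at position {pos}")
    if any(stacks.values()):
        raise ValueError("unmatched opening bracket")
    return sorted(pairs)


def is_pseudoknotted(pairs: List[Pair]) -> bool:
    """Return True if any two base pairs cross (i < k < j < l)."""
    return any(i < k < j < l for (i, j) in pairs for (k, l) in pairs)


if __name__ == "__main__":
    nested = parse_pairs("((..))")
    crossing = parse_pairs("((..[[..))..]]")
    print(is_pseudoknotted(nested))
    print(is_pseudoknotted(crossing))
```

The quadratic pair comparison is chosen for readability; it is adequate for short sequences but is not how a folding algorithm would handle this check.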
Pages: 17
Related papers
50 in total
  • [31] A KINETIC APPROACH TO THE PREDICTION OF RNA SECONDARY STRUCTURES
    MIRONOV, AA
    DYAKONOVA, LP
    KISTER, AE
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1985, 2 (05): : 953 - 962
  • [32] Prediction of sequentially optimal RNA secondary structures
    Breton, N
    Jacob, C
    Daegelen, P
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1997, 14 (06): : 727 - 740
  • [33] Parsing nucleic acid pseudoknotted secondary structure: Algorithm and applications
    Rastegari, Baharak
    Condon, Anne
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2007, 14 (01) : 16 - 32
  • [34] A Matrix Algorithm for RNA Secondary Structure Prediction
    Krishnan, S. P. T.
    Khurshid, Mushfique Junayed
    Veeravalli, Bharadwaj
    PATTERN RECOGNITION IN BIOINFORMATICS, 2010, 6282 : 337 - +
  • [35] A folding algorithm for extended RNA secondary structures
    Siederdissen, Christian Hoener Zu
    Bernhart, Stephan H.
    Stadler, Peter F.
    Hofacker, Ivo L.
    BIOINFORMATICS, 2011, 27 (13) : I129 - I136
  • [36] RNA Secondary Structure Prediction with Coincidence Algorithm
    Srikamdee, Supawadee
    Wattanapornprom, Warin
    Chongstitvatana, Prabhas
    2016 16TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2016, : 686 - 690
  • [37] AN ALGORITHM FOR COMPARING MULTIPLE RNA SECONDARY STRUCTURES
    SHAPIRO, BA
    COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1988, 4 (03): : 387 - 393
  • [38] A fast genetic algorithm for RNA secondary structure analysis
    Titov, II
    Vorobiev, DG
    Ivanisenko, VA
    Kolchanov, NA
    RUSSIAN CHEMICAL BULLETIN, 2002, 51 (07) : 1135 - 1144
  • [39] A fast genetic algorithm for RNA secondary structure analysis
    I. I. Titov
    D. G. Vorobiev
    V. A. Ivanisenko
    N. A. Kolchanov
    Russian Chemical Bulletin, 2002, 51 : 1135 - 1144
  • [40] Improved Approximation Algorithm for the Maximum Base Pair Stackings Problem in RNA Secondary Structures Prediction
    Zhou, Aizhong
    Jiang, Haitao
    Guo, Jiong
    Feng, Haodi
    Liu, Nan
    Zhu, Binhai
    COMPUTING AND COMBINATORICS, COCOON 2017, 2017, 10392 : 575 - 587