Probabilistic methods for improving efficiency of RNA secondary structure prediction across multiple sequences

被引:0
|
作者
Sharma, Gaurav [1 ,2 ]
Harmanci, A. Ozgun [1 ]
Mathews, David H. [2 ,3 ]
机构
[1] Univ Rochester, Dept Elect & Comp Engn, Hopeman 204,RC Box 270126, Rochester, NY 14627 USA
[2] Univ Rochester, Med Ctr, Dept Biostat & Comput Biol, Rochester, NY 14642 USA
[3] Univ Rochester, Med Ctr, Dept Biochem & Biophys, Rochester, NY 14642 USA
关键词
RNA secondary structure; posterior base pairing probability; hidden Markov model;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Prediction of common secondary structure across multiple RNA sequences is known to significantly increase accuracy in comparison with single-sequence based prediction methods. However, the computational requirements for joint prediction can often be daunting in comparison to single-sequence prediction. As a result, heuristic simplifications are often necessary for this joint estimation problem in order to perform computations on current hardware in reasonable times. In this paper, principled heuristics are presented for the purpose of computation reduction based on probabilistic methods. The methods presented eliminate the computations over extremely improbable alignments and structures, thereby reducing computation with little or no degradation in accuracy. Experimental results over databases of RNA families with known secondary structure validate our methods, demonstrating over a two-fold computational speed up in tests over the 5S rRNA family, without any compromise in accuracy.
引用
收藏
页码:34 / +
页数:2
相关论文
共 50 条
  • [1] The prediction of RNA secondary structure using multiple unaligned sequences
    Fang, Xiaoyong
    Luo, Zhigang
    Yuan, Bo
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2007, : 367 - +
  • [2] Secondary structure prediction for aligned RNA sequences
    Hofacker, IL
    Fekete, M
    Stadler, PF
    JOURNAL OF MOLECULAR BIOLOGY, 2002, 319 (05) : 1059 - 1066
  • [3] TurboFold: Iterative probabilistic estimation of secondary structures for multiple RNA sequences
    Arif O Harmanci
    Gaurav Sharma
    David H Mathews
    BMC Bioinformatics, 12
  • [4] TurboFold: Iterative probabilistic estimation of secondary structures for multiple RNA sequences
    Harmanci, Arif O.
    Sharma, Gaurav
    Mathews, David H.
    BMC BIOINFORMATICS, 2011, 12
  • [5] Improving RNA Secondary Structure Prediction with Structure Mapping Data
    Sloma, Michael F.
    Mathews, David H.
    COMPUTATIONAL METHODS FOR UNDERSTANDING RIBOSWITCHES, 2015, 553 : 91 - 114
  • [6] Improving the accuracy of predicting secondary structure for aligned RNA sequences
    Hamada, Michiaki
    Sato, Kengo
    Asai, Kiyoshi
    NUCLEIC ACIDS RESEARCH, 2011, 39 (02) : 393 - 402
  • [7] PARTS: Probabilistic Alignment for RNA joinT Secondary structure prediction
    Harmanci, Arif Ozgun
    Sharma, Gaurav
    Mathews, David H.
    NUCLEIC ACIDS RESEARCH, 2008, 36 (07) : 2406 - 2417
  • [8] Improving protein secondary structure prediction with aligned homologous sequences
    DiFrancesco, V
    Garnier, J
    Munson, PJ
    PROTEIN SCIENCE, 1996, 5 (01) : 106 - 113
  • [9] Prediction of RNA secondary structure including pseudoknots for long sequences
    Sato, Kengo
    Kato, Yuki
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [10] Comparison of probabilistic combination methods for protein secondary structure prediction
    Liu, Y
    Carbonell, J
    Klein-Seetharaman, J
    Gopalakrishnan, V
    BIOINFORMATICS, 2004, 20 (17) : 3099 - 3107