Prediction of RNA secondary structure using generalized centroid estimators

被引:159
|
作者
Hamada, Michiaki [1 ,2 ,3 ]
Kiryu, Hisanori [2 ]
Sato, Kengo [2 ,4 ]
Mituyama, Toutai [2 ]
Asai, Kiyoshi [2 ,5 ]
机构
[1] Mizuho Informat & Res Inst Inc, Chiyoda Ku, Tokyo 1018443, Japan
[2] Natl Inst Adv Ind Sci & Technol, Computat Biol Res Ctr, Koto Ku, Tokyo 1350064, Japan
[3] Tokyo Inst Technol, Dept Computat Intelligence & Syst Sci, Midori Ku, Yokohama, Kanagawa 2268503, Japan
[4] JBIC, Koto Ku, Tokyo 1358073, Japan
[5] Univ Tokyo, Grad Sch Frontier Sci, Kashiwa, Chiba 2778562, Japan
关键词
NONCODING RNAS; SEQUENCES; ALIGNMENT;
D O I
10.1093/bioinformatics/btn601
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Recent studies have shown that the methods for predicting secondary structures of RNAs on the basis of posterior decoding of the base-pairing probabilities has an advantage with respect to prediction accuracy over the conventionally utilized minimum free energy methods. However, there is room for improvement in the objective functions presented in previous studies, which are maximized in the posterior decoding with respect to the accuracy measures for secondary structures. Results: We propose novel estimators which improve the accuracy of secondary structure prediction of RNAs. The proposed estimators maximize an objective function which is the weighted sum of the expected number of the true positives and that of the true negatives of the base pairs. The proposed estimators are also improved versions of the ones used in previous works, namely CONTRAfold for secondary structure prediction from a single RNA sequence and McCaskill-MEA for common secondary structure prediction from multiple alignments of RNA sequences. We clarify the relations between the proposed estimators and the estimators presented in previous works, and theoretically show that the previous estimators include additional unnecessary terms in the evaluation measures with respect to the accuracy. Furthermore, computational experiments confirm the theoretical analysis by indicating improvement in the empirical accuracy. The proposed estimators represent extensions of the centroid estimators proposed in Ding et al. and Carvalho and Lawrence, and are applicable to a wide variety of problems in bioinformatics.
引用
收藏
页码:465 / 473
页数:9
相关论文
共 50 条
  • [1] Generalized Centroid Estimators in Bioinformatics
    Hamada, Michiaki
    Kiryu, Hisanori
    Iwasaki, Wataru
    Asai, Kiyoshi
    [J]. PLOS ONE, 2011, 6 (02):
  • [2] RNA Secondary Structure Prediction Using Soft Computing
    Ray, Shubhra Sankar
    Pal, Sankar K.
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2013, 10 (01) : 2 - 17
  • [3] PREDICTION OF RNA SECONDARY STRUCTURE
    DELISI, C
    CROTHERS, DM
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1971, 68 (11) : 2682 - &
  • [4] Using RNase H to improve RNA secondary structure prediction
    Campagna, Ryan J.
    Bartels, Chantal B.
    Kauffmann, Andrew D.
    Childs-Disney, Jessica L.
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2009, 237
  • [5] Prediction of RNA secondary structure with pseudoknots using integer programming
    Poolsap, Unyanee
    Kato, Yuki
    Akutsu, Tatsuya
    [J]. BMC BIOINFORMATICS, 2009, 10
  • [6] RNA secondary structure prediction using highly parallel computers
    Nakaya, A
    Yamamoto, K
    Yonezawa, A
    [J]. COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1995, 11 (06): : 685 - 692
  • [7] Pseudoknots Prediction on RNA Secondary Structure Using Term Rewriting
    Chowdhury, Linkon
    Khan, Mohammad Ibrahim
    [J]. BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2015), PT I, 2015, 9043 : 577 - 588
  • [8] Prediction of RNA Secondary Structure Using Butterfly Optimization Algorithm
    Sajib Chatterjee
    Rameswar Debnath
    Sujit Biswas
    Anupam Kumar Bairagi
    [J]. Human-Centric Intelligent Systems, 2024, 4 (2): : 220 - 240
  • [9] Prediction of RNA secondary structure with pseudoknots using integer programming
    Unyanee Poolsap
    Yuki Kato
    Tatsuya Akutsu
    [J]. BMC Bioinformatics, 10
  • [10] The prediction of RNA secondary structure using multiple unaligned sequences
    Fang, Xiaoyong
    Luo, Zhigang
    Yuan, Bo
    [J]. ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2007, : 367 - +