RNA secondary structural alignment with conditional random fields

被引:37
|
作者
Sato, K [1 ]
Sakakibara, Y [1 ]
机构
[1] Keio Univ, Dept Biosci & Informat, Kohoku Ku, Yokohama, Kanagawa 2238522, Japan
关键词
D O I
10.1093/bioinformatics/bti1139
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The computational identification of non-coding RNA regions on the genome is currently receiving much attention. However, it is essentially harder than gene-finding problems for protein-coding regions because non-coding RNA sequences do not have strong statistical signals. Since comparative sequence analysis is effective for non-coding RNA detection, efficient computational methods are expected for structural alignment of RNA sequences. Several methods have been proposed to accomplish the structural alignment tasks for RNA sequences, and we found that one of the most important points is to estimate an accurate score matrix for calculating structural alignments. Results: We propose a novel approach for RNA structural alignment based on conditional random fields (CRFs). Our approach has some specific features compared with previous methods in the sense that the parameters for structural alignment are estimated such that the model can most probably discriminate between correct alignments and incorrect alignments, and has the generalization ability so that a satisfiable score matrix can be obtained even with a small number of sample data without overfitting. Experimental results clearly show that the parameter estimation with CRFs can outperform all the other existing methods for structural alignments of RNA sequences. Furthermore, structural alignment search based on CRFs is more accurate for predicting non-coding RNA regions than the other scoring methods. These experimental results strongly support our discriminative method employing CRFs to estimate the score matrix parameters.
引用
收藏
页码:237 / 242
页数:6
相关论文
共 50 条
  • [1] RNA secondary structure prediction using conditional random fields model
    Subpaiboonkit, Sitthichoke
    Thammarongtham, Chinae
    Cutler, Robert W.
    Chaijaruwanich, Jeerayut
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2013, 7 (02) : 118 - 134
  • [2] Discriminative Word Alignment with Conditional Random Fields
    Blunsom, Phil
    Cohn, Trevor
    [J]. COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 65 - 72
  • [3] Conditional Alignment Random Fields for Multiple Motion Sequence Alignment
    Kim, Minyoung
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (11) : 2803 - 2809
  • [4] RNA Family Classification Using the Conditional Random Fields Model
    Subpaiboonkit, Sitthichoke
    Thammarongtham, Chinae
    Chaijaruwanich, Jeerayut
    [J]. CHIANG MAI JOURNAL OF SCIENCE, 2012, 39 (01): : 1 - 7
  • [5] An Introduction to Conditional Random Fields
    Sutton, Charles
    McCallum, Andrew
    [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 4 (04): : 267 - 373
  • [6] Hidden conditional random fields
    Quattoni, Ariadna
    Wang, Sybor
    Morency, Louis-Philippe
    Collins, Michael
    Darrell, Trevor
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (10) : 1848 - 1853
  • [7] RNAMotifScan: automatic identification of RNA structural motifs using secondary structural alignment
    Zhong, Cuncong
    Tang, Haixu
    Zhang, Shaojie
    [J]. NUCLEIC ACIDS RESEARCH, 2010, 38 (18) : e176 - e176
  • [8] Clustering RNA structural motifs in ribosomal RNAs using secondary structural alignment
    Zhong, Cuncong
    Zhang, Shaojie
    [J]. NUCLEIC ACIDS RESEARCH, 2012, 40 (03) : 1307 - 1317
  • [9] Investigating syllabic prominence with Conditional Random Fields and Latent-Dynamic Conditional Random Fields
    Cutugno, Francesco
    Leone, Enrico
    Ludusan, Bogdan
    Origlia, Antonio
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2399 - 2402
  • [10] Protein alignment based on higher order conditional random fields for template-based modeling
    Morales-Cordovilia, Juan A.
    Sanchez, Victoria
    Ratajczak, Martin
    [J]. PLOS ONE, 2018, 13 (06):