RNA secondary structural alignment with conditional random fields

Cited by: 37
Authors
Sato, K [1]
Sakakibara, Y [1]
Affiliation
[1] Keio Univ, Dept Biosci & Informat, Kohoku Ku, Yokohama, Kanagawa 2238522, Japan
DOI
10.1093/bioinformatics/bti1139
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification code
071010; 081704
Abstract
Motivation: The computational identification of non-coding RNA regions in the genome is currently receiving much attention. However, it is inherently harder than gene finding for protein-coding regions, because non-coding RNA sequences do not carry strong statistical signals. Since comparative sequence analysis is effective for non-coding RNA detection, efficient computational methods for structural alignment of RNA sequences are needed. Several methods have been proposed for this task, and we found that one of the most important points is estimating an accurate score matrix for calculating structural alignments.
Results: We propose a novel approach to RNA structural alignment based on conditional random fields (CRFs). Unlike previous methods, our approach estimates the parameters for structural alignment so that the model discriminates as well as possible between correct and incorrect alignments, and it generalizes well enough that a satisfactory score matrix can be obtained from a small number of samples without overfitting. Experimental results show that parameter estimation with CRFs can outperform all other existing methods for structural alignment of RNA sequences. Furthermore, structural alignment search based on CRFs predicts non-coding RNA regions more accurately than the other scoring methods. These experimental results strongly support our discriminative method of estimating the score matrix parameters with CRFs.
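The discriminative estimation described in the abstract can be illustrated with a toy log-linear model. This is a minimal sketch under loud assumptions, not the authors' implementation. The candidate alignments are enumerated by hand rather than summed over by dynamic programming. The three feature counts (matched base pairs, unpaired matches, gaps) are hypothetical. The L2 penalty is just one common way to limit overfitting on small samples. The weights are fitted by gradient ascent on the conditional log-likelihood of the correct alignment.

```python
import math

def log_sum_exp(values):
    # Numerically stable log of a sum of exponentials.
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))

def score(weights, features):
    # Linear alignment score: dot product of weights and feature counts.
    return sum(w * f for w, f in zip(weights, features))

def conditional_log_likelihood(weights, candidates, correct_index, l2=0.1):
    # log P(correct | inputs) under a log-linear model, minus an L2 penalty
    # (the penalty is an assumption of this sketch).
    scores = [score(weights, f) for f in candidates]
    ll = scores[correct_index] - log_sum_exp(scores)
    return ll - 0.5 * l2 * sum(w * w for w in weights)

def gradient(weights, candidates, correct_index, l2=0.1):
    # Observed feature counts minus expected counts under the model,
    # minus the derivative of the L2 penalty.
    scores = [score(weights, f) for f in candidates]
    log_z = log_sum_exp(scores)
    probs = [math.exp(s - log_z) for s in scores]
    grad = []
    for k in range(len(weights)):
        expected = sum(p * f[k] for p, f in zip(probs, candidates))
        grad.append(candidates[correct_index][k] - expected - l2 * weights[k])
    return grad

def train(candidates, correct_index, steps=200, lr=0.1, l2=0.1):
    # Plain gradient ascent from zero weights.
    weights = [0.0] * len(candidates[0])
    for _ in range(steps):
        g = gradient(weights, candidates, correct_index, l2)
        weights = [w + lr * gk for w, gk in zip(weights, g)]
    return weights

# Hypothetical feature counts for three candidate alignments of one pair:
# (matched base pairs, unpaired matches, gaps). Candidate 0 is "correct".
candidates = [(4, 6, 1), (2, 7, 2), (0, 8, 3)]
weights = train(candidates, correct_index=0)
```

After training, the learned weights should score the correct candidate above the incorrect ones, which is the sense in which the score matrix is estimated discriminatively. A real structural alignment model would replace the enumerated candidate list with sums over all alignments.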
Pages: 237-242
Page count: 6