Analysis of an optimal hidden Markov model for secondary structure prediction

被引:30
|
作者
Martin, Juliette
Gibrat, Jean-Francois
Rodolphe, Francois
机构
[1] Univ Paris 07, INSERM, Equipe Bioinformat Genom & Mol, U726, F-75251 Paris 05, France
[2] INRA, Unite Math Informat & Genome, F-78352 Jouy En Josas, France
关键词
D O I
10.1186/1472-6807-6-25
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
Background: Secondary structure prediction is a useful first step toward 3D structure prediction. A number of successful secondary structure prediction methods use neural networks, but unfortunately, neural networks are not intuitively interpretable. On the contrary, hidden Markov models are graphical interpretable models. Moreover, they have been successfully used in many bioinformatic applications. Because they offer a strong statistical background and allow model interpretation, we propose a method based on hidden Markov models. Results: Our HMM is designed without prior knowledge. It is chosen within a collection of models of increasing size, using statistical and accuracy criteria. The resulting model has 36 hidden states: 15 that model a-helices, 12 that model coil and 9 that model alpha-strands. Connections between hidden states and state emission probabilities reflect the organization of protein structures into secondary structure segments. We start by analyzing the model features and see how it offers a new vision of local structures. We then use it for secondary structure prediction. Our model appears to be very efficient on single sequences, with a Q3 score of 68.8%, more than one point above PSIPRED prediction on single sequences. A straightforward extension of the method allows the use of multiple sequence alignments, rising the Q3 score to 75.5%. Conclusion: The hidden Markov model presented here achieves valuable prediction results using only a limited number of parameters. It provides an interpretable framework for protein secondary structure architecture. Furthermore, it can be used as a tool for generating protein sequences with a given secondary structure content.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Optimal filters for a hidden Markov random field model
    Aggoun, L
    Benkherouf, L
    Benmerzouga, A
    MATHEMATICAL AND COMPUTER MODELLING, 2000, 31 (13) : 1 - 9
  • [22] Analysis on a hidden Markov channel model
    Shen, JP
    Gill, J
    GLOBECOM'99: SEAMLESS INTERCONNECTION FOR UNIVERSAL SERVICES, VOL 1-5, 1999, : 437 - 441
  • [23] Reliability Prediction Model for SOA using Hidden Markov Model
    Ahmed, Waseem
    Wu, Yong Wei
    2013 8TH CHINAGRID ANNUAL CONFERENCE (CHINAGRID), 2013, : 40 - 45
  • [24] Improvement of recognition speed protein tertiary structure prediction using hidden Markov model
    Khedr, Ahmed M.
    KUWAIT JOURNAL OF SCIENCE & ENGINEERING, 2011, 38 (2A): : 147 - 161
  • [25] A Composite Approach to Protein Tertiary Structure Prediction: Hidden Markov Model Based on Lattice
    Peyravi, Farzad
    Latif, Alimohammad
    Moshtaghioun, Seyed Mohammad
    BULLETIN OF MATHEMATICAL BIOLOGY, 2019, 81 (03) : 899 - 918
  • [26] A proportion prediction model of terminal energy structure of IPS based on hidden markov chain
    Chen Yanchao
    Lin Xiqiao
    Zhang Shuangping
    11TH CIRP CONFERENCE ON INDUSTRIAL PRODUCT-SERVICE SYSTEMS, 2019, 83 : 456 - 460
  • [27] A Composite Approach to Protein Tertiary Structure Prediction: Hidden Markov Model Based on Lattice
    Farzad Peyravi
    Alimohammad Latif
    Seyed Mohammad Moshtaghioun
    Bulletin of Mathematical Biology, 2019, 81 : 899 - 918
  • [28] A hidden Markov model approach to the structure of documentaries
    Liu, TC
    Kender, JR
    IEEE WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO LIBRARIES, PROCEEDINGS, 2000, : 111 - 115
  • [29] Hidden markov model prediction algorithm for power load
    Li, Hanju (99959828@qq.com), 2018, SHPMedia Sdn Bhd
  • [30] Vehicle trajectory prediction based on Hidden Markov Model
    Ye, Ning
    Zhang, Yingya
    Wang, Ruchuan
    Malekian, Reza
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2016, 10 (07): : 3150 - 3170