Analysis of an optimal hidden Markov model for secondary structure prediction

被引:30
|
作者
Martin, Juliette
Gibrat, Jean-Francois
Rodolphe, Francois
机构
[1] Univ Paris 07, INSERM, Equipe Bioinformat Genom & Mol, U726, F-75251 Paris 05, France
[2] INRA, Unite Math Informat & Genome, F-78352 Jouy En Josas, France
关键词
D O I
10.1186/1472-6807-6-25
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
Background: Secondary structure prediction is a useful first step toward 3D structure prediction. A number of successful secondary structure prediction methods use neural networks, but unfortunately, neural networks are not intuitively interpretable. On the contrary, hidden Markov models are graphical interpretable models. Moreover, they have been successfully used in many bioinformatic applications. Because they offer a strong statistical background and allow model interpretation, we propose a method based on hidden Markov models. Results: Our HMM is designed without prior knowledge. It is chosen within a collection of models of increasing size, using statistical and accuracy criteria. The resulting model has 36 hidden states: 15 that model a-helices, 12 that model coil and 9 that model alpha-strands. Connections between hidden states and state emission probabilities reflect the organization of protein structures into secondary structure segments. We start by analyzing the model features and see how it offers a new vision of local structures. We then use it for secondary structure prediction. Our model appears to be very efficient on single sequences, with a Q3 score of 68.8%, more than one point above PSIPRED prediction on single sequences. A straightforward extension of the method allows the use of multiple sequence alignments, rising the Q3 score to 75.5%. Conclusion: The hidden Markov model presented here achieves valuable prediction results using only a limited number of parameters. It provides an interpretable framework for protein secondary structure architecture. Furthermore, it can be used as a tool for generating protein sequences with a given secondary structure content.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Hidden Markov Model for Cardholder Purchasing Pattern Prediction
    Otieno, Okoth Jeremiah
    Kimwele, Michael
    Ogada, Kennedy
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 547 - 559
  • [32] Stock Market Prediction Using Hidden Markov Model
    Somani, Poonam
    Talele, Shreyas
    Sawant, Suraj
    2014 IEEE 7TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC), 2014, : 89 - 92
  • [33] Spectrum Occupancy Prediction Using a Hidden Markov Model
    Eltom, Hamid
    Kandeepan, Sithamparanathan
    Moran, Bill
    Evans, Robin J.
    2015 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2015,
  • [34] LEARNING AND PREDICTION BASED ON A RELATIONAL HIDDEN MARKOV MODEL
    Elfers, Carsten
    Wagner, Thomas
    ICAART 2010: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1: ARTIFICIAL INTELLIGENCE, 2010, : 211 - 216
  • [35] MOOCS DROPOUT PREDICTION BASED ON HIDDEN MARKOV MODEL
    Zhu, Huisheng
    Wang, Yan
    Chen, Shuwen
    Ni, Yiyang
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2024, 25 (05) : 879 - 889
  • [36] The prediction role of Hidden Markov Model in Intrusion Detection
    Gao, F
    Sun, J
    Wei, Z
    CCECE 2003: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, PROCEEDINGS: TOWARD A CARING AND HUMANE TECHNOLOGY, 2003, : 893 - 896
  • [37] Prediction of cutting chatter based on Hidden Markov Model
    Mei, Deqing
    Li, Xin
    Chen, Zichen
    PROGRESSES IN FRACTURE AND STRENGTH OF MATERIALS AND STRUCTURES, 1-4, 2007, 353-358 : 2712 - 2715
  • [38] An improved hidden Markov model for transmembrane topology prediction
    Kahsay, RY
    Liao, L
    Gao, G
    ICTAI 2004: 16TH IEEE INTERNATIONALCONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, : 634 - 639
  • [39] Protein secondary structure prediction for a single-sequence using hidden semi-Markov models
    Aydin, Zafer
    Altunbasak, Yucel
    Borodovsky, Mark
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [40] Protein secondary structure prediction for a single-sequence using hidden semi-Markov models
    Zafer Aydin
    Yucel Altunbasak
    Mark Borodovsky
    BMC Bioinformatics, 7