A dynamic Bayesian network approach to protein secondary structure prediction

被引:45
|
作者
Yao, Xin-Qiu [1 ,2 ,3 ]
Zhu, Huaiqiu [1 ,2 ,3 ]
She, Zhen-Su [1 ,2 ,3 ,4 ]
机构
[1] Peking Univ, State Key Lab Turbulence & Complex Syst, Beijing 100871, Peoples R China
[2] Peking Univ, Dept Biomed Engn, Beijing 100871, Peoples R China
[3] Peking Univ, Ctr Theoret Biol, Beijing 100871, Peoples R China
[4] Univ Calif Los Angeles, Dept Math, Los Angeles, CA 90095 USA
基金
中国国家自然科学基金;
关键词
D O I
10.1186/1471-2105-9-49
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Protein secondary structure prediction method based on probabilistic models such as hidden Markov model (HMM) appeals to many because it provides meaningful information relevant to sequence-structure relationship. However, at present, the prediction accuracy of pure HMM-type methods is much lower than that of machine learning-based methods such as neural networks (NN) or support vector machines (SVM). Results: In this paper, we report a new method of probabilistic nature for protein secondary structure prediction, based on dynamic Bayesian networks (DBN). The new method models the PSI-BLAST profile of a protein sequence using a multivariate Gaussian distribution, and simultaneously takes into account the dependency between the profile and secondary structure and the dependency between profiles of neighboring residues. In addition, a segment length distribution is introduced for each secondary structure state. Tests show that the DBN method has made a significant improvement in the accuracy compared to other pure HMM-type methods. Further improvement is achieved by combining the DBN with an NN, a method called DBNN, which shows better Q(3) accuracy than many popular methods and is competitive to the current state-of-the-arts. The most interesting feature of DBN/DBNN is that a significant improvement in the prediction accuracy is achieved when combined with other methods by a simple consensus. Conclusion: The DBN method using a Gaussian distribution for the PSI-BLAST profile and a high-ordered dependency between profiles of neighboring residues produces significantly better prediction accuracy than other HMM-type probabilistic methods. Owing to their different nature, the DBN and NN combine to form a more accurate method DBNN. Future improvement may be achieved by combining DBNN with a method of SVM type.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] A dynamic Bayesian network approach to protein secondary structure prediction
    Xin-Qiu Yao
    Huaiqiu Zhu
    Zhen-Su She
    [J]. BMC Bioinformatics, 9
  • [2] An Approach for RNA Secondary Structure Prediction Based on Bayesian Network
    Wu, Tianhua
    Deng, Zhidong
    Song, Dandan
    [J]. CIBCB: 2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2009, : 24 - 30
  • [3] Bayesian network multi-classifiers for protein secondary structure prediction
    Robes, V
    Larrañaga, P
    Peña, JM
    Menasalvas, E
    Pérez, MS
    Herves, V
    Wasilewska, A
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2004, 31 (02) : 117 - 136
  • [4] Learning sparse models for a dynamic Bayesian network classifier of protein secondary structure
    Zafer Aydin
    Ajit Singh
    Jeff Bilmes
    William S Noble
    [J]. BMC Bioinformatics, 12
  • [5] Learning sparse models for a dynamic Bayesian network classifier of protein secondary structure
    Aydin, Zafer
    Singh, Ajit
    Bilmes, Jeff
    Noble, William S.
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [6] Knowledge base and neural network approach for protein secondary structure prediction
    Patel, Maulika S.
    Mazumdar, Himanshu S.
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2014, 361 : 182 - 189
  • [7] An algorithm based on improved Bayesian inference network model for prediction protein secondary structure
    Yang, GH
    Zhou, CG
    Hu, CHQ
    Yu, ZZ
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 1500 - 1504
  • [8] An architecture combining Bayesian segmentation and neural network ensembles for protein secondary structure prediction
    Bidargaddi, NP
    Chetty, M
    Kamruzzaman, J
    [J]. Proceedings of the 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, 2005, : 498 - 505
  • [9] Bayesian neural networks for prediction of protein secondary structure
    Shao, JL
    Xu, D
    Wang, LZ
    Wang, YF
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 544 - 551
  • [10] Protein secondary structure prediction with Bayesian learning method
    Wang, PL
    Zhang, D
    [J]. 14TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, : 252 - 257