Sequence analysis and rule development of predicting protein stability change upon mutation using decision tree model

被引:0
|
作者
Liang-Tsung Huang
M. Michael Gromiha
Shinn-Ying Ho
机构
[1] Feng-Chia University,Institute of Information Engineering and Computer Science
[2] Ming-Dao University,Department of Computer Science and Information Engineering
[3] National Institute of Advanced Industrial Science and Technology (AIST),Computational Biology Research Center (CBRC)
[4] National Chiao Tung University,Department of Biological Science and Technology, and Institute of Bioinformatics
来源
关键词
Bioinformatics; Data mining; Decision trees; Prediction; Protein stability;
D O I
暂无
中图分类号
学科分类号
摘要
Understanding the mechanism of the protein stability change is one of the most challenging tasks. Recently, the prediction of protein stability change affected by single point mutations has become an interesting topic in molecular biology. However, it is desirable to further acquire knowledge from large databases to provide new insights into the nature of them. This paper presents an interpretable prediction tree method (named iPTREE-2) that can accurately predict changes of protein stability upon mutations from sequence based information and analyze sequence characteristics from the viewpoint of composition and order. Therefore, iPTREE-2 based on a regression tree algorithm exhibits the ability of finding important factors and developing rules for the purpose of data mining. On a dataset of 1859 different single point mutations from thermodynamic database, ProTherm, iPTREE-2 yields a correlation coefficient of 0.70 between predicted and experimental values. In the task of data mining, detailed analysis of sequences reveals the possibility of the compositional specificity of residues in different ranges of stability change and implies the existence of certain patterns. As building rules, we found that the mutation residues in wild type and in mutant protein play an important role. The present study demonstrates that iPTREE-2 can serve the purpose of predicting protein stability change, especially when one requires more understandable knowledge.
引用
收藏
页码:879 / 890
页数:11
相关论文
共 50 条
  • [21] Predicting ambulance offload delay using a hybrid decision tree model
    Li, Mengyu
    Vanberkel, Peter
    Zhong, Xiang
    SOCIO-ECONOMIC PLANNING SCIENCES, 2022, 80
  • [22] First Report of Knowledge Discovery in Predicting Protein Folding Rate Change upon Single Mutation
    Lai, Lien-Fu
    Wu, Chao-Chin
    Huang, Liang-Tsung
    BIO-INSPIRED COMPUTING AND APPLICATIONS, 2012, 6840 : 624 - +
  • [23] Reliable prediction of protein thermostability change upon double mutation from amino acid sequence
    Huang, Liang-Tsung
    Gromiha, M. Michael
    BIOINFORMATICS, 2009, 25 (17) : 2181 - 2187
  • [24] DeepPPAPredMut: deep ensemble method for predicting the binding affinity change in protein-protein complexes upon mutation
    Nikam, Rahul
    Jemimah, Sherlyn
    Gromiha, M. Michael
    BIOINFORMATICS, 2024, 40 (05)
  • [25] PoPMuSiC 2.1: a web server for the estimation of protein stability changes upon mutation and sequence optimality
    Yves Dehouck
    Jean Marc Kwasigroch
    Dimitri Gilis
    Marianne Rooman
    BMC Bioinformatics, 12
  • [26] PoPMuSiC 2.1: a web server for the estimation of protein stability changes upon mutation and sequence optimality
    Dehouck, Yves
    Kwasigroch, Jean Marc
    Gilis, Dimitri
    Rooman, Marianne
    BMC BIOINFORMATICS, 2011, 12
  • [27] Predicting Preeclampsia Using Principal Component Analysis and Decision Tree Classifier
    Musa, Farida
    Prasad, Rajesh
    CURRENT WOMENS HEALTH REVIEWS, 2024, 20 (02)
  • [28] Prediction of protein stability changes upon mutation using MELD x MD
    Sierra, Alfonso
    Brini, Emiliano
    BIOPHYSICAL JOURNAL, 2024, 123 (03) : 426A - 426A
  • [29] A STUDY ON PREDICTING PATTERNS OVER THE PROTEIN SEQUENCE DATASETS USING ASSOCIATION RULE MINING
    Priya, Lakshmi G.
    Hariharan, Shanmugasundaram
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2012, 7 (05): : 563 - 573
  • [30] Discrete model based answer script evaluation using decision tree rule classifier
    Madhumitha Ramamurthy
    Ilango Krishnamurthi
    Sudhagar Ilango
    Shanthi Palaniappan
    Cluster Computing, 2019, 22 : 13499 - 13510