Sequence analysis and rule development of predicting protein stability change upon mutation using decision tree model

被引:0
|
作者
Liang-Tsung Huang
M. Michael Gromiha
Shinn-Ying Ho
机构
[1] Feng-Chia University,Institute of Information Engineering and Computer Science
[2] Ming-Dao University,Department of Computer Science and Information Engineering
[3] National Institute of Advanced Industrial Science and Technology (AIST),Computational Biology Research Center (CBRC)
[4] National Chiao Tung University,Department of Biological Science and Technology, and Institute of Bioinformatics
来源
关键词
Bioinformatics; Data mining; Decision trees; Prediction; Protein stability;
D O I
暂无
中图分类号
学科分类号
摘要
Understanding the mechanism of the protein stability change is one of the most challenging tasks. Recently, the prediction of protein stability change affected by single point mutations has become an interesting topic in molecular biology. However, it is desirable to further acquire knowledge from large databases to provide new insights into the nature of them. This paper presents an interpretable prediction tree method (named iPTREE-2) that can accurately predict changes of protein stability upon mutations from sequence based information and analyze sequence characteristics from the viewpoint of composition and order. Therefore, iPTREE-2 based on a regression tree algorithm exhibits the ability of finding important factors and developing rules for the purpose of data mining. On a dataset of 1859 different single point mutations from thermodynamic database, ProTherm, iPTREE-2 yields a correlation coefficient of 0.70 between predicted and experimental values. In the task of data mining, detailed analysis of sequences reveals the possibility of the compositional specificity of residues in different ranges of stability change and implies the existence of certain patterns. As building rules, we found that the mutation residues in wild type and in mutant protein play an important role. The present study demonstrates that iPTREE-2 can serve the purpose of predicting protein stability change, especially when one requires more understandable knowledge.
引用
收藏
页码:879 / 890
页数:11
相关论文
共 50 条
  • [41] Predicting Change by Evaluating the Change Implementation Process in Construction Projects Using Event Tree Analysis
    Heravi, Gholamreza
    Charkhakan, Mohammad Hadi
    JOURNAL OF MANAGEMENT IN ENGINEERING, 2015, 31 (05)
  • [42] DeepDDG: Predicting the Stability Change of Protein Point Mutations Using Neural Networks
    Cao, Huali
    Wang, Jingxue
    He, Liping
    Qi, Yifei
    Zhang, John Z.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (04) : 1508 - 1514
  • [43] Development of a decision tree model for predicting the malignancy of localized gingival enlargements based on clinical characteristics
    Pawat Sripodok
    Puangwan Lapthanasupkul
    Tawepong Arayapisit
    Nakarin Kitkumthorn
    Natchalee Srimaneekarn
    Vichaya Neeranadpuree
    Watjanee Amornwatcharapong
    Supawit Hempornwisarn
    Sirikorn Amornwikaikul
    Dulyapong Rungraungrayabkul
    Scientific Reports, 14 (1)
  • [44] A Hybrid Machine Learning Framework of Gradient Boosting Decision Tree and Sequence Model for Predicting Escalation in Customer Support
    Gupta, Shubham
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020,
  • [45] PROTS-RF: A Robust Model for Predicting Mutation-Induced Protein Stability Changes
    Li, Yunqi
    Fang, Jianwen
    PLOS ONE, 2012, 7 (10):
  • [46] NeEMO: a method using residue interaction networks to improve prediction of protein stability upon mutation
    Giollo, Manuel
    Martin, Alberto J. M.
    Walsh, Ian
    Ferrari, Carlo
    Tosatto, Silvio C. E.
    BMC GENOMICS, 2014, 15
  • [47] NeEMO: a method using residue interaction networks to improve prediction of protein stability upon mutation
    Manuel Giollo
    Alberto JM Martin
    Ian Walsh
    Carlo Ferrari
    Silvio CE Tosatto
    BMC Genomics, 15
  • [48] Predicting changes in protein stability caused by mutation using sequence-and structure-based methods in a CAGI5 blind challenge
    Strokach, Alexey
    Corbi-Verge, Caries
    Kim, Philip M.
    HUMAN MUTATION, 2019, 40 (09) : 1414 - 1423
  • [49] Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks
    Li, Bian
    Yang, Yucheng T.
    Capra, John A.
    Gerstein, Mark B.
    PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (11)
  • [50] Predicting protein stability changes upon single-point mutation: a thorough comparison of the available tools on a new dataset
    Pancotti, Corrado
    Benevenuta, Silvia
    Birolo, Giovanni
    Alberini, Virginia
    Repetto, Valeria
    Sanavia, Tiziana
    Capriotti, Emidio
    Fariselli, Piero
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)