An Ensemble Classifiers for Improved Prediction of Native-Non-Native Protein-Protein Interaction

被引:0
|
作者
Pratiwi, Nor Kumalasari Caecar [1 ,2 ]
Tayara, Hilal [3 ]
Chong, Kil To [1 ,4 ]
机构
[1] Jeonbuk Natl Univ, Dept Elect & Informat Engn, Jeonju 54896, South Korea
[2] Telkom Univ, Dept Elect Engn, Bandung 40257, West Java, Indonesia
[3] Jeonbuk Natl Univ, Sch Int Engn & Sci, Jeonju 54896, South Korea
[4] Jeonbuk Natl Univ, Adv Elect & Informat Res Ctr, Jeonju 54896, South Korea
基金
新加坡国家研究基金会;
关键词
protein-protein interaction; machine learning; ensemble classifiers; drug discovery; computational biology; CLASSIFICATION; BINDING;
D O I
10.3390/ijms25115957
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In this study, we present an innovative approach to improve the prediction of protein-protein interactions (PPIs) through the utilization of an ensemble classifier, specifically focusing on distinguishing between native and non-native interactions. Leveraging the strengths of various base models, including random forest, gradient boosting, extreme gradient boosting, and light gradient boosting, our ensemble classifier integrates these diverse predictions using a logistic regression meta-classifier. Our model was evaluated using a comprehensive dataset generated from molecular dynamics simulations. While the gains in AUC and other metrics might seem modest, they contribute to a model that is more robust, consistent, and adaptable. To assess the effectiveness of various approaches, we compared the performance of logistic regression to four baseline models. Our results indicate that logistic regression consistently underperforms across all evaluated metrics. This suggests that it may not be well-suited to capture the complex relationships within this dataset. Tree-based models, on the other hand, appear to be more effective for problems involving molecular dynamics simulations. Extreme gradient boosting (XGBoost) and light gradient boosting (LightGBM) are optimized for performance and speed, handling datasets effectively and incorporating regularizations to avoid over-fitting. Our findings indicate that the ensemble method enhances the predictive capability of PPIs, offering a promising tool for computational biology and drug discovery by accurately identifying potential interaction sites and facilitating the understanding of complex protein functions within biological systems.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] An ensemble framework for clustering protein-protein interaction networks
    Asur, Sitaram
    Ucar, Duygu
    Parthasarathy, Srinivasan
    BIOINFORMATICS, 2007, 23 (13) : I29 - I40
  • [22] The Prediction of Protein-Protein Interaction Sites Based on RBF Classifier Improved by SMOTE
    Li, Hui
    Pi, Dechang
    Wang, Chishe
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [23] Prediction of Protein Structure Classes with Ensemble Classifiers
    Bao, Wenzheng
    Chen, Yuehui
    Wang, Dong
    Kong, Fanliang
    Yu, Gaoqiang
    INTELLIGENT COMPUTING IN BIOINFORMATICS, 2014, 8590 : 330 - 338
  • [24] Electrophoresis of proteins and protein-protein complexes in a native agarose gel
    Kim, R
    Yokota, H
    Kim, SH
    ANALYTICAL BIOCHEMISTRY, 2000, 282 (01) : 147 - 149
  • [25] Quantitative analysis of protein-protein interactions by native page/fluorimaging
    Wagstaff, KM
    Dias, MM
    Alvisi, G
    Jans, DA
    JOURNAL OF FLUORESCENCE, 2005, 15 (04) : 469 - 473
  • [26] Protein-Protein Interaction Prediction for Targeted Protein Degradation
    Orasch, Oliver
    Weber, Noah
    Mueller, Michael
    Amanzadi, Amir
    Gasbarri, Chiara
    Trummer, Christopher
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (13)
  • [27] COFACTOR: improved protein function prediction by combining structure, sequence and protein-protein interaction information
    Zhang, Chengxin
    Freddolino, Peter L.
    Zhang, Yang
    NUCLEIC ACIDS RESEARCH, 2017, 45 (W1) : W291 - W299
  • [28] Improved prediction of protein-protein interactions using novel negative samples, features, and an ensemble classifier
    Wei, Leyi
    Xing, Pengwei
    Zeng, Jiancang
    Chen, JinXiu
    Su, Ran
    Guo, Fei
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2017, 83 : 67 - 74
  • [29] Prediction of contact matrix for protein-protein interaction
    Gonzalez, Alvaro J.
    Liao, Li
    Wu, Cathy H.
    BIOINFORMATICS, 2013, 29 (08) : 1018 - 1025
  • [30] NOXclass: prediction of protein-protein interaction types
    Zhu, HB
    Domingues, FS
    Sommer, I
    Lengauer, T
    BMC BIOINFORMATICS, 2006, 7