An Ensemble Classifiers for Improved Prediction of Native-Non-Native Protein-Protein Interaction

被引:0
|
作者
Pratiwi, Nor Kumalasari Caecar [1 ,2 ]
Tayara, Hilal [3 ]
Chong, Kil To [1 ,4 ]
机构
[1] Jeonbuk Natl Univ, Dept Elect & Informat Engn, Jeonju 54896, South Korea
[2] Telkom Univ, Dept Elect Engn, Bandung 40257, West Java, Indonesia
[3] Jeonbuk Natl Univ, Sch Int Engn & Sci, Jeonju 54896, South Korea
[4] Jeonbuk Natl Univ, Adv Elect & Informat Res Ctr, Jeonju 54896, South Korea
基金
新加坡国家研究基金会;
关键词
protein-protein interaction; machine learning; ensemble classifiers; drug discovery; computational biology; CLASSIFICATION; BINDING;
D O I
10.3390/ijms25115957
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In this study, we present an innovative approach to improve the prediction of protein-protein interactions (PPIs) through the utilization of an ensemble classifier, specifically focusing on distinguishing between native and non-native interactions. Leveraging the strengths of various base models, including random forest, gradient boosting, extreme gradient boosting, and light gradient boosting, our ensemble classifier integrates these diverse predictions using a logistic regression meta-classifier. Our model was evaluated using a comprehensive dataset generated from molecular dynamics simulations. While the gains in AUC and other metrics might seem modest, they contribute to a model that is more robust, consistent, and adaptable. To assess the effectiveness of various approaches, we compared the performance of logistic regression to four baseline models. Our results indicate that logistic regression consistently underperforms across all evaluated metrics. This suggests that it may not be well-suited to capture the complex relationships within this dataset. Tree-based models, on the other hand, appear to be more effective for problems involving molecular dynamics simulations. Extreme gradient boosting (XGBoost) and light gradient boosting (LightGBM) are optimized for performance and speed, handling datasets effectively and incorporating regularizations to avoid over-fitting. Our findings indicate that the ensemble method enhances the predictive capability of PPIs, offering a promising tool for computational biology and drug discovery by accurately identifying potential interaction sites and facilitating the understanding of complex protein functions within biological systems.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Native or Non-Native Protein-Protein Docking Models? Molecular Dynamics to the Rescue
    Jandova, Zuzana
    Vargiu, Attilio Vittorio
    Bonvin, Alexandre M. J. J.
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2021, 17 (09) : 5944 - 5954
  • [2] Discovering protein-protein interaction stabilisers by native mass spectrometry
    Bellamy-Carter, Jeddidiah
    Mohata, Manjari
    Falcicchio, Marta
    Basran, Jaswir
    Higuchi, Yusuke
    Doveston, Richard G.
    Leney, Aneika C.
    CHEMICAL SCIENCE, 2021, 12 (32) : 10724 - 10731
  • [3] Prediction of protein-protein interaction sites using an ensemble method
    Lei Deng
    Jihong Guan
    Qiwen Dong
    Shuigeng Zhou
    BMC Bioinformatics, 10
  • [4] Prediction of protein-protein interaction sites using an ensemble method
    Deng, Lei
    Guan, Jihong
    Dong, Qiwen
    Zhou, Shuigeng
    BMC BIOINFORMATICS, 2009, 10
  • [5] AutoPPI: An Ensemble of Deep Autoencoders for Protein-Protein Interaction Prediction
    Czibula, Gabriela
    Albu, Alexandra-Ioana
    Bocicor, Maria Iuliana
    Chira, Camelia
    ENTROPY, 2021, 23 (06)
  • [6] PPCM: Combing Multiple Classifiers to Improve Protein-Protein Interaction Prediction
    Yao, Jianzhuang
    Guo, Hong
    Yang, Xiaohan
    INTERNATIONAL JOURNAL OF GENOMICS, 2015, 2015
  • [7] Reciprocal Perspective for Improved Protein-Protein Interaction Prediction
    Kevin Dick
    James R. Green
    Scientific Reports, 8
  • [8] Reciprocal Perspective for Improved Protein-Protein Interaction Prediction
    Dick, Kevin
    Green, James R.
    SCIENTIFIC REPORTS, 2018, 8
  • [9] Predicting Protein-Protein Interactions based on ensemble classifiers
    Zhou, Zheng-Rong
    Song, Xiao-Feng
    Wang, Ming-Hao
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2010, 38 (06): : 1464 - 1467
  • [10] THE INTERACTION BETWEEN NATIVE-NON-NATIVE AND NON-NATIVE - NON NATIVE: SPEECH SIGNALS IN L2 ITALIAN SPANISH PHONE Learners
    De Marco, Anna
    ITALIANO LINGUADUE, 2020, 12 (02) : 92 - 109