DeepBP: Ensemble deep learning strategy for bioactive peptide prediction

被引:2
|
作者
Zhang, Ming [1 ]
Zhou, Jianren [1 ]
Wang, Xiaohua [1 ]
Wang, Xun [1 ]
Ge, Fang [2 ,3 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Comp, 666 Changhui Rd, Zhenjiang 212100, Peoples R China
[2] Nanjing Univ Posts & Telecommun, State Key Lab Organ Elect & Informat Displays, 9 Wenyuan Rd, Nanjing 210023, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Inst Adv Mat IAM, 9 Wenyuan Rd, Nanjing 210023, Peoples R China
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
关键词
ACE inhibitory peptides; Anticancer peptides; Protein language model; Gated recurrent unit; Generative adversarial capsule network; ATTENTION; NETWORKS; GRU; CNN;
D O I
10.1186/s12859-024-05974-5
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundBioactive peptides are important bioactive molecules composed of short-chain amino acids that play various crucial roles in the body, such as regulating physiological processes and promoting immune responses and antibacterial effects. Due to their significance, bioactive peptides have broad application potential in drug development, food science, and biotechnology. Among them, understanding their biological mechanisms will contribute to new ideas for drug discovery and disease treatment.ResultsThis study employs generative adversarial capsule networks (CapsuleGAN), gated recurrent units (GRU), and convolutional neural networks (CNN) as base classifiers to achieve ensemble learning through voting methods, which not only obtains high-precision prediction results on the angiotensin-converting enzyme (ACE) inhibitory peptides dataset and the anticancer peptides (ACP) dataset but also demonstrates effective model performance. For this method, we first utilized the protein language model-evolutionary scale modeling (ESM-2)-to extract relevant features for the ACE inhibitory peptides and ACP datasets. Following feature extraction, we trained three deep learning models-CapsuleGAN, GRU, and CNN-while continuously adjusting the model parameters throughout the training process. Finally, during the voting stage, different weights were assigned to the models based on their prediction accuracy, allowing full utilization of the model's performance. Experimental results show that on the ACE inhibitory peptide dataset, the balanced accuracy is 0.926, the Matthews correlation coefficient (MCC) is 0.831, and the area under the curve is 0.966; on the ACP dataset, the accuracy (ACC) is 0.779, and the MCC is 0.558. The experimental results on both datasets are superior to existing methods, demonstrating the effectiveness of the experimental approach.ConclusionIn this study, CapsuleGAN, GRU, and CNN were successfully employed as base classifiers to implement ensemble learning, which not only achieved good results in the prediction of two datasets but also surpassed existing methods. The ability to predict peptides with strong ACE inhibitory activity and ACPs more accurately and quickly is significant, and this work provides valuable insights for predicting other functional peptides. The source code and dataset for this experiment are publicly available at https://github.com/Zhou-Jianren/bioactive-peptides.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Ensemble learning method for the prediction of new bioactive molecules
    Afolabi, Lateefat Temitope
    Saeed, Faisal
    Hashim, Haslinda
    Petinrin, Olutomilayo Olayemi
    PLOS ONE, 2018, 13 (01):
  • [2] Towards an Ensemble Learning Strategy for Metagenomic Gene Prediction
    Goes, Fabiana
    Alves, Ronnie
    Correa, Leandro
    Chaparro, Cristian
    Thom, Lucineia
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2014, 2014, 8826 : 17 - 24
  • [3] Railway accident prediction strategy based on ensemble learning
    Meng, Haining
    Tong, Xinyu
    Zheng, Yi
    Xie, Guo
    Ji, Wenjiang
    Hei, Xinhong
    ACCIDENT ANALYSIS AND PREVENTION, 2022, 176
  • [4] A deep learning based ensemble learning method for epileptic seizure prediction
    Usman, Syed Muhammad
    Khalid, Shehzad
    Bashir, Sadaf
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 136
  • [5] Ensemble Strategy Based on Deep Reinforcement Learning for Portfolio Optimization
    Su, Xiao
    Zhou, Yalan
    He, Shanshan
    Li, Xiangxia
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2023, 2023, 14120 : 242 - 249
  • [6] Stacked Ensemble for Bioactive Molecule Prediction
    Petinrin, Olutomilayo Olayemi
    Saeed, Faisal
    IEEE ACCESS, 2019, 7 : 153952 - 153957
  • [7] An ensemble of deep learning algorithms for popularity prediction of flickr images
    Shadi Alijani
    Jafar Tanha
    Leyli Mohammadkhanli
    Multimedia Tools and Applications, 2022, 81 : 3253 - 3274
  • [8] Enhancing Flood Prediction using Ensemble and Deep Learning Techniques
    Nti, Isaac Kofi
    Nyarko-Boateng, Owusu
    Boateng, Samuel
    Bawah, F. U.
    Agbedanu, P. R.
    Awarayi, N. S.
    Nimbe, P.
    Adekoya, A. F.
    Weyori, B. A.
    Akoto-Adjepong, Vivian
    2021 22ND INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2021, : 662 - 670
  • [9] Ensemble Deep Learning Network Model for Dropout Prediction in MOOCs
    Kumar, Gaurav
    Singh, Amar
    Sharma, Ashok
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (02) : 187 - 196
  • [10] An ensemble of deep learning algorithms for popularity prediction of flickr images
    Alijani, Shadi
    Tanha, Jafar
    Mohammadkhanli, Leyli
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (03) : 3253 - 3274