DeepBP: Ensemble deep learning strategy for bioactive peptide prediction

Cited by: 2

Authors:

Zhang, Ming [1]

Zhou, Jianren [1]

Wang, Xiaohua [1]

Wang, Xun [1]

Ge, Fang [2,3]

Affiliations:

[1] Jiangsu Univ Sci & Technol, Sch Comp, 666 Changhui Rd, Zhenjiang 212100, Peoples R China

[2] Nanjing Univ Posts & Telecommun, State Key Lab Organ Elect & Informat Displays, 9 Wenyuan Rd, Nanjing 210023, Peoples R China

[3] Nanjing Univ Posts & Telecommun, Inst Adv Mat IAM, 9 Wenyuan Rd, Nanjing 210023, Peoples R China

Source:

BMC BIOINFORMATICS | 2024 / Volume 25 / Issue 01

Keywords:

ACE inhibitory peptides; Anticancer peptides; Protein language model; Gated recurrent unit; Generative adversarial capsule network; ATTENTION; NETWORKS; GRU; CNN;

DOI:

10.1186/s12859-024-05974-5

Chinese Library Classification (CLC) number:

Q5 [Biochemistry];

Discipline classification code:

071010 ; 081704 ;

Abstract:

Background: Bioactive peptides are short chains of amino acids that play crucial roles in the body, such as regulating physiological processes, promoting immune responses, and exerting antibacterial effects. Because of these roles, they have broad application potential in drug development, food science, and biotechnology. Understanding their biological mechanisms can contribute new ideas for drug discovery and disease treatment.

Results: This study uses generative adversarial capsule networks (CapsuleGAN), gated recurrent units (GRU), and convolutional neural networks (CNN) as base classifiers and combines them by weighted voting. The resulting ensemble achieves high prediction accuracy on both an angiotensin-converting enzyme (ACE) inhibitory peptide dataset and an anticancer peptide (ACP) dataset. We first used the protein language model Evolutionary Scale Modeling (ESM-2) to extract features for the ACE inhibitory peptide and ACP datasets. We then trained the three deep learning models (CapsuleGAN, GRU, and CNN), adjusting the model parameters throughout training. Finally, in the voting stage, each model was assigned a weight based on its prediction accuracy so that the ensemble makes full use of each model's performance. On the ACE inhibitory peptide dataset, the balanced accuracy is 0.926, the Matthews correlation coefficient (MCC) is 0.831, and the area under the curve is 0.966; on the ACP dataset, the accuracy (ACC) is 0.779 and the MCC is 0.558. The results on both datasets are superior to those of existing methods, demonstrating the effectiveness of the approach.

Conclusion: CapsuleGAN, GRU, and CNN were successfully employed as base classifiers for ensemble learning, achieving good results on both datasets and surpassing existing methods. The ability to predict peptides with strong ACE inhibitory activity and ACPs more accurately and quickly is significant, and this work provides valuable insights for predicting other functional peptides. The source code and dataset for this experiment are publicly available at https://github.com/Zhou-Jianren/bioactive-peptides.
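
The accuracy-weighted voting step and the MCC metric mentioned in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract states only that weights were assigned based on prediction accuracy, so the normalization of accuracies to sum to one, the use of soft voting over positive-class probabilities, the 0.5 decision threshold, and all function names here are assumptions.

```python
from math import sqrt
from typing import Dict, List


def accuracy_weights(val_accuracy: Dict[str, float]) -> Dict[str, float]:
    """Turn per-model accuracies into weights that sum to 1 (assumed scheme)."""
    total = sum(val_accuracy.values())
    return {name: acc / total for name, acc in val_accuracy.items()}


def weighted_vote(probs: Dict[str, List[float]],
                  weights: Dict[str, float],
                  threshold: float = 0.5) -> List[int]:
    """Combine per-model positive-class probabilities into 0/1 labels."""
    n_samples = len(next(iter(probs.values())))
    labels = []
    for i in range(n_samples):
        # Weighted average of the base classifiers' probabilities for sample i.
        score = sum(weights[name] * probs[name][i] for name in probs)
        labels.append(1 if score >= threshold else 0)
    return labels


def mcc(y_true: List[int], y_pred: List[int]) -> float:
    """Matthews correlation coefficient for binary labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when any marginal count is zero.
    return (tp * tn - fp * fn) / denom if denom else 0.0
```

In use, `probs` would hold the outputs of the three base classifiers (for example under the hypothetical keys "capsulegan", "gru", and "cnn"), and `accuracy_weights` would be computed from accuracies measured on held-out data.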


Pages: 19
