EBOLApred: A machine learning-based web application for predicting cell entry inhibitors of the Ebola virus

被引:7
|
作者
Adams, Joseph [1 ,2 ]
Agyenkwa-Mawuli, Kwasi [1 ,3 ]
Agyapong, Odame [1 ]
Wilson, Michael D. [2 ,4 ]
Kwofie, Samuel K. [1 ,3 ]
机构
[1] Univ Ghana, Coll Basic & Appl Sci, Sch Engn Sci, Dept Biomed Engn, PMB LG 77,LG 77, Accra, Ghana
[2] Univ Ghana, Noguchi Mem Inst Med Res NMIMR, Coll Hlth Sci CHS, Dept Parasitol, POB LG 581,LG 581, Accra, Ghana
[3] Univ Ghana, Coll Basic & Appl Sci, West African Ctr Cell Biol Infect Pathogens, Dept Biochem Cell & Mol Biol, LG 54, Accra, Ghana
[4] Loyola Univ, Dept Med, Med Ctr, Maywood, IL 60153 USA
关键词
Ebola virus protein; Machine learning; Inhibitors; Support vector machine; Random forest; Logistic regression; MATRIX PROTEIN VP40; APPLICABILITY DOMAIN; DOCKING; DATABASE; SMOTE;
D O I
10.1016/j.compbiolchem.2022.107766
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Ebola virus disease (EVD) is a highly virulent and often lethal illness that affects humans through contact with the body fluid of infected persons. Glycoprotein and matrix protein VP40 play essential roles in the virus life cycle within the host. Whilst glycoprotein mediates the entry and fusion of the virus with the host cell membrane, VP40 is also responsible for viral particle assembly and budding. This study aimed at developing machine learning models to predict small molecules as possible anti-Ebola virus compounds capable of inhibiting the activities of GP and VP40 using Ebola virus (EBOV) cell entry inhibitors from the PubChem database as training data. Predictive models were developed using five algorithms comprising random forest (RF), support vector machine (SVM), naive Bayes (NB), k-nearest neighbor (kNN), and logistic regression (LR). The models were evaluated using a 10-fold cross-validation technique and the algorithm with the best performance was the random forest model with an accuracy of 89 %, an F1 score of 0.9, and a receiver operating characteristic curve (ROC curve) showing the area under the curve (AUC) score of 0.95. LR and SVM models also showed plausible performances with overall accuracy values of 0.84 and 0.86, respectively. The models, RF, LR, and SVM were deployed as a web server known as EBOLApred accessible via http://197.255.126.13:8000/.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] THPep: A machine learning-based approach for predicting tumor homing peptides
    Shoombuatong, Watshara
    Schaduangrat, Nalini
    Pratiwi, Reny
    Nantasenamat, Chanin
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 80 : 441 - 451
  • [42] A machine learning-based radiomic model for predicting urinary infection stone
    Zhang, Lu
    Zhang, Bin
    KIDNEY INTERNATIONAL, 2021, 100 (05) : 1142 - 1142
  • [43] Proposing a machine learning-based model for predicting nonreassuring fetal heart
    Roozbeh, Nasibeh
    Montazeri, Farideh
    Farashah, Mohammadsadegh Vahidi
    Mehrnoush, Vahid
    Darsareh, Fatemeh
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [44] A machine learning-based approach to predicting the malignant and metastasis of thyroid cancer
    Gu, Jianhua
    Xie, Rongli
    Zhao, Yanna
    Zhao, Zhifeng
    Xu, Dan
    Ding, Min
    Lin, Tingyu
    Xu, Wenjuan
    Nie, Zihuai
    Miao, Enjun
    Tan, Dan
    Zhu, Sibo
    Shen, Dongjie
    Fei, Jian
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [45] Predicting metastasis in gastric cancer patients: machine learning-based approaches
    Atefeh Talebi
    Carlos A. Celis-Morales
    Nasrin Borumandnia
    Somayeh Abbasi
    Mohamad Amin Pourhoseingholi
    Abolfazl Akbari
    Javad Yousefi
    Scientific Reports, 13
  • [46] Predicting metastasis in gastric cancer patients: machine learning-based approaches
    Talebi, Atefeh
    Celis-Morales, Carlos A.
    Borumandnia, Nasrin
    Abbasi, Somayeh
    Pourhoseingholi, Mohamad Amin
    Akbari, Abolfazl
    Yousefi, Javad
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [47] Exploration and Evaluation of Machine Learning-Based Models for Predicting Enzymatic Reactions
    Watanabe, Naoki
    Murata, Masahiro
    Ogawa, Teppei
    Vavricka, Christopher J.
    Kondo, Akihiko
    Ogino, Chiaki
    Araki, Michihiro
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2020, 60 (03) : 1833 - 1843
  • [48] A machine learning-based model for predicting survival in patients with Rectosigmoid Cancer
    Wang, Yifei
    Chen, Bingbing
    Yu, Jinhai
    PLOS ONE, 2025, 20 (03):
  • [49] A machine learning-based framework for predicting the power factor of thermoelectric materials
    Zeng, Yuxuan
    Cao, Wei
    Peng, Tan
    Hou, Yue
    Miao, Ling
    Wang, Ziyu
    Shi, Jing
    APPLIED MATERIALS TODAY, 2025, 43
  • [50] Machine Learning-Based Research for Predicting Shale Gas Well Production
    Qi, Nijun
    Li, Xizhe
    Wu, Zhenkan
    Wan, Yujin
    Wang, Nan
    Duan, Guifu
    Wang, Longyi
    Xiang, Jing
    Zhao, Yaqi
    Zhan, Hongming
    SYMMETRY-BASEL, 2024, 16 (05):