EBOLApred: A machine learning-based web application for predicting cell entry inhibitors of the Ebola virus

被引:7
|
作者
Adams, Joseph [1 ,2 ]
Agyenkwa-Mawuli, Kwasi [1 ,3 ]
Agyapong, Odame [1 ]
Wilson, Michael D. [2 ,4 ]
Kwofie, Samuel K. [1 ,3 ]
机构
[1] Univ Ghana, Coll Basic & Appl Sci, Sch Engn Sci, Dept Biomed Engn, PMB LG 77,LG 77, Accra, Ghana
[2] Univ Ghana, Noguchi Mem Inst Med Res NMIMR, Coll Hlth Sci CHS, Dept Parasitol, POB LG 581,LG 581, Accra, Ghana
[3] Univ Ghana, Coll Basic & Appl Sci, West African Ctr Cell Biol Infect Pathogens, Dept Biochem Cell & Mol Biol, LG 54, Accra, Ghana
[4] Loyola Univ, Dept Med, Med Ctr, Maywood, IL 60153 USA
关键词
Ebola virus protein; Machine learning; Inhibitors; Support vector machine; Random forest; Logistic regression; MATRIX PROTEIN VP40; APPLICABILITY DOMAIN; DOCKING; DATABASE; SMOTE;
D O I
10.1016/j.compbiolchem.2022.107766
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Ebola virus disease (EVD) is a highly virulent and often lethal illness that affects humans through contact with the body fluid of infected persons. Glycoprotein and matrix protein VP40 play essential roles in the virus life cycle within the host. Whilst glycoprotein mediates the entry and fusion of the virus with the host cell membrane, VP40 is also responsible for viral particle assembly and budding. This study aimed at developing machine learning models to predict small molecules as possible anti-Ebola virus compounds capable of inhibiting the activities of GP and VP40 using Ebola virus (EBOV) cell entry inhibitors from the PubChem database as training data. Predictive models were developed using five algorithms comprising random forest (RF), support vector machine (SVM), naive Bayes (NB), k-nearest neighbor (kNN), and logistic regression (LR). The models were evaluated using a 10-fold cross-validation technique and the algorithm with the best performance was the random forest model with an accuracy of 89 %, an F1 score of 0.9, and a receiver operating characteristic curve (ROC curve) showing the area under the curve (AUC) score of 0.95. LR and SVM models also showed plausible performances with overall accuracy values of 0.84 and 0.86, respectively. The models, RF, LR, and SVM were deployed as a web server known as EBOLApred accessible via http://197.255.126.13:8000/.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Machine Learning-Based Malicious Application Detection of Android
    Wei, Linfeng
    Luo, Weiqi
    Weng, Jian
    Zhong, Yanjun
    zhang, Xiaoqian
    Yan, Zheng
    IEEE ACCESS, 2017, 5 : 25591 - 25601
  • [32] Parameters estimation in Ebola virus transmission dynamics model based on machine learning
    Gong, Jing
    Wu, Yong-Ping
    Li, Li
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 536
  • [33] Marigold: a machine learning-based web app for zebrafish pose tracking
    Teicher, Gregory
    Riffe, R. Madison
    Barnaby, Wayne
    Martin, Gabrielle
    Clayton, Benjamin E.
    Trapani, Josef G.
    Downes, Gerald B.
    BMC BIOINFORMATICS, 2025, 26 (01):
  • [34] A machine learning-based pipeline and web server ImmuneMirror for neoantigen prediction
    Dai, Wei
    Chuwdhury, Gulam Sarwar
    Guo, Yunshan
    Liu, Zhonghua
    CANCER RESEARCH, 2023, 83 (07)
  • [35] A Machine Learning-Based Approach to Detect Web Service Design Defects
    Ouni, Ali
    Daagi, Marwa
    Kessentini, Marouane
    Bouktif, Salah
    Gammoudi, Mohamed Mohsen
    2017 IEEE 24TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2017), 2017, : 532 - 539
  • [36] Machine learning-based approach for predicting the consolidation characteristics of soft soil
    Singh, Moirangthem Johnson
    Kaushik, Anshul
    Patnaik, Gyanesh
    Xu, Dong-Sheng
    Feng, Wei-Qiang
    Rajput, Abhishek
    Prakash, Guru
    Borana, Lalit
    MARINE GEORESOURCES & GEOTECHNOLOGY, 2024, 42 (04) : 405 - 419
  • [37] A machine learning-based method for predicting the shear behaviors of rock joints
    He, Liu
    Tan, Yu
    Copeland, Timothy
    Chen, Jiannan
    Tang, Qiang
    SOILS AND FOUNDATIONS, 2024, 64 (06)
  • [38] A machine learning-based analysis for predicting fragility curve parameters of buildings
    Dabiri, Hamed
    Faramarzi, Asaad
    Dall 'Asta, Andrea
    Tondi, Emanuele
    Micozzi, Fabio
    JOURNAL OF BUILDING ENGINEERING, 2022, 62
  • [39] Comprehensive assessment of machine learning-based methods for predicting antimicrobial peptides
    Xu, Jing
    Li, Fuyi
    Leier, Andre
    Xiang, Dongxu
    Shen, Hsin-Hui
    Lago, Tatiana T. Marquez
    Li, Jian
    Yu, Dong-Jun
    Song, Jiangning
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [40] A Novel Machine Learning-Based Systolic Blood Pressure Predicting Model
    Zheng, Jiao
    Yu, Zhengyu
    JOURNAL OF NANOMATERIALS, 2021, 2021