EBOLApred: A machine learning-based web application for predicting cell entry inhibitors of the Ebola virus

被引:7
|
作者
Adams, Joseph [1 ,2 ]
Agyenkwa-Mawuli, Kwasi [1 ,3 ]
Agyapong, Odame [1 ]
Wilson, Michael D. [2 ,4 ]
Kwofie, Samuel K. [1 ,3 ]
机构
[1] Univ Ghana, Coll Basic & Appl Sci, Sch Engn Sci, Dept Biomed Engn, PMB LG 77,LG 77, Accra, Ghana
[2] Univ Ghana, Noguchi Mem Inst Med Res NMIMR, Coll Hlth Sci CHS, Dept Parasitol, POB LG 581,LG 581, Accra, Ghana
[3] Univ Ghana, Coll Basic & Appl Sci, West African Ctr Cell Biol Infect Pathogens, Dept Biochem Cell & Mol Biol, LG 54, Accra, Ghana
[4] Loyola Univ, Dept Med, Med Ctr, Maywood, IL 60153 USA
关键词
Ebola virus protein; Machine learning; Inhibitors; Support vector machine; Random forest; Logistic regression; MATRIX PROTEIN VP40; APPLICABILITY DOMAIN; DOCKING; DATABASE; SMOTE;
D O I
10.1016/j.compbiolchem.2022.107766
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Ebola virus disease (EVD) is a highly virulent and often lethal illness that affects humans through contact with the body fluid of infected persons. Glycoprotein and matrix protein VP40 play essential roles in the virus life cycle within the host. Whilst glycoprotein mediates the entry and fusion of the virus with the host cell membrane, VP40 is also responsible for viral particle assembly and budding. This study aimed at developing machine learning models to predict small molecules as possible anti-Ebola virus compounds capable of inhibiting the activities of GP and VP40 using Ebola virus (EBOV) cell entry inhibitors from the PubChem database as training data. Predictive models were developed using five algorithms comprising random forest (RF), support vector machine (SVM), naive Bayes (NB), k-nearest neighbor (kNN), and logistic regression (LR). The models were evaluated using a 10-fold cross-validation technique and the algorithm with the best performance was the random forest model with an accuracy of 89 %, an F1 score of 0.9, and a receiver operating characteristic curve (ROC curve) showing the area under the curve (AUC) score of 0.95. LR and SVM models also showed plausible performances with overall accuracy values of 0.84 and 0.86, respectively. The models, RF, LR, and SVM were deployed as a web server known as EBOLApred accessible via http://197.255.126.13:8000/.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Machine learning-based approach for predicting low birth weight
    Amene Ranjbar
    Farideh Montazeri
    Mohammadsadegh Vahidi Farashah
    Vahid Mehrnoush
    Fatemeh Darsareh
    Nasibeh Roozbeh
    BMC Pregnancy and Childbirth, 23
  • [22] Machine Learning-based Models for Predicting the Penetration Depth of Concrete
    Li M.
    Wu H.
    Dong H.
    Ren G.
    Zhang P.
    Huang F.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 (12): : 3771 - 3782
  • [23] Predicting submerged vegetation drag with a machine learning-based method
    Liu, Meng-yang
    Tang, Hong-wu
    Yuan, Sai-yu
    Yan, Jing
    JOURNAL OF HYDRODYNAMICS, 2024, 36 (03) : 534 - 545
  • [24] A machine learning-based framework for predicting game server load
    Ozer, Cagdas
    Cevik, Taner
    Gurhanli, Ahmet
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) : 9527 - 9546
  • [25] A Machine Learning-Based Model for Predicting the Risk of Cardiovascular Disease
    Hsiao, Chiu-Han
    Yu, Po-Chun
    Hsieh, Chia-Ying
    Zhong, Bing-Zi
    Tsai, Yu-Ling
    Cheng, Hao-min
    Chang, Wei-Lun
    Lin, Frank Yeong-Sung
    Huang, Yennun
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 1, 2022, 449 : 364 - 374
  • [26] MACHINE LEARNING-BASED MODEL FOR PREDICTING CONCRETE COMPRESSIVE STRENGTH
    Tu Trung Nguyen
    Long Tran Ngoc
    Hoang Hiep Vu
    Tung Pham Thanh
    INTERNATIONAL JOURNAL OF GEOMATE, 2021, 20 (77): : 197 - 204
  • [27] A machine learning-based framework for predicting game server load
    Çağdaş Özer
    Taner Çevik
    Ahmet Gürhanlı
    Multimedia Tools and Applications, 2021, 80 : 9527 - 9546
  • [28] Machine learning-based cell death signature for predicting the prognosis and immunotherapy benefit in stomach adenocarcinoma
    Li, Fan
    Feng, Qian
    Tao, Ran
    MEDICINE, 2024, 103 (10) : E37314
  • [29] Machine Learning-based Pin Accessibility Prediction and Application
    Fang, Shao-Yun
    2021 INTERNATIONAL SYMPOSIUM ON VLSI DESIGN, AUTOMATION AND TEST (VLSI-DAT), 2021,
  • [30] Continuous Management of Machine Learning-Based Application Behavior
    Anisetti, Marco
    Ardagna, Claudio A.
    Bena, Nicola
    Damiani, Ernesto
    Panero, Paolo G.
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2025, 18 (01) : 112 - 125