EBOLApred: A machine learning-based web application for predicting cell entry inhibitors of the Ebola virus

被引:7
|
作者
Adams, Joseph [1 ,2 ]
Agyenkwa-Mawuli, Kwasi [1 ,3 ]
Agyapong, Odame [1 ]
Wilson, Michael D. [2 ,4 ]
Kwofie, Samuel K. [1 ,3 ]
机构
[1] Univ Ghana, Coll Basic & Appl Sci, Sch Engn Sci, Dept Biomed Engn, PMB LG 77,LG 77, Accra, Ghana
[2] Univ Ghana, Noguchi Mem Inst Med Res NMIMR, Coll Hlth Sci CHS, Dept Parasitol, POB LG 581,LG 581, Accra, Ghana
[3] Univ Ghana, Coll Basic & Appl Sci, West African Ctr Cell Biol Infect Pathogens, Dept Biochem Cell & Mol Biol, LG 54, Accra, Ghana
[4] Loyola Univ, Dept Med, Med Ctr, Maywood, IL 60153 USA
关键词
Ebola virus protein; Machine learning; Inhibitors; Support vector machine; Random forest; Logistic regression; MATRIX PROTEIN VP40; APPLICABILITY DOMAIN; DOCKING; DATABASE; SMOTE;
D O I
10.1016/j.compbiolchem.2022.107766
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Ebola virus disease (EVD) is a highly virulent and often lethal illness that affects humans through contact with the body fluid of infected persons. Glycoprotein and matrix protein VP40 play essential roles in the virus life cycle within the host. Whilst glycoprotein mediates the entry and fusion of the virus with the host cell membrane, VP40 is also responsible for viral particle assembly and budding. This study aimed at developing machine learning models to predict small molecules as possible anti-Ebola virus compounds capable of inhibiting the activities of GP and VP40 using Ebola virus (EBOV) cell entry inhibitors from the PubChem database as training data. Predictive models were developed using five algorithms comprising random forest (RF), support vector machine (SVM), naive Bayes (NB), k-nearest neighbor (kNN), and logistic regression (LR). The models were evaluated using a 10-fold cross-validation technique and the algorithm with the best performance was the random forest model with an accuracy of 89 %, an F1 score of 0.9, and a receiver operating characteristic curve (ROC curve) showing the area under the curve (AUC) score of 0.95. LR and SVM models also showed plausible performances with overall accuracy values of 0.84 and 0.86, respectively. The models, RF, LR, and SVM were deployed as a web server known as EBOLApred accessible via http://197.255.126.13:8000/.
引用
收藏
页数:10
相关论文
共 50 条
  • [11] Machine learning-based modeling to predict inhibitors of acetylcholinesterase
    Hardeep Sandhu
    Rajaram Naresh Kumar
    Prabha Garg
    Molecular Diversity, 2022, 26 : 331 - 340
  • [12] Machine learning-based modeling to predict inhibitors of acetylcholinesterase
    Sandhu, Hardeep
    Kumar, Rajaram Naresh
    Garg, Prabha
    MOLECULAR DIVERSITY, 2022, 26 (01) : 331 - 340
  • [13] Discovery of novel Ebola virus entry inhibitors enabled by QSAR-based approaches
    Capuzzi, Stephen
    Tropsha, Alexander
    Muratov, Eugene
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 251
  • [14] A machine learning-based workflow for predicting transplant outcomes in patients with sickle cell disease
    Li, Haiou
    Sachdev, Vandana
    Tian, Xin
    Nguyen, My-Le
    Hsieh, Matthew
    Fitzhugh, Courtney
    Limerick, Emily
    Coles, Wynona
    Asomaning, Nancy
    Conrey, Anna
    Wu, Colin O.
    Thein, Swee Lay
    BRITISH JOURNAL OF HAEMATOLOGY, 2025, 206 (03) : 919 - 923
  • [15] Application of machine learning for predicting G9a inhibitors
    Ivanova, Mariya L.
    Russo, Nicola
    Djaid, Nadia
    Nikolic, Konstantin
    DIGITAL DISCOVERY, 2024, 3 (10): : 2010 - 2018
  • [16] INFINITy: A fast machine learning-based application for human influenza A and B virus subtyping
    Cacciabue, Marco
    Marcone, Debora N.
    INFLUENZA AND OTHER RESPIRATORY VIRUSES, 2023, 17 (01)
  • [17] Machine Learning Based Web Application Firewall
    Isiker, Batuhan
    Sogukpinar, Ibrahim
    2ND INTERNATIONAL INFORMATICS AND SOFTWARE ENGINEERING CONFERENCE (IISEC), 2021,
  • [18] CrystalMELA: A machine learning-based web platform for polycrystalline characterization
    Rizzi, R.
    Corriero, N.
    Del Buono, N.
    Settembre, G.
    Diacono, D.
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2024, 80
  • [19] Machine Learning-Based Method for Predicting Compressive Strength of Concrete
    Li, Daihong
    Tang, Zhili
    Kang, Qian
    Zhang, Xiaoyu
    Li, Youhua
    PROCESSES, 2023, 11 (02)
  • [20] Machine learning-based approach for predicting low birth weight
    Ranjbar, Amene
    Montazeri, Farideh
    Farashah, Mohammadsadegh Vahidi
    Mehrnoush, Vahid
    Darsareh, Fatemeh
    Roozbeh, Nasibeh
    BMC PREGNANCY AND CHILDBIRTH, 2023, 23 (01)