Support Vector Machine Outperforms Other Machine Learning Models in Early Diagnosis of Dengue Using Routine Clinical Data

Authors
Qaiser, Ariba [1]
Manzoor, Sobia [1]
Hashmi, Asraf Hussain [2]
Javed, Hasnain [3]
Zafar, Anam [4]
Ashraf, Javed [5, 6]
Affiliations
[1] Natl Univ Sci & Technol NUST, Atta Ur Rehman Sch Appl Biosci ASAB, Mol Virol Lab, Islamabad, Pakistan
[2] KRL Hosp, Inst Biomed & Genet Engn IBGE, Islamabad, Pakistan
[3] Prov Publ Hlth Reference Lab, Punjab AIDS Control Program, Lahore, Pakistan
[4] Dept Pediat, Avicenna Med Complex, Lahore, Pakistan
[5] Riphah Int Univ, Dept Community Dent, Islamabad, Pakistan
[6] Univ Eastern Finland, Inst Dent, Kuopio, Finland
DOI
10.1155/2024/5588127
Chinese Library Classification (CLC) number
Q93 [Microbiology]
Discipline classification code
071005; 100705
Abstract
Background: There is a dire need to establish active dengue surveillance to continuously detect cases, identify circulating serotypes, and determine the disease burden of dengue fever (DF) in the country and region. Predicting dengue PCR results using machine learning (ML) models represents a significant advancement in pre-emptive healthcare measures. This study outlines the process of data preprocessing, model selection, and the underlying mechanisms of each algorithm employed to predict dengue PCR outcomes.
Methods: We analyzed data from 300 suspected dengue patients in Islamabad and Rawalpindi, Pakistan, from August to October 2023. NS1 antigen ELISA, IgM and IgG antibody tests, and serotype-specific real-time polymerase chain reaction (RT-PCR) were used to detect the dengue virus (DENV). Representative PCR-positive samples were sequenced by Sanger sequencing to confirm the circulation of various dengue serotypes. Demographic information, serological test results, and hematological parameters were used as inputs to the ML models, with the dengue PCR result serving as the output to be predicted. The models used were logistic regression, XGBoost, LightGBM, random forest, support vector machine (SVM), and CatBoost.
Results: Of the 300 patients, 184 (61.33%) were PCR positive. Among the PCR-positive cases, 9 (4.89%), 171 (92.93%), and 4 (2.17%) were infected with serotypes 1, 2, and 3, respectively. A total of 147 (79.89%) males and 37 (20.11%) females were infected, with a mean age of 33 ± 16 years. In addition, the mean platelet count, leukocyte count, and hematocrit were 75,447, 4,189.02, and 46.05%, respectively. The SVM was the best-performing ML model for predicting RT-PCR results, with 71.4% accuracy, 97.4% recall, and 71.6% precision. Hyperparameter tuning improved the recall to 100%.
Conclusion: Our study documents three circulating serotypes in the capital territory of Pakistan and highlights that the SVM outperformed the other models, potentially serving as a valuable tool in clinical settings to aid in the rapid diagnosis of DF.
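The abstract reports accuracy, recall, and precision without stating how they are defined. A minimal Python sketch of the standard definitions follows, together with a check of the case percentages quoted in the Results. The function names and the confusion-matrix counts are hypothetical and chosen only for illustration; they are not the study's data or code.

```python
def classification_metrics(tp, fp, fn, tn):
    """Standard binary-classification metrics from confusion-matrix counts."""
    total = tp + fp + fn + tn
    return {
        # Share of all predictions that were correct.
        "accuracy": (tp + tn) / total,
        # Share of truly positive cases that the model flagged as positive.
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
        # Share of positive predictions that were truly positive.
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
    }


def percent(part, whole):
    """Percentage rounded to two decimals, as in the abstract."""
    return round(100 * part / whole, 2)


# Hypothetical counts for illustration only (not taken from the study).
example = classification_metrics(tp=8, fp=2, fn=2, tn=8)

# Counts quoted in the abstract: 184 of 300 PCR positive, split by serotype.
pcr_positive_share = percent(184, 300)
serotype_shares = [percent(n, 184) for n in (9, 171, 4)]
```

High recall means few PCR-positive patients are missed, which is usually the priority in a screening setting; precision shows how many of the flagged patients are actually positive.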
Pages: 9
Related papers
50 results in total
  • [1] Machine Learning for Early Lung Cancer Identification Using Routine Clinical and Laboratory Data
    Gould, Michael K.
    Huang, Brian Z.
    Tammemagi, Martin C.
    Kinar, Yaron
    Shiff, Ron
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2021, 204 (04) : 445 - 453
  • [2] A Hybrid Support Vector Machine Algorithm for Big Data Heterogeneity Using Machine Learning
    Ul Ahsaan, Shafqat
    Kaur, Harleen
    Mourya, Ashish Kumar
    Naaz, Sameena
    SYMMETRY-BASEL, 2022, 14 (11):
  • [3] Early Diagnosis of Primary Immunodeficiency Disease Using Clinical Data and Machine Learning
    Mayampurath, Anoop
    Ajith, Aswathy
    Anderson-Smits, Colin
    Chang, Shun-Chiao
    Brouwer, Emily
    Johnson, Julie
    Baltasi, Michael
    Volchenboum, Samuel
    Devercelli, Giovanna
    Ciaccio, Christina E.
    JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY-IN PRACTICE, 2022, 10 (11): 3002 - +
  • [4] Machine Learning Models for Early Dengue Severity Prediction
    Caicedo-Torres, William
    Paternina, Angel
    Pinzon, Hernando
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2016, 2016, 10022 : 247 - 258
  • [5] Modeling Dengue vector population using remotely sensed data and machine learning
    Scavuzzo, Juan M.
    Trucco, Francisco
    Espinosa, Manuel
    Tauro, Carolina B.
    Abril, Marcelo
    Scavuzzo, Carlos M.
    Frery, Alejandro C.
    ACTA TROPICA, 2018, 185 : 167 - 175
  • [6] GLAUCOMA DIAGNOSIS USING SUPPORT VECTOR MACHINE
    Thangaraj, Vigneswaran
    Natarajan, V.
    2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2017, : 394 - 399
  • [7] Ensemble Feature Learning of Genomic Data Using Support Vector Machine
    Anaissi, Ali
    Goyal, Madhu
    Catchpoole, Daniel R.
    Braytee, Ali
    Kennedy, Paul J.
    PLOS ONE, 2016, 11 (06):
  • [8] Market Data Analysis by Using Support Vector Machine Learning Technique
    Reddy, Raghavendra
    Shyam, Gopal K.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA ENGINEERING (ICCIDE 2018), 2019, 28 : 19 - 27
  • [9] Diagnosis of Alzheimer Diseases in Early Step Using SVM (Support Vector Machine)
    Ben Rabeh, Amira
    Benzarti, Faouzi
    Amiri, Hamid
    2016 13TH INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS, IMAGING AND VISUALIZATION (CGIV), 2016, : 364 - 367
  • [10] Early differential diagnosis models of Talaromycosis and Tuberculosis in HIV-negative hosts using clinical data and machine learning
    Qiu, Ye
    Li, Zheng-tu
    Yang, Shi-xiong
    Chen, Wu-shu
    Zhang, Yong
    Kong, Qun-yu
    Chen, Ling-rui
    Huang, Jie
    Lin, Lue
    Xie, Kan
    Zeng, Wen
    Li, Shao-qiang
    Zhan, Yang-qing
    Wang, Yan
    Zhang, Jian-quan
    Ye, Feng
    JOURNAL OF INFECTION AND PUBLIC HEALTH, 2025, 18 (06)