Enhancing Prediction Accuracy in Gastric Cancer Using High-Confidence Machine Learning Models for Class Imbalance

被引:0
|
作者
Jamil, Danish [1 ,2 ]
Palaniappan, Sellappan [1 ]
Naseem, Muhammad [2 ]
Lokman, Asiah [1 ]
机构
[1] Malaysia Univ Sci & Technol, Dept Informat Technol, Kuala Lumpur, Malaysia
[2] Sir Syed Univ Engn & Technol, Dept Software Engn, Karachi, Pakistan
关键词
class imbalance; gastric cancer; decision support system; machine learning; prediction accuracy; naive bayes; logistic regression; medical diagnostics; positive predictive value;
D O I
10.12720/jait.14.6.1410-1424
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gastric Cancer (GC) diagnosis and prognosis present significant challenges in the clinical industry. To address the issue of low prediction accuracy resulting from imbalanced positive and negative GC cases, this study proposes a medical Decision Support System (DSS) based on supervised Machine Learning (ML) methods. Four ML models, including Naive Bayes (NB), Logistic Regression (LR), and Multilayer Perceptron (MLP), were employed in this study. The impact of data imbalance on GC prediction was assessed through two procedures. Among the ML models, the MLP model demonstrated the best performance in weighted GC prediction, achieving a sensitivity of 0.930 and a Positive Predictive Value (PPV) of 0.932 for balanced predictions, and a sensitivity of 0.918 and a PPV of 0.908 for unbalanced predictions. The NB model showed promise in handling the data imbalance issue, achieving a sensitivity of 0.722 and a PPV of 0.420 on the unbalanced dataset. Additionally, a DSS was developed specifically for the NB and LR models to improve prediction accuracy. The proposed method significantly improved the sensitivity of optimistic GC case prediction, with the Naive Bayes model achieving a sensitivity of 0.936 and the Logistic Regression model achieving a sensitivity of 0.8306. These improvements enhance the reliability and efficiency of GC diagnostics, offering valuable decision support in healthcare. This research provides insights into addressing class imbalance in GC likelihood prediction and has potential implications for clinical practice.
引用
收藏
页码:1410 / 1424
页数:15
相关论文
共 50 条
  • [1] Enhancing accuracy of membrane fouling prediction using hybrid machine learning models
    Lim, Seung Ji
    Kim, Young Mi
    Park, Hosik
    Ki, Seojin
    Jeong, Kwanho
    Seo, Jangwon
    Chae, Sung Ho
    Kim, Joon Ha
    DESALINATION AND WATER TREATMENT, 2019, 146 : 22 - 28
  • [2] Enhancing accuracy of membrane fouling prediction using hybrid machine learning models
    Lim, Seung Ji
    Kim, Young Mi
    Park, Hosik
    Ki, Seojin
    Jeong, Kwanho
    Seo, Jangwon
    Chae, Sung Ho
    Kim, Joon Ha
    DESALINATION AND WATER TREATMENT, 2022, 270 : 320 - 320
  • [3] Extreme Learning Machine prediction under high class imbalance in bioinformatics
    Rodriguez, T.
    Di Persia, L. E.
    Milone, D. H.
    Stegmayer, G.
    2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI), 2017,
  • [4] Class imbalance in machine learning for neurosurgical outcome prediction: are our models valid?
    Staartjes, Victor E.
    Schroder, Marc L.
    JOURNAL OF NEUROSURGERY-SPINE, 2018, 29 (05) : 611 - 612
  • [5] Editorial: machine learning models for gastric cancer risk prediction
    Spaander, Manon C. W.
    Kuipers, Ernst J.
    ALIMENTARY PHARMACOLOGY & THERAPEUTICS, 2021, 53 (08) : 943 - 944
  • [6] Enhancing Prediction Accuracy of Vessel Arrival Times Using Machine Learning
    Evmides, Nicos
    Aslam, Sheraz
    Ramez, Tzioyntmprian T.
    Michaelides, Michalis P.
    Herodotou, Herodotos
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (08)
  • [7] Breast Cancer Prediction using Machine Learning Models
    Iparraguirre-Villanueva, Orlando
    Epifania-Huerta, Andres
    Torres-Ceclen, Carmen
    Ruiz-Alvarado, John
    Cabanillas-Carbonell, Michael
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (02) : 610 - 620
  • [8] Enhancing the accuracy of metocean hindcasts with machine learning models
    Costa, Mariana O.
    Campos, Ricardo M.
    Soares, C. Guedes
    OCEAN ENGINEERING, 2023, 287
  • [9] Diversity based imbalance learning approach for software fault prediction using machine learning models
    Manchala, Pravali
    Bisi, Manjubala
    APPLIED SOFT COMPUTING, 2022, 124
  • [10] Comparison of machine learning models for the prediction of cancer cells using MHC class I complexes
    Mateo, N.
    Canon, Alvaro David Orjuela
    Charry, Oscar Julian Perdomo
    16TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, 2020, 11583