Comparative analysis of machine learning and ensemble approaches for hepatitis B prediction using data mining with synthetic minority oversampling technique

Cited by: 0
Authors
Alizargar, Azadeh [1]
Chang, Yang-Lang [1]
Tan, Tan-Hsu [1]
Liu, Tsung-Yu [2]
Affiliations
[1] Natl Taipei Univ Technol, Coll Elect Engn & Comp Sci, Dept Elect Engn, Taipei 10608, Taiwan
[2] Lunghwa Univ Sci & Technol, Dept Multimedia & Game Sci, Taoyuan 333326, Taiwan
Keywords
Hepatitis B; Liver damage; Early detection; Machine learning; Ensemble model; SMOTE; Risk; Diagnosis; Virus
DOI
10.1007/s12553-023-00802-x
Chinese Library Classification number
R-058
Abstract
Purpose: Hepatitis B, caused by the Hepatitis B virus (HBV), can harm the liver without noticeable symptoms. Early detection is crucial to prevent transmission and improve recovery. The main goal is to predict Hepatitis B from cost-effective laboratory test data using machine learning. The primary focus is on evaluating how effectively various algorithms predict the disease and how far they can improve early diagnosis.

Methods: Six distinct algorithms (Support Vector Machine, K-Nearest Neighbors, Logistic Regression, Decision Tree, Extreme Gradient Boosting, and Random Forest) were employed alongside an ensemble model. The analysis was run in two rounds: one using all features and one using only key attributes. The Synthetic Minority Oversampling Technique (SMOTE) was applied to address class imbalance in the data. Various metrics, including the confusion matrix, precision, recall, F1 score, accuracy, receiver operating characteristic (ROC) curve, area under the curve (AUC), and mean absolute error (MAE), were used to assess the efficacy of each predictive technique. The National Health and Nutrition Examination Survey (NHANES) dataset was used.

Results: The experimental results demonstrate that the ensemble model attained the highest accuracy (97%) and AUC (0.997) in comparison to existing models. The analysis revealed that specific key features carry substantial predictive significance within this model.

Conclusion: The study underscores the potential of the ensemble model as a valuable tool for medical practitioners, leveraging cost-effective and readily obtainable laboratory test data to predict Hepatitis B with high accuracy. By facilitating early diagnosis and intervention, this research presents a promising avenue to improve patient outcomes in the context of Hepatitis B.
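The abstract names SMOTE as the remedy for class imbalance but does not describe how it was configured. The following is a minimal sketch of the general SMOTE idea only: each synthetic minority sample is placed at a random point on the segment between a minority sample and one of its nearest minority neighbours. The function name `smote`, the parameters `k` and `seed`, and the toy two-feature data are all assumptions made for illustration; they are not taken from the paper or from the NHANES dataset.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Return n_new synthetic samples interpolated between minority points.

    Hypothetical helper for illustration; needs at least two minority samples.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority samples by Euclidean distance to the base.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: math.dist(base, p))
        # Pick one of the k nearest minority neighbours at random.
        neighbour = rng.choice(others[:k])
        # Interpolate at a random position between base and neighbour.
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Toy data with two made-up features; not real laboratory values.
majority = [[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3], [0.9, 0.7], [1.3, 1.0]]
minority = [[3.0, 3.0], [3.2, 2.8], [2.9, 3.3]]

# Create just enough synthetic samples to match the majority class size.
new_points = smote(minority, n_new=len(majority) - len(minority), k=2)
balanced_minority = minority + new_points
print(len(majority), len(balanced_minority))
```

In practice, oversampling of this kind is usually applied to the training split only, so that synthetic points do not leak into the data used to compute accuracy or AUC; whether the study did so is not stated in the abstract.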
Pages: 109-118
Page count: 10