Improving prediction of cervical cancer using KNN imputer and multi-model ensemble learning

被引:7
|
作者
Aljrees, Turki [1 ]
机构
[1] Univ Hafr Al Batin, Coll Comp Sci & Engn, Hafar al Batin, Saudi Arabia
来源
PLOS ONE | 2024年 / 19卷 / 01期
关键词
HUMAN-PAPILLOMAVIRUS; MACHINE; CLASSIFICATION; LEVEL;
D O I
10.1371/journal.pone.0295632
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cervical cancer is a leading cause of women's mortality, emphasizing the need for early diagnosis and effective treatment. In line with the imperative of early intervention, the automated identification of cervical cancer has emerged as a promising avenue, leveraging machine learning techniques to enhance both the speed and accuracy of diagnosis. However, an inherent challenge in the development of these automated systems is the presence of missing values in the datasets commonly used for cervical cancer detection. Missing data can significantly impact the performance of machine learning models, potentially leading to inaccurate or unreliable results. This study addresses a critical challenge in automated cervical cancer identification-handling missing data in datasets. The study present a novel approach that combines three machine learning models into a stacked ensemble voting classifier, complemented by the use of a KNN Imputer to manage missing values. The proposed model achieves remarkable results with an accuracy of 0.9941, precision of 0.98, recall of 0.96, and an F1 score of 0.97. This study examines three distinct scenarios: one involving the deletion of missing values, another utilizing KNN imputation, and a third employing PCA for imputing missing values. This research has significant implications for the medical field, offering medical experts a powerful tool for more accurate cervical cancer therapy and enhancing the overall effectiveness of testing procedures. By addressing missing data challenges and achieving high accuracy, this work represents a valuable contribution to cervical cancer detection, ultimately aiming to reduce the impact of this disease on women's health and healthcare systems.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Examination of multi-model ensemble seasonal prediction methods using a simple climate system
    Kang, IS
    Yoo, JH
    CLIMATE DYNAMICS, 2006, 26 (2-3) : 285 - 294
  • [42] A novel deep learning-based multi-model ensemble method for the prediction of neuromuscular disorders
    Khamparia, Aditya
    Singh, Aman
    Anand, Divya
    Gupta, Deepak
    Khanna, Ashish
    Kumar, N. Arun
    Tan, Joseph
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (15): : 11083 - 11095
  • [43] Short-Term Traffic Flow Prediction Based on Multi-Model by Stacking Ensemble Learning
    Chen, Yong
    CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 87 - 99
  • [44] Enhancing Breast Cancer Detection and Classification Using Advanced Multi-Model Features and Ensemble Machine Learning Techniques
    Al Reshan, Mana Saleh
    Amin, Samina
    Zeb, Muhammad Ali
    Sulaiman, Adel
    Alshahrani, Hani
    Azar, Ahmad Taher
    Shaikh, Asadullah
    LIFE-BASEL, 2023, 13 (10):
  • [46] An Ensemble Learning Approach of Multi-Model for Classifying House Damage
    Fan, Junqiao
    Xu, Chun
    Zhang, Jiahe
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 145 - 152
  • [47] Improving the quality of simulated soil moisture with a multi-model ensemble approach
    Guo, Zhichang
    Dirmeyer, Paul A.
    Gao, Xiang
    Zhao, Mei
    QUARTERLY JOURNAL OF THE ROYAL METEOROLOGICAL SOCIETY, 2007, 133 (624) : 731 - 747
  • [48] Hydrological ensemble forecasting using a multi-model framework
    Dion, Patrice
    Martel, Jean-Luc
    Arsenault, Richard
    JOURNAL OF HYDROLOGY, 2021, 600 (600)
  • [49] Load Forecasting Based on Multi-model by Stacking Ensemble Learning
    Shi J.
    Zhang J.
    Zhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering, 2019, 39 (14): : 4032 - 4041
  • [50] Improving multi-model ensemble streamflow forecasts by combining lumped, distributed and deep learning hydrological models
    Armstrong, William
    Arsenault, Richard
    Martel, Jean-Luc
    Troin, Magali
    Dion, Patrice
    Sabzipour, Behmard
    Brissette, Francois
    Mai, Juliane
    HYDROLOGICAL SCIENCES JOURNAL, 2025,