Prognosis of Cervical Cancer Disease by Applying Machine Learning Techniques

被引:12
|
作者
Kumawat, Gaurav [1 ]
Vishwakarma, Santosh Kumar [1 ]
Chakrabarti, Prasun [2 ]
Chittora, Pankaj [1 ]
Chakrabarti, Tulika [3 ]
Lin, Jerry Chun-Wei [4 ]
机构
[1] Manipal Univ Jaipur, Dept Comp Sci & Engn, Jaipur 302034, Rajasthan, India
[2] ITM SLS Baroda Univ, Vadodara 391510, Gujarat, India
[3] Sir Padampat Singhania Univ, Dept Basic Sci, Udaipur 313601, Rajasthan, India
[4] Western Norway Univ Appl Sci, Dept Comp Sci Elect Engn & Math Sci, N-5063 Bergen, Norway
关键词
Cervical cancer; BayesNet; artificial neural network; support vector machine; random tree; logistic; XG boost tree; LASSO; prediction; RISK-FACTORS; SYMPTOMS;
D O I
10.1142/S0218126623500196
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Cervical cancer is one of the deadliest diseases in women worldwide. It is caused by long-term infection of the skin cells and mucosal cells of the genital area of women. The most disturbing thing about this cancer is the fact that it does not show any symptoms when it occurs. In the diagnosis and prognosis of cervical cancer disease, machine learning has the potential to help detect it at an early stage. In this paper, we analyzed different supervised machine learning techniques to detect cervical cancer at an early stage. To train the machine learning model, a cervical cancer dataset from the UCI repository was used. The different methods were evaluated using this dataset of 858 cervical cancer patients with 36 risk factors and one outcome variable. Six classification algorithms were applied in this study, including an artificial neural network, a Bayesian network, an SVM, a random tree, a logistic tree, and an XG-boost tree. All models were trained with and without a feature selection algorithm to compare the performance and accuracy of the classifiers. Three feature selection algorithms were used, namely (i) relief rank, (ii) wrapper method and (iii) LASSO regression. The maximum accuracy of 94.94% was recorded using XG Boost with complete features. It is also observed that for this dataset, in some cases, the feature selection algorithm performs better. Machine learning has been shown to have advantages over traditional statistical models when it comes to dealing with the complexity of large-scale data and uncovering prognostic features. It offers much potential for clinical use and for improving the treatment of cervical cancer. However, the limitations of prediction studies and models, such as simplified, incomplete information, overfitting, and lack of interpretability, suggest that further efforts are needed to improve the accuracy, reliability, and practicality of clinical outcome prediction.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] Prognosis of forest production using machine learning techniques
    Silva, Jeferson Pereira Martins
    da Silva, Mayra Luiza Marques
    Mendonca, Adriano Ribeiro de
    da Silva, Gilson Fernandes
    de Barros Jr, Antonio Almeida
    da Silva, Evandro Ferreira
    Aguiar, Marcelo Otone
    Santos, Jeangelis Silva
    Rodrigues, Nivea Maria Mafra
    [J]. INFORMATION PROCESSING IN AGRICULTURE, 2023, 10 (01): : 71 - 84
  • [32] Hybrid Model for Detection of Cervical Cancer Using Causal Analysis and Machine Learning Techniques
    Lilhore, Umesh Kumar
    Poongodi, M.
    Kaur, Amandeep
    Simaiya, Sarita
    Algarni, Abeer D.
    Elmannai, Hela
    Vijayakumar, V.
    Tunze, Godwin Brown
    Hamdi, Mounir
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
  • [33] Applying machine learning techniques to predict the properties of energetic materials
    Daniel C. Elton
    Zois Boukouvalas
    Mark S. Butrico
    Mark D. Fuge
    Peter W. Chung
    [J]. Scientific Reports, 8
  • [34] Applying machine learning techniques to localize quadcopter sensor failures*
    Kim, Stanislav
    Margun, Alexey
    Pyrkin, Anton
    Evstafev, Oleg
    [J]. 2022 8TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'22), 2022, : 1542 - 1544
  • [35] Applying Machine Learning Techniques to Mine Ventilation Control Systems
    Kashnikov, Aleksey V.
    Levin, Lev
    [J]. PROCEEDINGS OF 2017 XX IEEE INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND MEASUREMENTS (SCM), 2017, : 391 - 393
  • [36] Applying Machine Learning Techniques for Performing Comparative Opinion Mining
    Younis, Umair
    Asghar, Muhammad Zubair
    Khan, Adil
    Khan, Alamsher
    Iqbal, Javed
    Jillani, Nosheen
    [J]. OPEN COMPUTER SCIENCE, 2020, 10 (01) : 461 - 477
  • [37] Applying machine learning techniques to predict the properties of energetic materials
    Elton, Daniel C.
    Boukouvalas, Zois
    Butrico, Mark S.
    Fuge, Mark D.
    Chung, Peter W.
    [J]. SCIENTIFIC REPORTS, 2018, 8
  • [38] Applying machine learning techniques to improve linux process scheduling
    Negi, Atul
    Kishore, Kumar P.
    [J]. TENCON 2005 - 2005 IEEE REGION 10 CONFERENCE, VOLS 1-5, 2006, : 393 - +
  • [39] Comprehensive Analysis of Students' Performance by Applying Machine Learning Techniques
    HemaMalini, B. H.
    Suresh, L.
    Kushal, Mayank
    [J]. SMART INTELLIGENT COMPUTING AND APPLICATIONS, VOL 2, 2020, 160 : 547 - 556
  • [40] PREDICTIVE MODEL OF THE LEARNING PROGRESS OF UNIMINUTO STUDENTS APPLYING MACHINE LEARNING TECHNIQUES
    Bautista Canon, Elmer
    Quirama Salamanca, Jenny E.
    Canon, Edilfonso Bautista
    [J]. REVISTA CONRADO, 2021, 17 (83): : 305 - 310