Evaluating Diagnostic Performance of Machine Learning Algorithms on Breast Cancer

被引:3
|
作者
Gatuha, George [1 ]
Jiang, Tao [1 ]
机构
[1] Harbin Engn Univ, Coll Informat & Commun Engn, Harbin, Peoples R China
关键词
Data mining; Classification; Open source; Confusion matrix; Breast cancer; RISK-FACTORS;
D O I
10.1007/978-3-319-23862-3_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on comparing performance of six data mining methods namely: Bagging, SVM (SMO), Decorate, C4.5 (J48), Naive Bayes and IBK in analyzing Wisconsin Breast Cancer (WBC) datasets. The datasets were obtained from the UCI Machine Learning Repository and comprises of 699 instances and 11 attributes. A confusion matrix, based on a 10-fold cross validation technique was used in our experiment to provide the basis for measuring the accuracy of each algorithm. We introduce an idea of combining the algorithms at classification level to obtain the most ideal multi-classifier approach for the WBC data set. Waikato Environment Knowledge Explorer (WEKA), open source data mining software was used for the experimental analysis. The experimental results show that SMO offers the best accuracy (97 %) among the six algorithms, while merging SMO, Naive Bayes, J48 and IBK offers the best accuracy (97.3 %) on the data set.
引用
收藏
页码:258 / 266
页数:9
相关论文
共 50 条
  • [1] On Breast Cancer Detection: An Application of Machine Learning Algorithms on the Wisconsin Diagnostic Dataset
    Agarap, Abien Fred M.
    [J]. 2ND INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING (ICMLSC 2018), 2015, : 5 - 9
  • [2] Performance analysis of different machine learning algorithms in breast cancer predictions
    Battineni, Gopi
    Chintalapudi, Nalini
    Amenta, Francesco
    [J]. EAI Endorsed Transactions on Pervasive Health and Technology, 2020, 6 (23): : 1 - 7
  • [3] COMPARISON OF MACHINE LEARNING ALGORITHMS FOR BREAST CANCER
    Suryachandra, Palli
    Reddy, P. Venkata Subba
    [J]. 2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 3, 2015, : 439 - 444
  • [4] DIAGNOSING BREAST CANCER WITH MACHINE LEARNING ALGORITHMS
    Thiyagarajan, S.
    Chakravarthy, T.
    Arivoli, P., V
    [J]. INTERNATIONAL JOURNAL OF LIFE SCIENCE AND PHARMA RESEARCH, 2020, : 42 - 46
  • [5] Machine Learning Algorithms for Breast Cancer Prediction
    Kumar, K. M. E. Senthil
    Akalya, A.
    Kanimozhi, V.
    [J]. JOURNAL OF POPULATION THERAPEUTICS AND CLINICAL PHARMACOLOGY, 2023, 30 (07): : E245 - E250
  • [6] Machine Learning Algorithms for Diagnosis of Breast Cancer
    Negi, Richaa
    Mathew, Rejo
    [J]. PROCEEDING OF THE INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS, BIG DATA AND IOT (ICCBI-2018), 2020, 31 : 928 - 932
  • [7] Comparison of the performance of machine learning algorithms in breast cancer screening and detection: A protocol
    Salod, Zakia
    Singh, Yashik
    [J]. JOURNAL OF PUBLIC HEALTH RESEARCH, 2019, 8 (03) : 112 - 118
  • [8] Classification and diagnostic prediction of breast cancer metastasis on clinical data using machine learning algorithms
    Mahendran Botlagunta
    Madhavi Devi Botlagunta
    Madhu Bala Myneni
    D. Lakshmi
    Anand Nayyar
    Jaithra Sai Gullapalli
    Mohd Asif Shah
    [J]. Scientific Reports, 13
  • [9] Classification and diagnostic prediction of breast cancer metastasis on clinical data using machine learning algorithms
    Botlagunta, Mahendran
    Botlagunta, Madhavi Devi
    Myneni, Madhu Bala
    Lakshmi, D.
    Nayyar, Anand
    Gullapalli, Jaithra Sai
    Shah, Mohd Asif
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [10] Prediction of Breast Cancer using Machine Learning Algorithms
    Mangal, Anuj
    Jain, Vinod
    [J]. PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 464 - 466