Classification performance of data mining algorithms applied to breast cancer data

Authors
Santos, Vitor [1]
Datia, Nuno [1]
Pato, M. P. M. [1]
Affiliations
[1] ISEL, Lisbon, Portugal
Keywords
ROC CURVE; AREA;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we study how several classification algorithms perform when applied to a breast cancer dataset. The challenge is to develop models for computer-aided detection (CAD) capable of classifying, at early stages, masses spotted in X-ray images. The dataset was made available for KDD Cup 2008. The imbalanced nature of the dataset and its high-dimensional feature space pose problems for the modelling, which are tackled using dimension reduction techniques. The algorithms are compared using the area under the curve (AUC) of the receiver operating characteristic (ROC) curve, which plots the true-positive rate (TPR) against the false-positive rate (FPR). Other metrics, such as patient sensitivity and FPR, are also used and discussed. We find that the Naive Bayes classifier achieved the best performance irrespective of the combination of datasets, and that it allows controlled trade-offs between false positives and false negatives.
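As an illustration of the comparison metric named in the abstract, the following is a minimal sketch of how an ROC curve and its AUC can be computed from classifier scores. It is not the authors' code: the function name `roc_auc` and the toy labels and scores are assumptions made for this example, and the data are deliberately imbalanced (2 positives, 6 negatives) only to echo the setting described above.

```python
def roc_auc(labels, scores):
    """Return (fpr, tpr, auc) for binary labels (1 = positive) and real-valued scores.

    Hypothetical helper for illustration; higher scores mean "more likely positive".
    """
    pos = sum(1 for y in labels if y == 1)
    neg = len(labels) - pos
    if pos == 0 or neg == 0:
        raise ValueError("need at least one positive and one negative example")

    # Rank examples from highest to lowest score.
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])

    fpr, tpr = [0.0], [0.0]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        threshold = ranked[i][0]
        # Consume every example tied at this score before emitting a point.
        while i < len(ranked) and ranked[i][0] == threshold:
            if ranked[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        fpr.append(fp / neg)
        tpr.append(tp / pos)

    # Trapezoidal rule over consecutive (FPR, TPR) points.
    auc = sum(
        (fpr[k] - fpr[k - 1]) * (tpr[k] + tpr[k - 1]) / 2
        for k in range(1, len(fpr))
    )
    return fpr, tpr, auc


# Toy, imbalanced example (assumed data, not taken from the paper).
toy_labels = [1, 0, 1, 0, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2, 0.1, 0.05]
_, _, toy_auc = roc_auc(toy_labels, toy_scores)
print(round(toy_auc, 3))  # 0.917
```

Each point on the curve corresponds to one score threshold, so lowering the threshold raises the TPR at the cost of a higher FPR, which is the kind of trade-off between false positives and false negatives the abstract refers to.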
Pages: 307-312
Number of pages: 6