Towards an Accurate Breast Cancer Classification Model based on Ensemble Learning

Authors
Hesham, Aya [1 ]
El-Rashidy, Nora [2 ]
Rezk, Amira [3 ]
Hikal, Noha A. [1 ]
Affiliations
[1] Mansoura Univ, Fac Comp & Informat, Informat Technol Dept, Mansoura 13518, Egypt
[2] Kafrelsheiksh Univ, Machine Learning & Informat Retrieval Dept, Fac Artificial Intelligence, Kafrelsheiksh 13518, Egypt
[3] Mansoura Univ, Fac Comp & Informat, Informat Syst Dept, Mansoura 13518, Egypt
Keywords
Breast cancer; feature selection; classification; machine learning; FEATURE-SELECTION; PREDICTION; ALGORITHM;
DOI
10.14569/IJACSA.2022.0131272
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
Breast cancer (BC) is considered the most common cancer among women and a major cause of increased mortality. The disease begins in breast cells and may spread to other body tissues. Early detection and prediction of BC can help save a patient's life. In recent decades, machine learning (ML) has played a significant role in developing models that detect and predict various diseases at an early stage, which can greatly increase patient survival rates. The importance of ML classification lies in its ability to learn from previous datasets, detect patterns that are difficult to discern in massive datasets, predict a categorical variable for a given example, and provide accurate results within a short time. A feature selection (FS) method was used to reduce data dimensionality and choose the optimal feature set. In this paper, we propose a stacking ensemble model that differentiates between malignant and benign BC cells. A total of 25 experiments were conducted using several single classifiers, including logistic regression (LR), decision tree (DT), linear discriminant analysis (LDA), K-nearest neighbor (KNN), naive Bayes (NB), and support vector machine (SVM), as well as several ensemble classifiers, including random forest (RF), bagging, AdaBoost, voting, and stacking. The results indicate that our ensemble model outperformed other state-of-the-art models in terms of accuracy (98.6%), precision (89.7%), recall, and F1 score (93.33%). The results also show that ensemble methods combined with FS improve classification accuracy more than a single method does when detecting BC.
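The pipeline described in the abstract (feature selection followed by a stacking ensemble over several single classifiers) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the abstract does not name the dataset, the FS method, the number of selected features, or the meta-learner, so the scikit-learn Wisconsin breast cancer dataset, `SelectKBest` with `f_classif`, `k_features=10`, and a logistic regression meta-learner are all assumptions. The helper names `build_stacking_model` and `evaluate` are hypothetical.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import StackingClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier


def build_stacking_model(k_features=10, seed=0):
    """Scale features, keep the k best ones, then fit a stacking ensemble."""
    # Base learners: a subset of the single classifiers listed in the abstract.
    base_learners = [
        ("lr", LogisticRegression(max_iter=1000)),
        ("dt", DecisionTreeClassifier(random_state=seed)),
        ("knn", KNeighborsClassifier()),
        ("nb", GaussianNB()),
        ("svm", SVC(random_state=seed)),
    ]
    # The meta-learner is trained on out-of-fold predictions of the base learners.
    # Assumption: logistic regression as meta-learner, 5-fold internal CV.
    stack = StackingClassifier(
        estimators=base_learners,
        final_estimator=LogisticRegression(max_iter=1000),
        cv=5,
    )
    # Assumption: univariate ANOVA F-test as the feature selection method.
    selector = SelectKBest(score_func=f_classif, k=k_features)
    return make_pipeline(StandardScaler(), selector, stack)


def evaluate(seed=0):
    """Train on a stratified 80/20 split and report held-out metrics."""
    X, y = load_breast_cancer(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, stratify=y, random_state=seed
    )
    model = build_stacking_model(seed=seed)
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)
    # In this dataset label 0 is malignant, so it is treated as the positive class.
    return {
        "accuracy": accuracy_score(y_test, y_pred),
        "precision": precision_score(y_test, y_pred, pos_label=0),
        "recall": recall_score(y_test, y_pred, pos_label=0),
        "f1": f1_score(y_test, y_pred, pos_label=0),
    }
```

Calling `evaluate()` returns a dictionary with the four metrics the abstract reports. The numbers it produces depend on the split and the assumed settings, so they should not be expected to match the figures in the abstract.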
Pages: 590-602
Page count: 13