Improved Machine Learning-Based Predictive Models for Breast Cancer Diagnosis

被引:26
|
作者
Rasool, Abdur [1 ,2 ]
Bunterngchit, Chayut [1 ,3 ]
Tiejian, Luo [1 ]
Islam, Md Ruhul [4 ]
Qu, Qiang [2 ]
Jiang, Qingshan [2 ]
机构
[1] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab High Performance Data Min, Shenzhen 518055, Peoples R China
[3] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[4] Univ Stavanger, Dept Elect Engn & Comp Sci, N-4044 Stavanger, Norway
关键词
machine learning models; data exploratory techniques; breast cancer diagnosis; tumors classification; SUPPORT VECTOR MACHINE; FEATURE-SELECTION; FEATURE-EXTRACTION; CLASSIFICATION; SVM; BLOCKCHAIN; ENSEMBLE;
D O I
10.3390/ijerph19063211
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Breast cancer death rates are higher than any other cancer in American women. Machine learning-based predictive models promise earlier detection techniques for breast cancer diagnosis. However, making an evaluation for models that efficiently diagnose cancer is still challenging. In this work, we proposed data exploratory techniques (DET) and developed four different predictive models to improve breast cancer diagnostic accuracy. Prior to models, four-layered essential DET, e.g., feature distribution, correlation, elimination, and hyperparameter optimization, were deep-dived to identify the robust feature classification into malignant and benign classes. These proposed techniques and classifiers were implemented on the Wisconsin Diagnostic Breast Cancer (WDBC) and Breast Cancer Coimbra Dataset (BCCD) datasets. Standard performance metrics, including confusion matrices and K-fold cross-validation techniques, were applied to assess each classifier's efficiency and training time. The models' diagnostic capability improved with our DET, i.e., polynomial SVM gained 99.3%, LR with 98.06%, KNN acquired 97.35%, and EC achieved 97.61% accuracy with the WDBC dataset. We also compared our significant results with previous studies in terms of accuracy. The implementation procedure and findings can guide physicians to adopt an effective model for a practical understanding and prognosis of breast cancer tumors.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Machine learning-based models for the prediction of breast cancer recurrence risk
    Zuo, Duo
    Yang, Lexin
    Jin, Yu
    Qi, Huan
    Liu, Yahui
    Ren, Li
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
  • [2] Machine learning-based models for the prediction of breast cancer recurrence risk
    Duo Zuo
    Lexin Yang
    Yu Jin
    Huan Qi
    Yahui Liu
    Li Ren
    [J]. BMC Medical Informatics and Decision Making, 23
  • [3] An Assessment of the Predictive Performance of Current Machine Learning-Based Breast Cancer Risk Prediction Models: Systematic Review
    Gao, Ying
    Li, Shu
    Jin, Yujing
    Zhou, Lengxiao
    Sun, Shaomei
    Xu, Xiaoqian
    Li, Shuqian
    Yang, Hongxi
    Zhang, Qing
    Wang, Yaogang
    [J]. JMIR PUBLIC HEALTH AND SURVEILLANCE, 2022, 8 (12):
  • [4] Machine Learning-Based Classification Models for Diagnosis of Diabetes
    Jaiswal, Sushma
    Jaiswal, Tarun
    [J]. Recent Advances in Computer Science and Communications, 2022, 15 (06) : 813 - 821
  • [5] Machine Learning-Based Predictive Models for Detection of Cardiovascular Diseases
    Ogunpola, Adedayo
    Saeed, Faisal
    Basurra, Shadi
    Albarrak, Abdullah M.
    Qasem, Sultan Noman
    [J]. DIAGNOSTICS, 2024, 14 (02)
  • [6] Machine learning-based radiomics models for prediction of locoregional recurrence in patients with breast cancer
    Lee, Joongyo
    Yoo, Sang Kyun
    Kim, Kangpyo
    Lee, Byung Min
    Park, Vivian Youngjean
    Kim, Jin Sung
    Kim, Yong Bae
    [J]. ONCOLOGY LETTERS, 2023, 26 (04)
  • [7] MACHINE LEARNING-BASED PREDICTIVE MODELS OF BEHAVIORAL AND PSYCHOLOGICAL SYMPTOMS OF DEMENTIA
    Cho, Eunhee
    Kim, Sujin
    Heo, Seok-Jae
    Shin, Jinhee
    Ye, Byoung Seok
    Lee, Jun Hong
    Kang, Bada
    [J]. INNOVATION IN AGING, 2021, 5 : 645 - 645
  • [8] Machine Learning-Based Predictive Model for Mortality in Female Breast Cancer Patients Considering Lifestyle Factors
    Zhen, Meixin
    Chen, Haibing
    Lu, Qing
    Li, Hui
    Yan, Huang
    Wang, Ling
    [J]. CANCER MANAGEMENT AND RESEARCH, 2024, 16 : 1253 - 1265
  • [9] Machine Learning-Based Models Enhance the Prediction of Prostate Cancer
    Chen, Sunmeng
    Jian, Tengteng
    Chi, Changliang
    Liang, Yi
    Liang, Xiao
    Yu, Ying
    Jiang, Fengming
    Lu, Ji
    [J]. FRONTIERS IN ONCOLOGY, 2022, 12
  • [10] Diagnosis methods of breast cancer based on machine learning
    Liu, Jinwan
    Guo, Shuzhen
    Fei, Teng
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 3 - 3