Applications of Machine Learning Techniques to Predict Diagnostic Breast Cancer

被引:0
|
作者
Chaurasia V. [1 ]
Pal S. [1 ]
机构
[1] Department of Computer Applications, VBS Purvanchal University, Jaunpur
关键词
Classification; Ensemble; k-Nearest neighbors; Linear regression; Machine learning; Multilayer perceptron; Stack; Support vector machine;
D O I
10.1007/s42979-020-00296-8
中图分类号
学科分类号
摘要
This article compares six machine learning (ML) algorithms: Classification and Regression Tree (CART), Support Vector Machine (SVM), Naïve Bayes (NB), K-Nearest Neighbors (KNN), Linear Regression (LR) and Multilayer Perceptron (MLP) on the Wisconsin Diagnostic Breast Cancer (WDBC) dataset by estimating their classification test accuracy, standardized data accuracy and runtime analysis. The main objective of this study is to improve the accuracy of prediction using a new statistical method of feature selection. The data set has 32 features, which are reduced using statistical techniques (mode), and the same measurements as above are applied for comparative studies. In the reduced attribute data subset (12 features), we applied 6 integrated models AdaBoost (AB), Gradient Boosting Classifier (GBC), Random Forest (RF), Extra Tree (ET) Bagging and Extra Gradient Boost (XGB), to minimize the probability of misclassification based on any single induced model. We also apply the stacking classifier (Voting Classifier) ​​to basic learners: Logistic Regression (LR), Decision Tree (DT), Support-vector clustering (SVC), K-Nearest Neighbors (KNN), Random Forest (RF) and Naïve Bays (NB) to find out the accuracy obtained by voting classifier (Meta level). To implement the ML algorithm, the data set is divided in the following manner: 80% is used in the training phase and 20% is used in the test phase. To adjust the classifier, manually assigned hyper-parameters are used. At different stages of classification, all ML algorithms perform best, with test accuracy exceeding 90% especially when it is applied to a data subset. © 2020, Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 50 条
  • [1] Using Machine Learning Techniques to Predict Recurrent Breast Cancer in Taiwan
    Chen, Ying-Chen
    Chang, Chi-Chang
    [J]. 2021 30TH WIRELESS AND OPTICAL COMMUNICATIONS CONFERENCE (WOCC 2021), 2021, : 145 - 145
  • [2] A Comparative Study of Machine Learning Techniques to Predict Types of Breast Cancer Recurrence
    Chakkouch, Meryem
    Ertel, Merouane
    Mengad, Aziz
    Amali, Said
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (05) : 296 - 302
  • [3] Using Different Machine Learning Techniques to Predict Recurrent Breast Cancer at Young Age
    Wu, Yan-Kuen
    Lu, Yen-Chiao Angel
    Lu, Chi-Jie
    Sun, Chih Cheng
    [J]. Journal of Quality, 2022, 29 (03): : 196 - 207
  • [4] Machine learning (ML) techniques to predict breast cancer in imbalanced datasets: a systematic review
    Ghavidel, Arman
    Pazos, Pilar
    [J]. JOURNAL OF CANCER SURVIVORSHIP, 2023,
  • [5] Medical Diagnostic Models an Implementation of Machine Learning Techniques for Diagnosis in Breast Cancer Patients
    Borah, Rupam
    Dhimal, Sunil
    Sharma, Kalpana
    [J]. ADVANCED COMPUTATIONAL AND COMMUNICATION PARADIGMS, VOL 1, 2018, 475 : 395 - 405
  • [6] The Classification of Breast Cancer with Machine Learning Techniques
    Kolay, Nurdan
    Erdogmus, Pakize
    [J]. 2016 ELECTRIC ELECTRONICS, COMPUTER SCIENCE, BIOMEDICAL ENGINEERINGS' MEETING (EBBT), 2016,
  • [7] Machine Learning Techniques for Classification of Breast Cancer
    Osmanovic, Ahmed
    Halilovic, Sabina
    Ilah, Layla Abdel
    Fojnica, Adnan
    Gromilic, Zehra
    [J]. WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING 2018, VOL 1, 2019, 68 (01): : 197 - 200
  • [8] Machine Learning Techniques for Breast Cancer Detection
    Hall, Karl
    Chang, Victor
    Mitchell, Paul
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON COMPLEXITY, FUTURE INFORMATION SYSTEMS AND RISK (COMPLEXIS), 2022, : 116 - 122
  • [9] A Diagnostic Model of Breast Cancer Based on Digital Mammogram Images Using Machine Learning Techniques
    Al-Fahaidy, Farouk A. K.
    Al-Fuhaidi, Belal
    AL-Darouby, Ishaq
    AL-Abady, Faheem
    AL-Qadry, Mohammed
    AL-Gamal, Abdurhman
    [J]. APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2022, 2022
  • [10] Machine Learning Techniques and Breast Cancer Prediction: A Review
    Kaur, Gagandeep
    Gupta, Ruchika
    Hooda, Nistha
    Gupta, Nidhi Rani
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2022, 125 (03) : 2537 - 2564