Machine learning models in breast cancer survival prediction

被引:137
|
作者
Montazeri, Mitra [1 ,2 ]
Montazeri, Mohadeseh [3 ,4 ]
Montazeri, Mahdieh [5 ]
Beigzadeh, Amin [6 ]
机构
[1] Kerman Univ Med Sci, Inst Futures Studies Hlth, Med Informat Res Ctr, Kerman, Iran
[2] Shahid Bahonar Univ, Comp Engn Dept, Kerman, Iran
[3] Kerman Univ Med Sci, Inst Futures Studies Hlth, Social Determinants Hlth Res Ctr, Kerman, Iran
[4] Tech & Vocat Univ, Dept Comp, Kerman, Iran
[5] Kerman Univ Med Sci, Inst Futures Studies Hlth, Res Ctr Modeling Hlth, Kerman, Iran
[6] Kerman Univ Med Sci, Inst Futures Studies Hlth, Hlth Serv Management Res Ctr, Kerman, Iran
关键词
Breast cancer survival prediction; classification; machine learning models; SUPPORT VECTOR MACHINES; NEURAL-NETWORKS; HYBRID;
D O I
10.3233/THC-151071
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
BACKGROUND: Breast cancer is one of the most common cancers with a high mortality rate among women. With the early diagnosis of breast cancer survival will increase from 56% to more than 86%. Therefore, an accurate and reliable system is necessary for the early diagnosis of this cancer. The proposed model is the combination of rules and different machine learning techniques. Machine learning models can help physicians to reduce the number of false decisions. They try to exploit patterns and relationships among a large number of cases and predict the outcome of a disease using historical cases stored in datasets. OBJECTIVE: The objective of this study is to propose a rule-based classification method with machine learning techniques for the prediction of different types of Breast cancer survival. METHODS: We use a dataset with eight attributes that include the records of 900 patients in which 876 patients (97.3%) and 24 (2.7%) patients were females and males respectively. Naive Bayes (NB), Trees Random Forest (TRF), 1-Nearest Neighbor (1NN), AdaBoost (AD), Support Vector Machine (SVM), RBF Network (RBFN), and Multilayer Perceptron (MLP) machine learning techniques with 10-cross fold technique were used with the proposed model for the prediction of breast cancer survival. The performance of machine learning techniques were evaluated with accuracy, precision, sensitivity, specificity, and area under ROC curve. RESULTS: Out of 900 patients, 803 patients and 97 patients were alive and dead, respectively. In this study, Trees Random Forest (TRF) technique showed better results in comparison to other techniques (NB, 1NN, AD, SVM and RBFN, MLP). The accuracy, sensitivity and the area under ROC curve of TRF are 96%, 96%, 93%, respectively. However, 1NN machine learning technique provided poor performance (accuracy 91%, sensitivity 91% and area under ROC curve 78%). CONCLUSIONS: This study demonstrates that Trees Random Forest model (TRF) which is a rule-based classification model was the best model with the highest level of accuracy. Therefore, this model is recommended as a useful tool for breast cancer survival prediction as well as medical decision making.
引用
收藏
页码:31 / 42
页数:12
相关论文
共 50 条
  • [1] Osteoporosis, fracture and survival: Application of machine learning in breast cancer prediction models
    Ji, Lichen
    Zhang, Wei
    Zhong, Xugang
    Zhao, Tingxiao
    Sun, Xixi
    Zhu, Senbo
    Tong, Yu
    Luo, Junchao
    Xu, Youjia
    Yang, Di
    Kang, Yao
    Wang, Jin
    Bi, Qing
    [J]. FRONTIERS IN ONCOLOGY, 2022, 12
  • [2] Breast Cancer Prediction using Machine Learning Models
    Iparraguirre-Villanueva, Orlando
    Epifania-Huerta, Andres
    Torres-Ceclen, Carmen
    Ruiz-Alvarado, John
    Cabanillas-Carbonell, Michael
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (02) : 610 - 620
  • [3] Machine Learning Techniques for Survival Time Prediction in Breast Cancer
    Mihaylov, Iliyan
    Nisheva, Maria
    Vassilev, Dimitar
    [J]. ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, AIMSA 2018, 2018, 11089 : 186 - 194
  • [4] A comparison of machine learning techniques for survival prediction in breast cancer
    Leonardo Vanneschi
    Antonella Farinaccio
    Giancarlo Mauri
    Marco Antoniotti
    Paolo Provero
    Mario Giacobini
    [J]. BioData Mining, 4
  • [5] A comparison of machine learning techniques for survival prediction in breast cancer
    Vanneschi, Leonardo
    Farinaccio, Antonella
    Mauri, Giancarlo
    Antoniotti, Mauro
    Provero, Paolo
    Giacobini, Mario
    [J]. BIODATA MINING, 2011, 4
  • [6] Prediction Model of Breast Cancer Survival Months: A Machine Learning Approach
    Naser, Mohammad Y. M.
    Chambers, Destini
    Bhattacharya, Sylvia
    [J]. SOUTHEASTCON 2023, 2023, : 851 - 855
  • [7] Application of Machine Learning Models for Survival Prognosis in Breast Cancer Studies
    Mihaylov, Iliyan
    Nisheva, Maria
    Vassilev, Dimitar
    [J]. INFORMATION, 2019, 10 (03)
  • [8] Survival analysis of breast cancer patients using machine learning models
    Evangeline, I. Keren
    Kirubha, S. P. Angeline
    Precious, J. Glory
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (20) : 30909 - 30928
  • [9] Survival analysis of breast cancer patients using machine learning models
    Keren Evangeline I.
    S. P. Angeline Kirubha
    J. Glory Precious
    [J]. Multimedia Tools and Applications, 2023, 82 : 30909 - 30928
  • [10] Analysis of breast cancer prediction and visualisation using machine learning models
    Magesh, G.
    Swarnalatha, P.
    [J]. International Journal of Cloud Computing, 2022, 11 (01) : 43 - 60