Enhancing fairness in breast cancer recurrence prediction through temporal machine learning models

被引:0
|
作者
Sundus, Katrina I. [1 ]
Hammo, Bassam H. [1 ,2 ]
Al-Zoubi, Mohammad B. [1 ]
机构
[1] King Abdullah II School of Information Technology, The University of Jordan, Amman, Jordan
[2] King Hussein School of Computing Sciences, Princess Sumaya University for Technology, Amman, Jordan
关键词
Contrastive Learning - Diseases - Logistic regression - Lung cancer - Oncology - Prediction models;
D O I
10.1007/s00521-024-10407-8
中图分类号
学科分类号
摘要
Breast cancer recurrence prediction is a significant challenge in oncology. Advanced methodologies are required to improve prediction accuracy and clinical decision-making. This study presents a novel approach to breast cancer recurrence prediction by integrating machine learning techniques and a hybrid data mining methodology incorporating a temporal dimension into dataset derivation. Our research is based on the Jordan Breast Cancer Dataset (JBRCA), which includes over 44,000 cases spanning 15 years collected from the King Hussein Cancer Center’s registry database in Amman, Jordan. The proposed methodology encompasses data understanding, preparation, and model development stages. We use a thorough data preparation process involving multicollinearity feature selection, feature scaling, and strategic sampling to address dataset challenges. Moreover, we introduce a temporal-derived dataset strategy, dividing the data into four distinct time intervals to capture evolving characteristics and optimize model relevance. We employ diverse base classifiers and ensemble methods to enhance predictive performance in model development. We use evaluation metrics such as accuracy, recall, specificity, G-mean, and ROC-AUC to assess model efficacy across temporal intervals. Our experimental findings reveal significant impacts on classifier performance with temporal dataset derivation, with notable strengths observed in specific classifiers and temporal intervals. For instance, the Naive Bayes model demonstrates efficacy in identifying recurrence cases, while logistic regression exhibits robust performance in ROC-AUC and G-mean metrics. Our study contributes to breast cancer recurrence prediction by introducing a novel methodology that addresses dataset challenges and leverages temporal insights for enhanced predictive accuracy. The findings have a direct impact on clinical practice, providing valuable tools for early detection and improved therapy planning.
引用
收藏
页码:22697 / 22718
页数:21
相关论文
共 50 条
  • [41] Breast Cancer Classification Through Transfer Learning with Vision Transformer, PCA, and Machine Learning Models
    Gutierrez-Cardenas, Juan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (04) : 1027 - 1036
  • [42] Breast Cancer Prediction Using Soft Voting Classifier Based on Machine Learning Models
    Hashim, Mohammed S.
    Yassin, Ali A.
    IAENG International Journal of Computer Science, 2023, 50 (02)
  • [43] Prognostic prediction of breast cancer patients using machine learning models: a retrospective analysis
    Song, Xuchun
    Chu, Jiebin
    Guo, Zijie
    Wei, Qun
    Wang, Qingchuan
    Hu, Wenxian
    Wang, Linbo
    Zhao, Wenhe
    Zheng, Heming
    Lu, Xudong
    Zhou, Jichun
    GLAND SURGERY, 2024, 13 (09) : 1575 - 1587
  • [44] Prediction System for Prostate Cancer Recurrence Using Machine Learning
    Lee, Sun Jung
    Yu, Sung Hye
    Kim, Yejin
    Kim, Jae Kwon
    Hong, Jun Hyuk
    Kim, Choung-Soo
    Seo, Seong Il
    Byun, Seok-Soo
    Jeong, Chang Wook
    Lee, Ji Youl
    Choi, In Young
    APPLIED SCIENCES-BASEL, 2020, 10 (04):
  • [45] Enhancing Coarse-Grained Models through Machine Learning
    Karmakar, Tarak
    Soares, Thereza A.
    Merz Jr, Kenneth M.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (08) : 2931 - 2932
  • [46] Explanation and reliability of prediction models: the case of breast cancer recurrence
    Erik Štrumbelj
    Zoran Bosnić
    Igor Kononenko
    Branko Zakotnik
    Cvetka Grašič Kuhar
    Knowledge and Information Systems, 2010, 24 : 305 - 324
  • [47] Explanation and reliability of prediction models: the case of breast cancer recurrence
    Strumbelj, Erik
    Bosnic, Zoran
    Kononenko, Igor
    Zakotnik, Branko
    Kuhar, Cvetka Grasic
    KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 24 (02) : 305 - 324
  • [48] Predicting the recurrence of breast cancer using machine learning algorithms
    Amal Alzu’bi
    Hassan Najadat
    Wesam Doulat
    Osama Al-Shari
    Leming Zhou
    Multimedia Tools and Applications, 2021, 80 : 13787 - 13800
  • [49] Enhancing metastatic colorectal cancer prediction through advanced feature selection and machine learning techniques
    Yang, Hui
    Liu, Jun
    Yang, Na
    Fu, Qingsheng
    Wang, Yingying
    Ye, Mingquan
    Tao, Shaoneng
    Liu, Xiaocen
    Li, Qingqing
    INTERNATIONAL IMMUNOPHARMACOLOGY, 2024, 142
  • [50] Predicting the recurrence of breast cancer using machine learning algorithms
    Alzu'bi, Amal
    Najadat, Hassan
    Doulat, Wesam
    Al-Shari, Osama
    Zhou, Leming
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (09) : 13787 - 13800