Enhancing fairness in breast cancer recurrence prediction through temporal machine learning models

被引:0
|
作者
Sundus, Katrina I. [1 ]
Hammo, Bassam H. [1 ,2 ]
Al-Zoubi, Mohammad B. [1 ]
机构
[1] King Abdullah II School of Information Technology, The University of Jordan, Amman, Jordan
[2] King Hussein School of Computing Sciences, Princess Sumaya University for Technology, Amman, Jordan
关键词
Contrastive Learning - Diseases - Logistic regression - Lung cancer - Oncology - Prediction models;
D O I
10.1007/s00521-024-10407-8
中图分类号
学科分类号
摘要
Breast cancer recurrence prediction is a significant challenge in oncology. Advanced methodologies are required to improve prediction accuracy and clinical decision-making. This study presents a novel approach to breast cancer recurrence prediction by integrating machine learning techniques and a hybrid data mining methodology incorporating a temporal dimension into dataset derivation. Our research is based on the Jordan Breast Cancer Dataset (JBRCA), which includes over 44,000 cases spanning 15 years collected from the King Hussein Cancer Center’s registry database in Amman, Jordan. The proposed methodology encompasses data understanding, preparation, and model development stages. We use a thorough data preparation process involving multicollinearity feature selection, feature scaling, and strategic sampling to address dataset challenges. Moreover, we introduce a temporal-derived dataset strategy, dividing the data into four distinct time intervals to capture evolving characteristics and optimize model relevance. We employ diverse base classifiers and ensemble methods to enhance predictive performance in model development. We use evaluation metrics such as accuracy, recall, specificity, G-mean, and ROC-AUC to assess model efficacy across temporal intervals. Our experimental findings reveal significant impacts on classifier performance with temporal dataset derivation, with notable strengths observed in specific classifiers and temporal intervals. For instance, the Naive Bayes model demonstrates efficacy in identifying recurrence cases, while logistic regression exhibits robust performance in ROC-AUC and G-mean metrics. Our study contributes to breast cancer recurrence prediction by introducing a novel methodology that addresses dataset challenges and leverages temporal insights for enhanced predictive accuracy. The findings have a direct impact on clinical practice, providing valuable tools for early detection and improved therapy planning.
引用
收藏
页码:22697 / 22718
页数:21
相关论文
共 50 条
  • [21] Exploring the Best Machine Learning Models for Breast Cancer Prediction in Wisconsin
    Al Mamun, Abdullah
    Bhuiyan, Touhid
    Hassan, Md Maruf
    Anik, Shahedul Islam
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) : 1362 - 1368
  • [22] Temporal Machine Learning Analysis of Prior Mammograms for Breast Cancer Risk Prediction
    Li, Hui
    Robinson, Kayla
    Lan, Li
    Baughan, Natalie
    Chan, Chun-Wai
    Embury, Matthew
    Whitman, Gary J.
    El-Zein, Randa
    Bedrosian, Isabelle
    Giger, Maryellen L.
    CANCERS, 2023, 15 (07)
  • [23] Enhancing patient outcomes through machine learning: A study of lung cancer prediction
    Bajaj, Madhvan
    Rawat, Priyanshu
    Vats, Satvik
    Sharma, Vikrant
    Mehta, Shreshtha
    Sagar, B. B.
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2023, 44 (06): : 1075 - 1086
  • [24] Leveraging survival analysis and machine learning for accurate prediction of breast cancer recurrence and metastasis
    Noman, Shahd M.
    Fadel, Youssef M.
    Henedak, Mayar T.
    Attia, Nada A.
    Essam, Malak
    Elmaasarawii, Sarah
    Fouad, Fayrouz A.
    Eltasawi, Esraa G.
    Al-Atabany, Walid
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [25] Fairness in the prediction of acute postoperative pain using machine learning models
    Davoudi, Anis
    Sajdeya, Ruba
    Ison, Ron
    Hagen, Jennifer
    Rashidi, Parisa
    Price, Catherine C.
    Tighe, Patrick J.
    FRONTIERS IN DIGITAL HEALTH, 2023, 4
  • [26] Enhancing Multilevel Models Through Supervised Machine Learning
    Kilian, Pascal
    Kelava, Augustin
    QUANTITATIVE PSYCHOLOGY, IMPS 2023, 2024, 452 : 145 - 154
  • [27] Enhancing User Fairness in OFDMA Radio Access Networks Through Machine Learning
    Comsa, Ioan-Sorin
    Zhang, Sijing
    Aydin, Mehmet
    Kuonen, Pierre
    Trestian, Ramona
    Ghinea, Gheorghita
    2019 WIRELESS DAYS (WD), 2019,
  • [28] Osteoporosis, fracture and survival: Application of machine learning in breast cancer prediction models
    Ji, Lichen
    Zhang, Wei
    Zhong, Xugang
    Zhao, Tingxiao
    Sun, Xixi
    Zhu, Senbo
    Tong, Yu
    Luo, Junchao
    Xu, Youjia
    Yang, Di
    Kang, Yao
    Wang, Jin
    Bi, Qing
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [29] Enhancing skin toxicity predictions in breast cancer radiotherapy through integrated CT radiomics, dosiomics, and machine learning models
    Ren, Weiqiang
    Liu, Xiaoming
    JOURNAL OF RADIATION RESEARCH AND APPLIED SCIENCES, 2025, 18 (02)
  • [30] Machine Learning Models for the Prediction of Kidney Stone Composition and Recurrence
    Bargagli, Matteo
    Peischl, Stephan
    Vogt, Bruno
    Bruggmann, Remy
    Fuster, Daniel G.
    SWISS MEDICAL WEEKLY, 2023, 153 : 16S - 16S