Enhancing fairness in breast cancer recurrence prediction through temporal machine learning models

Cited by: 0
Authors
Sundus, Katrina I. [1]
Hammo, Bassam H. [1, 2]
Al-Zoubi, Mohammad B. [1]
Affiliations
[1] King Abdullah II School of Information Technology, The University of Jordan, Amman, Jordan
[2] King Hussein School of Computing Sciences, Princess Sumaya University for Technology, Amman, Jordan
Keywords
Contrastive Learning - Diseases - Logistic regression - Lung cancer - Oncology - Prediction models
DOI
10.1007/s00521-024-10407-8
Abstract
Breast cancer recurrence prediction is a significant challenge in oncology. Advanced methodologies are required to improve prediction accuracy and clinical decision-making. This study presents a novel approach to breast cancer recurrence prediction by integrating machine learning techniques and a hybrid data mining methodology incorporating a temporal dimension into dataset derivation. Our research is based on the Jordan Breast Cancer Dataset (JBRCA), which includes over 44,000 cases spanning 15 years collected from the King Hussein Cancer Center’s registry database in Amman, Jordan. The proposed methodology encompasses data understanding, preparation, and model development stages. We use a thorough data preparation process involving multicollinearity feature selection, feature scaling, and strategic sampling to address dataset challenges. Moreover, we introduce a temporal-derived dataset strategy, dividing the data into four distinct time intervals to capture evolving characteristics and optimize model relevance. We employ diverse base classifiers and ensemble methods to enhance predictive performance in model development. We use evaluation metrics such as accuracy, recall, specificity, G-mean, and ROC-AUC to assess model efficacy across temporal intervals. Our experimental findings reveal significant impacts on classifier performance with temporal dataset derivation, with notable strengths observed in specific classifiers and temporal intervals. For instance, the Naive Bayes model demonstrates efficacy in identifying recurrence cases, while logistic regression exhibits robust performance in ROC-AUC and G-mean metrics. Our study contributes to breast cancer recurrence prediction by introducing a novel methodology that addresses dataset challenges and leverages temporal insights for enhanced predictive accuracy. The findings have a direct impact on clinical practice, providing valuable tools for early detection and improved therapy planning.
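The abstract names recall, specificity, and G-mean as evaluation metrics and describes dividing the data into four time intervals. The following is a minimal sketch of how such per-interval metrics could be computed, not the authors' implementation. The function names, the equal-width interval rule, and the example years and labels are all assumptions made for illustration; the record does not state how the four intervals were actually defined. G-mean is taken here in its usual sense, the square root of recall times specificity.

```python
import math


def confusion_counts(y_true, y_pred):
    """Return (tp, fn, tn, fp) for binary labels, where 1 = recurrence."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    return tp, fn, tn, fp


def recall_specificity_gmean(y_true, y_pred):
    """Compute recall, specificity, and their geometric mean (G-mean)."""
    tp, fn, tn, fp = confusion_counts(y_true, y_pred)
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return recall, specificity, math.sqrt(recall * specificity)


def interval_index(year, start_year, end_year, n_intervals=4):
    """Map a year to one of n equal-width intervals.

    Hypothetical rule: the source does not say how its intervals are cut.
    """
    span = end_year - start_year + 1
    width = math.ceil(span / n_intervals)
    return min((year - start_year) // width, n_intervals - 1)


# Made-up labels and predictions for a single interval.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 1]
recall, specificity, gmean = recall_specificity_gmean(y_true, y_pred)

# Made-up 15-year range, used only to exercise the interval rule.
buckets = [interval_index(y, 2006, 2020) for y in (2006, 2010, 2014, 2020)]
```

G-mean is low whenever either recall or specificity is low, which is why it is often preferred over plain accuracy when recurrence cases are much rarer than non-recurrence cases.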
Pages: 22697-22718
Page count: 21