Cancer Metastasis Prediction and Genomic Biomarker Identification through Machine Learning and eXplainable Artificial Intelligence in Breast Cancer Research

Cited by: 15
Authors
Yagin, Burak [1]
Yagin, Fatma Hilal [1]
Colak, Cemil [1]
Inceoglu, Feyza [2]
Kadry, Seifedine [3, 4, 5]
Kim, Jungeun [6]
Affiliations
[1] Inonu Univ, Fac Med, Dept Biostat & Med Informat, TR-44280 Malatya, Turkiye
[2] Malatya Turgut Ozal Univ, Fac Med, Dept Biostat, TR-44090 Malatya, Turkiye
[3] Noroff Univ Coll, Dept Appl Data Sci, N-4612 Kristiansand, Norway
[4] Ajman Univ, Artificial Intelligence Res Ctr AIRC, Ajman 346, U Arab Emirates
[5] Lebanese Amer Univ, Dept Elect & Comp Engn, Byblos 36, Lebanon
[6] Kongju Natl Univ, Dept Software, Cheonan 31080, South Korea
Keywords
breast cancer metastasis; machine learning algorithms; genomic biomarkers; eXplainable artificial intelligence; SHAP; EXPRESSION; ASSOCIATION; PROGNOSIS;
DOI
10.3390/diagnostics13213314
Chinese Library Classification number
R5 [Internal Medicine];
Discipline classification code
1002; 100201;
Abstract
Aim: This research presents a model combining machine learning (ML) techniques and eXplainable artificial intelligence (XAI) to predict breast cancer (BC) metastasis and identify important genomic biomarkers in patients with metastasis.
Method: A total of 98 primary BC samples were analyzed, comprising 34 samples from patients who developed distant metastases within a 5-year follow-up period and 44 samples from patients who remained disease-free for at least 5 years after diagnosis. Genomic data were subjected to biostatistical analysis, followed by elastic net feature selection, which identified a restricted number of genomic biomarkers associated with BC metastasis. Light gradient boosting machine (LightGBM), categorical boosting (CatBoost), extreme gradient boosting (XGBoost), gradient boosting trees (GBT), and adaptive boosting (AdaBoost) algorithms were used for prediction. To assess the models' predictive abilities, accuracy, F1 score, precision, recall, area under the ROC curve (AUC), and Brier score were calculated as performance evaluation metrics. To promote interpretability and overcome the "black box" problem of ML models, the SHapley Additive exPlanations (SHAP) method was employed.
Results: The LightGBM model outperformed the other models, achieving an accuracy of 96% and an AUC of 99.3%. In addition to the biostatistical evaluation, the XAI-based SHAP results showed that increased expression levels of TSPYL5, ATP5E, CA9, NUP210, SLC37A1, ARIH1, PSMD7, UBQLN1, PRAME, and UBE2T (p ≤ 0.05) were associated with an increased incidence of BC metastasis. Decreased expression levels of the CACTIN, TGFB3, SCUBE2, ARL4D, OR1F1, ALDH4A1, PHF1, and CROCC genes (p ≤ 0.05) were also found to increase the risk of metastasis in BC.
Conclusion: The findings of this study may help prevent disease progression and metastases and may improve clinical outcomes by informing customized treatment approaches for BC patients.
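
The evaluation metrics named in the abstract (accuracy, precision, recall, F1 score, AUC, and Brier score) can be illustrated with a minimal, standard-library-only sketch. This is not the authors' code: the function names are hypothetical, the 0.5 decision threshold is an assumption, and the labels and probabilities are made-up toy values rather than data from the study.

```python
from typing import Dict, Sequence


def classification_metrics(
    y_true: Sequence[int], y_prob: Sequence[float], threshold: float = 0.5
) -> Dict[str, float]:
    """Threshold-based metrics plus the Brier score for binary labels (1 = metastasis)."""
    tp = fp = tn = fn = 0
    for y, p in zip(y_true, y_prob):
        pred = 1 if p >= threshold else 0  # assumed decision rule
        if pred == 1 and y == 1:
            tp += 1
        elif pred == 1 and y == 0:
            fp += 1
        elif pred == 0 and y == 0:
            tn += 1
        else:
            fn += 1
    n = len(y_true)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    # Brier score: mean squared gap between predicted probability and outcome.
    brier = sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / n
    return {
        "accuracy": (tp + tn) / n,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "brier": brier,
    }


def roc_auc(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """AUC as the chance that a random positive outranks a random negative (ties count 0.5)."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum(
        1.0 if pp > pn else 0.5 if pp == pn else 0.0 for pp in pos for pn in neg
    )
    return wins / (len(pos) * len(neg))


# Toy example (illustrative values only, not from the paper).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_prob = [0.9, 0.8, 0.4, 0.3, 0.2, 0.6, 0.1, 0.7]
metrics = classification_metrics(y_true, y_prob)
metrics["auc"] = roc_auc(y_true, y_prob)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Accuracy, precision, recall, and F1 depend on the chosen threshold, whereas AUC and the Brier score are computed from the predicted probabilities directly, which is one reason to report them together.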
Pages: 12
Related papers
50 in total
  • [41] Modeling and Predictive Analytics of Breast Cancer Using Ensemble Learning Techniques: An Explainable Artificial Intelligence Approach
    Raha, Avi Deb
    Dihan, Fatema Jannat
    Gain, Mrityunjoy
    Murad, Saydul Akbar
    Adhikary, Apurba
    Hossain, Md. Bipul
    Hassan, Md. Mehedi
    Al-Shehari, Taher
    Alsadhan, Nasser A.
    Kadrie, Mohammed
    Bairagi, Anupam Kumar
    Computers, Materials and Continua, 2024, 81 (03): : 4033 - 4048
  • [42] Deep learning based computer aided diagnosis (CAD) tool supported by explainable artificial intelligence for breast cancer exploration
    Marwa Naas
    Hiba Mzoughi
    Ines Njeh
    Mohamed Ben Slima
    Applied Intelligence, 2025, 55 (7)
  • [43] The artificial intelligence and machine learning in lung cancer immunotherapy
    Gao, Qing
    Yang, Luyu
    Lu, Mingjun
    Jin, Renjing
    Ye, Huan
    Ma, Teng
    JOURNAL OF HEMATOLOGY & ONCOLOGY, 2023, 16 (01)
  • [44] Artificial Intelligence and Machine Learning in Lung Cancer Screening
    Adams, Scott J.
    Mikhael, Peter
    Wohlwend, Jeremy
    Barzilay, Regina
    Sequist, Lecia V.
    Fintelmann, Florian J.
    THORACIC SURGERY CLINICS, 2023, 33 (04) : 401 - 409
  • [45] The artificial intelligence and machine learning in lung cancer immunotherapy
    Qing Gao
    Luyu Yang
    Mingjun Lu
    Renjing Jin
    Huan Ye
    Teng Ma
    Journal of Hematology & Oncology, 16
  • [46] Artificial intelligence, machine learning, and drug repurposing in cancer
    Tanoli, Ziaurrehman
    Vaha-Koskela, Markus
    Aittokallio, Tero
    EXPERT OPINION ON DRUG DISCOVERY, 2021, 16 (09) : 977 - 989
  • [47] Artificial intelligence and machine learning in cancer diagnosis and treatment
    Luethy, Isabel A.
    MEDICINA-BUENOS AIRES, 2022, 82 (05) : 798 - 800
  • [48] Reply to "Harnessing machine learning to predict colorectal cancer metastasis: A promising artificial intelligence frontier"
    Guo, Zhentian
    Zhang, Zongming
    EJSO, 2024, 50 (11):
  • [49] Efficient breast cancer detection using neural networks and explainable artificial intelligence
    Tamilarasi Kathirvel Murugan
    Pritikaa Karthikeyan
    Pavithra Sekar
    Neural Computing and Applications, 2025, 37 (5) : 3759 - 3776
  • [50] Explainable Artificial Intelligence in Quantifying Breast Cancer Factors: Saudi Arabia Context
    Alelyani, Turki
    Alshammari, Maha M.
    Almuhanna, Afnan
    Asan, Onur
    HEALTHCARE, 2024, 12 (10)