Cancer Metastasis Prediction and Genomic Biomarker Identification through Machine Learning and eXplainable Artificial Intelligence in Breast Cancer Research

被引:15
|
作者
Yagin, Burak [1 ]
Yagin, Fatma Hilal [1 ]
Colak, Cemil [1 ]
Inceoglu, Feyza [2 ]
Kadry, Seifedine [3 ,4 ,5 ]
Kim, Jungeun [6 ]
机构
[1] Inonu Univ, Fac Med, Dept Biostat & Med Informat, TR-44280 Malatya, Turkiye
[2] Malatya Turgut Ozal Univ, Fac Med, Dept Biostat, TR-44090 Malatya, Turkiye
[3] Noroff Univ Coll, Dept Appl Data Sci, N-4612 Kristiansand, Norway
[4] Ajman Univ, Artificial Intelligence Res Ctr AIRC, Ajman 346, U Arab Emirates
[5] Lebanese Amer Univ, Dept Elect & Comp Engn, Byblos 36, Lebanon
[6] Kongju Natl Univ, Dept Software, Cheonan 31080, South Korea
关键词
breast cancer metastasis; machine learning algorithms; genomic biomarkers; eXplainable artificial intelligence; SHAP; EXPRESSION; ASSOCIATION; PROGNOSIS;
D O I
10.3390/diagnostics13213314
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aim: Method: This research presents a model combining machine learning (ML) techniques and eXplainable artificial intelligence (XAI) to predict breast cancer (BC) metastasis and reveal important genomic biomarkers in metastasis patients. Method: A total of 98 primary BC samples was analyzed, comprising 34 samples from patients who developed distant metastases within a 5-year follow-up period and 44 samples from patients who remained disease-free for at least 5 years after diagnosis. Genomic data were then subjected to biostatistical analysis, followed by the application of the elastic net feature selection method. This technique identified a restricted number of genomic biomarkers associated with BC metastasis. A light gradient boosting machine (LightGBM), categorical boosting (CatBoost), Extreme Gradient Boosting (XGBoost), Gradient Boosting Trees (GBT), and Ada boosting (AdaBoost) algorithms were utilized for prediction. To assess the models' predictive abilities, the accuracy, F1 score, precision, recall, area under the ROC curve (AUC), and Brier score were calculated as performance evaluation metrics. To promote interpretability and overcome the "black box" problem of ML models, a SHapley Additive exPlanations (SHAP) method was employed. Results: The LightGBM model outperformed other models, yielding remarkable accuracy of 96% and an AUC of 99.3%. In addition to biostatistical evaluation, in XAI-based SHAP results, increased expression levels of TSPYL5, ATP5E, CA9, NUP210, SLC37A1, ARIH1, PSMD7, UBQLN1, PRAME, and UBE2T (p <= 0.05) were found to be associated with an increased incidence of BC metastasis. Finally, decreased levels of expression of CACTIN, TGFB3, SCUBE2, ARL4D, OR1F1, ALDH4A1, PHF1, and CROCC (p <= 0.05) genes were also determined to increase the risk of metastasis in BC. Conclusion: The findings of this study may prevent disease progression and metastases and potentially improve clinical outcomes by recommending customized treatment approaches for BC patients.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Artificial intelligence and machine learning in cancer imaging
    Koh, Dow-Mu
    Papanikolaou, Nickolas
    Bick, Ulrich
    Illing, Rowland
    Kahn, Charles E., Jr.
    Kalpathi-Cramer, Jayshree
    Matos, Celso
    Marti-Bonmati, Luis
    Miles, Anne
    Mun, Seong Ki
    Napel, Sandy
    Rockall, Andrea
    Sala, Evis
    Strickland, Nicola
    Prior, Fred
    COMMUNICATIONS MEDICINE, 2022, 2 (01):
  • [22] Artificial intelligence and machine learning in cancer imaging
    Dow-Mu Koh
    Nickolas Papanikolaou
    Ulrich Bick
    Rowland Illing
    Charles E. Kahn
    Jayshree Kalpathi-Cramer
    Celso Matos
    Luis Martí-Bonmatí
    Anne Miles
    Seong Ki Mun
    Sandy Napel
    Andrea Rockall
    Evis Sala
    Nicola Strickland
    Fred Prior
    Communications Medicine, 2
  • [23] Clinicomics-guided distant metastasis prediction in breast cancer via artificial intelligence
    Chao Zhang
    Lisha Qi
    Jun Cai
    Haixiao Wu
    Yao Xu
    Yile Lin
    Zhijun Li
    Vladimir P. Chekhonin
    Karl Peltzer
    Manqing Cao
    Zhuming Yin
    Xin Wang
    Wenjuan Ma
    BMC Cancer, 23
  • [24] Prediction of Breast Cancer Distant Metastasis by Artificial Intelligence Methods from an Epidemiological Perspective
    Akbulut, Sami
    Yagin, Fatma Hilal
    Colak, Cemil
    ISTANBUL MEDICAL JOURNAL, 2022, 23 (03): : 210 - 215
  • [25] Clinicomics-guided distant metastasis prediction in breast cancer via artificial intelligence
    Zhang, Chao
    Qi, Lisha
    Cai, Jun
    Wu, Haixiao
    Xu, Yao
    Lin, Yile
    Li, Zhijun
    Chekhonin, Vladimir P.
    Peltzer, Karl
    Cao, Manqing
    Yin, Zhuming
    Wang, Xin
    Ma, Wenjuan
    BMC CANCER, 2023, 23 (01)
  • [26] Harnessing machine learning to predict colorectal cancer metastasis: A promising artificial intelligence frontier
    Awais, Abdul Raffay
    Manzoor, Ibrahim
    Pakistan, Mustafa Mansoor
    EJSO, 2024, 50 (11):
  • [27] Past, Present, and Future of Machine Learning and Artificial Intelligence for Breast Cancer Screening
    Baughan, Natalie
    Douglas, Lindsay
    Giger, Maryellen L.
    JOURNAL OF BREAST IMAGING, 2022, 4 (05) : 451 - 459
  • [28] Explainable machine learning approach for cancer prediction through binarilization of RNA sequencing data
    Chen, Tianjie
    Kabir, Md Faisal
    PLOS ONE, 2024, 19 (05):
  • [29] A Roadmap towards Breast Cancer Therapies Supported by Explainable Artificial Intelligence
    Amoroso, Nicola
    Pomarico, Domenico
    Fanizzi, Annarita
    Didonna, Vittorio
    Giotta, Francesco
    La Forgia, Daniele
    Latorre, Agnese
    Monaco, Alfonso
    Pantaleo, Ester
    Petruzzellis, Nicole
    Tamborra, Pasquale
    Zito, Alfredo
    Lorusso, Vito
    Bellotti, Roberto
    Massafra, Raffaella
    APPLIED SCIENCES-BASEL, 2021, 11 (11):
  • [30] Identification of biological markers in cancer disease using explainable artificial intelligence
    Shahzad, Muhammad
    Lohana, Ruhal
    Aurangzeb, Khursheed
    Ali, Isbah Imtiaz
    Anwar, Muhammad Shahid
    Murtaza, Mahnoor
    Malick, Rauf Ahmed Shams
    Allayarov, Piratdin
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (02)