Extracting Knowledge from Machine Learning Models to Diagnose Breast Cancer

Times cited: 0
Authors
Martinez-Ramirez, Jose Manuel [1]
Carmona, Cristobal [1, 2, 3]
Ramirez-Exposito, Maria Jesus [4]
Martinez-Martos, Jose Manuel [4]
Affiliations
[1] Univ Jaen, Dept Comp Sci, E-23071 Jaen, Spain
[2] Univ Jaen, DASCI, Andalusian Res Inst Data Sci & Computat Intelligen, E-23071 Jaen, Spain
[3] DeMontfort Univ, Leicester Sch Pharm, Leicester LE1 7RH, England
[4] Univ Jaen, Dept Hlth Sci, Expt & Clin Physiopathol Res Grp CVI 1039, E-23071 Jaen, Spain
Source
LIFE-BASEL | 2025 / Volume 15 / Issue 02
Keywords
breast cancer; serum biomarkers; explainable AI; oxytocin; early diagnosis; peptide hormones; IRAP; progesterone; REGULATING AMINOPEPTIDASE ACTIVITIES; OXYTOCIN RECEPTOR EXPRESSION; POST-MENOPAUSAL WOMEN; PROGESTERONE-RECEPTORS; MENDELIAN RANDOMIZATION; GENETIC EPIDEMIOLOGY; AT(4) RECEPTOR; TUMOR-GROWTH; IRON; RISK;
DOI
10.3390/life15020211
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07; 0710; 09;
Abstract
This study explored the application of explainable machine learning models to enhance breast cancer diagnosis using serum biomarkers, in contrast to many studies that focus on medical images and demographic data. The primary objective was to develop models that are not only accurate but also provide insights into the factors driving predictions, addressing the need for trustworthy AI in healthcare. Several classification models were evaluated, including OneR, JRIP, FURIA, J48, ADTree, and Random Forest, all of which are known for their explainability. The dataset included a variety of biomarkers, such as electrolytes, metal ions, marker proteins, enzymes, lipid profiles, peptide hormones, steroid hormones, and hormone receptors. The Random Forest model achieved the highest accuracy at 99.401%, followed closely by JRIP, FURIA, and ADTree at 98.802%. OneR and J48 achieved 98.204% accuracy. Notably, the models identified oxytocin as a key predictive biomarker, with most models featuring it in their rules. Other significant parameters included GnRH, beta-endorphin, vasopressin, IRAP, and APB, as well as factors like iron, cholinesterase, total protein, progesterone, 5-nucleotidase, and BMI, which are considered clinically relevant to breast cancer pathogenesis. This study discusses the roles of the identified parameters in cancer development, thus underscoring the potential of explainable machine learning models for enhancing early breast cancer diagnosis by focusing on explainability and the use of serum biomarkers. The combination of both can lead to improved early detection and personalized treatments, emphasizing the potential of these methods in clinical settings. The identified markers also provide additional research and therapeutic targets for breast cancer pathogenesis and a deep understanding of their interactions, advancing personalized approaches to breast cancer management.
Pages: 29
Related papers
50 in total
  • [31] Comparison of Fuzzy and Neural Network Models to Diagnose Breast Cancer
    Hameed, W. Abdul
    Bagavandas, M.
    CONTROL, COMPUTATION AND INFORMATION SYSTEMS, 2011, 140 : 241 - +
  • [32] Predicting gene signature in breast cancer patients with multiple machine learning models
    Zhu, Fangfang
    Xu, Dafang
    DISCOVER ONCOLOGY, 2024, 15 (01)
  • [33] Machine learning-based models for the prediction of breast cancer recurrence risk
    Duo Zuo
    Lexin Yang
    Yu Jin
    Huan Qi
    Yahui Liu
    Li Ren
    BMC Medical Informatics and Decision Making, 23
  • [34] Novel models by machine learning to predict prognosis of breast cancer brain metastases
    Chaofan Li
    Mengjie Liu
    Yinbin Zhang
    Yusheng Wang
    Jia Li
    Shiyu Sun
    Xuanyu Liu
    Huizi Wu
    Cong Feng
    Peizhuo Yao
    Yiwei Jia
    Yu Zhang
    Xinyu Wei
    Fei Wu
    Chong Du
    Xixi Zhao
    Shuqun Zhang
    Jingkun Qu
    Journal of Translational Medicine, 21
  • [35] Machine learning-based models for the prediction of breast cancer recurrence risk
    Zuo, Duo
    Yang, Lexin
    Jin, Yu
    Qi, Huan
    Liu, Yahui
    Ren, Li
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
  • [36] Novel models by machine learning to predict prognosis of breast cancer brain metastases
    Li, Chaofan
    Liu, Mengjie
    Zhang, Yinbin
    Wang, Yusheng
    Li, Jia
    Sun, Shiyu
    Liu, Xuanyu
    Wu, Huizi
    Feng, Cong
    Yao, Peizhuo
    Jia, Yiwei
    Zhang, Yu
    Wei, Xinyu
    Wu, Fei
    Du, Chong
    Zhao, Xixi
    Zhang, Shuqun
    Qu, Jingkun
    JOURNAL OF TRANSLATIONAL MEDICINE, 2023, 21 (01)
  • [37] Novel models based on machine learning to predict the prognosis of metaplastic breast cancer
    Zhang, Yinghui
    An, Wenxin
    Wang, Cong
    Liu, Xiaolei
    Zhang, Qihong
    Zhang, Yue
    Cheng, Shaoqiang
    BREAST, 2025, 79
  • [38] Osteoporosis, fracture and survival: Application of machine learning in breast cancer prediction models
    Ji, Lichen
    Zhang, Wei
    Zhong, Xugang
    Zhao, Tingxiao
    Sun, Xixi
    Zhu, Senbo
    Tong, Yu
    Luo, Junchao
    Xu, Youjia
    Yang, Di
    Kang, Yao
    Wang, Jin
    Bi, Qing
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [39] The Impact of Feature Selection on Different Machine Learning Models for Breast Cancer Classification
    Algherairy, Atheer
    Almattar, Wadha
    Bakri, Eman
    Albelali, Salma
    2022 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MACHINE LEARNING APPLICATIONS (CDMA 2022), 2022, : 91 - 96
  • [40] Improved Machine Learning-Based Predictive Models for Breast Cancer Diagnosis
    Rasool, Abdur
    Bunterngchit, Chayut
    Tiejian, Luo
    Islam, Md Ruhul
    Qu, Qiang
    Jiang, Qingshan
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (06)