Extracting Knowledge from Machine Learning Models to Diagnose Breast Cancer

被引:0
|
作者
Martinez-Ramirez, Jose Manuel [1 ]
Carmona, Cristobal [1 ,2 ,3 ]
Ramirez-Exposito, Maria Jesus [4 ]
Martinez-Martos, Jose Manuel [4 ]
机构
[1] Univ Jaen, Dept Comp Sci, E-23071 Jaen, Spain
[2] Univ Jaen, DASCI, Andalusian Res Inst Data Sci & Computat Intelligen, E-23071 Jaen, Spain
[3] DeMontfort Univ, Leicester Sch Pharm, Leicester LE1 7RH, England
[4] Univ Jaen, Dept Hlth Sci, Expt & Clin Physiopathol Res Grp CVI 1039, E-23071 Jaen, Spain
来源
LIFE-BASEL | 2025年 / 15卷 / 02期
关键词
breast cancer; serum biomarkers; explainable AI; oxytocin; early diagnosis; peptide hormones; IRAP; progesterone; REGULATING AMINOPEPTIDASE ACTIVITIES; OXYTOCIN RECEPTOR EXPRESSION; POST-MENOPAUSAL WOMEN; PROGESTERONE-RECEPTORS; MENDELIAN RANDOMIZATION; GENETIC EPIDEMIOLOGY; AT(4) RECEPTOR; TUMOR-GROWTH; IRON; RISK;
D O I
10.3390/life15020211
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study explored the application of explainable machine learning models to enhance breast cancer diagnosis using serum biomarkers, contrary to many studies that focus on medical images and demographic data. The primary objective was to develop models that are not only accurate but also provide insights into the factors driving predictions, addressing the need for trustworthy AI in healthcare. Several classification models were evaluated, including OneR, JRIP, the FURIA, J48, the ADTree, and the Random Forest, all of which are known for their explainability. The dataset included a variety of biomarkers, such as electrolytes, metal ions, marker proteins, enzymes, lipid profiles, peptide hormones, steroid hormones, and hormone receptors. The Random Forest model achieved the highest accuracy at 99.401%, followed closely by JRIP, the FURIA, and the ADTree at 98.802%. OneR and J48 achieved 98.204% accuracy. Notably, the models identified oxytocin as a key predictive biomarker, with most models featuring it in their rules. Other significant parameters included GnRH, beta-endorphin, vasopressin, IRAP, and APB, as well as factors like iron, cholinesterase, the total protein, progesterone, 5-nucleotidase, and the BMI, which are considered clinically relevant to breast cancer pathogenesis. This study discusses the roles of the identified parameters in cancer development, thus underscoring the potential of explainable machine learning models for enhancing early breast cancer diagnosis by focusing on explainability and the use of serum biomarkers.The combination of both can lead to improved early detection and personalized treatments, emphasizing the potential of these methods in clinical settings. The identified markers also provide additional research and therapeutic targets for breast cancer pathogenesis and a deep understanding of their interactions, advancing personalized approaches to breast cancer management.
引用
收藏
页数:29
相关论文
共 50 条
  • [41] MACHINE LEARNING TECHNIQUES TO DIAGNOSE BREAST-CANCER FROM IMAGE-PROCESSED NUCLEAR FEATURES OF FINE-NEEDLE ASPIRATES
    WOLBERG, WH
    STREET, WN
    MANGASARIAN, OL
    CANCER LETTERS, 1994, 77 (2-3) : 163 - 171
  • [42] Extracting Visual Knowledge from the Web with Multimodal Learning
    Gong, Dihong
    Wang, Daisy Zhe
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1718 - 1724
  • [43] Breast Cancer Classification Through Transfer Learning with Vision Transformer, PCA, and Machine Learning Models
    Gutierrez-Cardenas, Juan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (04) : 1027 - 1036
  • [44] Extracting Semantic Knowledge From GANs With Unsupervised Learning
    Xu, Jianjin
    Zhang, Zhaoxiang
    Hu, Xiaolin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9654 - 9668
  • [45] A Breast Cancer Diagnose Application Using Deep Learning Technology
    Hao, Qing
    Sang, Guankai
    Zhang, Wenqing
    SECOND IYSF ACADEMIC SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND COMPUTER ENGINEERING, 2021, 12079
  • [46] Machine learning models from computed tomography to diagnose thymic epithelial tumors requiring combined resection
    Onozato, Yuki
    Suzuki, Hidemi
    Matsumoto, Hiroki
    Ito, Takamasa
    Yamamoto, Takayoshi
    Tanaka, Kazuhisa
    Sakairi, Yuichi
    Matsui, Yukiko
    Iwata, Takekazu
    Iida, Tomohiko
    Iizasa, Toshihiko
    Yoshino, Ichiro
    JOURNAL OF THORACIC DISEASE, 2024, 16 (08)
  • [47] Machine Learning Models for Predicting Breast Cancer Risk in Women Exposed to Blue Light from Digital Screens
    Mortazavi S.A.R.
    Tahmasebi S.
    Parsaei H.
    Taleie A.
    Faraz M.
    Rezaianza-Deh A.
    Zamani A.
    Zamani A.
    Mortazavi S.M.J.
    Journal of Biomedical Physics and Engineering, 2022, 12 (06): : 637 - 644
  • [48] Extracting knowledge from association relationships to build navigational models
    Albert, M
    Pelechano, V
    Fons, J
    Rojas, G
    Pastor, O
    FIRST LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2003, : 2 - 10
  • [49] KE: A Knowledge Enhancing Framework for Machine Learning Models
    Wang, Yijue
    Shah, Nidhibahen
    Soliman, Ahmed
    Guo, Dan
    Rajasekaran, Sanguthevar
    JOURNAL OF PHYSICAL CHEMISTRY A, 2023, 127 (40): : 8437 - 8446
  • [50] Predicting chronic pain in postoperative breast cancer patients with multiple machine learning and deep learning models
    Wang, Ying
    Zhu, Yu
    Xue, Qiong
    Ji, Muhuo
    Tong, Jianhua
    Yang, Jian-Jun
    Zhou, Cheng-Mao
    JOURNAL OF CLINICAL ANESTHESIA, 2021, 74