Extracting Knowledge from Machine Learning Models to Diagnose Breast Cancer

被引:0
|
作者
Martinez-Ramirez, Jose Manuel [1 ]
Carmona, Cristobal [1 ,2 ,3 ]
Ramirez-Exposito, Maria Jesus [4 ]
Martinez-Martos, Jose Manuel [4 ]
机构
[1] Univ Jaen, Dept Comp Sci, E-23071 Jaen, Spain
[2] Univ Jaen, DASCI, Andalusian Res Inst Data Sci & Computat Intelligen, E-23071 Jaen, Spain
[3] DeMontfort Univ, Leicester Sch Pharm, Leicester LE1 7RH, England
[4] Univ Jaen, Dept Hlth Sci, Expt & Clin Physiopathol Res Grp CVI 1039, E-23071 Jaen, Spain
来源
LIFE-BASEL | 2025年 / 15卷 / 02期
关键词
breast cancer; serum biomarkers; explainable AI; oxytocin; early diagnosis; peptide hormones; IRAP; progesterone; REGULATING AMINOPEPTIDASE ACTIVITIES; OXYTOCIN RECEPTOR EXPRESSION; POST-MENOPAUSAL WOMEN; PROGESTERONE-RECEPTORS; MENDELIAN RANDOMIZATION; GENETIC EPIDEMIOLOGY; AT(4) RECEPTOR; TUMOR-GROWTH; IRON; RISK;
D O I
10.3390/life15020211
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study explored the application of explainable machine learning models to enhance breast cancer diagnosis using serum biomarkers, contrary to many studies that focus on medical images and demographic data. The primary objective was to develop models that are not only accurate but also provide insights into the factors driving predictions, addressing the need for trustworthy AI in healthcare. Several classification models were evaluated, including OneR, JRIP, the FURIA, J48, the ADTree, and the Random Forest, all of which are known for their explainability. The dataset included a variety of biomarkers, such as electrolytes, metal ions, marker proteins, enzymes, lipid profiles, peptide hormones, steroid hormones, and hormone receptors. The Random Forest model achieved the highest accuracy at 99.401%, followed closely by JRIP, the FURIA, and the ADTree at 98.802%. OneR and J48 achieved 98.204% accuracy. Notably, the models identified oxytocin as a key predictive biomarker, with most models featuring it in their rules. Other significant parameters included GnRH, beta-endorphin, vasopressin, IRAP, and APB, as well as factors like iron, cholinesterase, the total protein, progesterone, 5-nucleotidase, and the BMI, which are considered clinically relevant to breast cancer pathogenesis. This study discusses the roles of the identified parameters in cancer development, thus underscoring the potential of explainable machine learning models for enhancing early breast cancer diagnosis by focusing on explainability and the use of serum biomarkers.The combination of both can lead to improved early detection and personalized treatments, emphasizing the potential of these methods in clinical settings. The identified markers also provide additional research and therapeutic targets for breast cancer pathogenesis and a deep understanding of their interactions, advancing personalized approaches to breast cancer management.
引用
收藏
页数:29
相关论文
共 50 条
  • [21] An Efficient Breast Cancer Detection Using Machine Learning Classification Models
    Kumar, B. N. Ravi
    Gowda, Naveen Chandra
    Ambika, B. J.
    Veena, H. N.
    Ben Sujitha, B.
    Ramani, D. Roja
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2024, 20 (13) : 24 - 40
  • [22] Machine learning in breast cancer imaging: a review on data, models and methods
    Grinet, Macro S. V. M.
    Gouveia, Ana I. R.
    Gomes, Abel J. P.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2024, 11 (07):
  • [23] Survival analysis of breast cancer patients using machine learning models
    Evangeline, I. Keren
    Kirubha, S. P. Angeline
    Precious, J. Glory
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (20) : 30909 - 30928
  • [24] Exploring the Best Machine Learning Models for Breast Cancer Prediction in Wisconsin
    Al Mamun, Abdullah
    Bhuiyan, Touhid
    Hassan, Md Maruf
    Anik, Shahedul Islam
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) : 1362 - 1368
  • [25] Survival analysis of breast cancer patients using machine learning models
    Keren Evangeline I.
    S. P. Angeline Kirubha
    J. Glory Precious
    Multimedia Tools and Applications, 2023, 82 : 30909 - 30928
  • [26] On the Machine Learning Based Business Workflows Extracting Knowledge from Large Scale Graph Data
    Musaoglu, Mert
    Bekler, Merve
    Budak, Huseyin
    Akcelik, Celal
    Aktas, Mehmet S.
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART V, 2022, 13381 : 463 - 475
  • [27] Nanomaterial Synthesis Insights from Machine Learning of Scientific Articles by Extracting, Structuring, and Visualizing Knowledge
    Hiszpanski, Anna M.
    Gallagher, Brian
    Chellappan, Karthik
    Li, Peggy
    Liu, Shusen
    Kim, Hyojin
    Han, Jinkyu
    Kailkhura, Bhavya
    Buttler, David J.
    Han, Thomas Yong-Jin
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2020, 60 (06) : 2876 - 2887
  • [28] Extracting data models from background knowledge graphs
    Oliveira, Daniela
    d'Aquin, Mathieu
    KNOWLEDGE-BASED SYSTEMS, 2022, 237
  • [29] Interpretable Machine Learning Models to Predict the Resistance of Breast Cancer Patients to Doxorubicin from Their microRNA Profiles
    Ogunleye, Adeolu Z.
    Piyawajanusorn, Chayanit
    Goncalves, Anthony
    Ghislat, Ghita
    Ballester, Pedro J.
    ADVANCED SCIENCE, 2022, 9 (24)
  • [30] Advancing Breast Cancer Diagnosis through Breast Mass Images, Machine Learning, and Regression Models
    Zaylaa, Amira J.
    Kourtian, Sylva
    SENSORS, 2024, 24 (07)