Extracting Knowledge from Machine Learning Models to Diagnose Breast Cancer

被引:0
|
作者
Martinez-Ramirez, Jose Manuel [1 ]
Carmona, Cristobal [1 ,2 ,3 ]
Ramirez-Exposito, Maria Jesus [4 ]
Martinez-Martos, Jose Manuel [4 ]
机构
[1] Univ Jaen, Dept Comp Sci, E-23071 Jaen, Spain
[2] Univ Jaen, DASCI, Andalusian Res Inst Data Sci & Computat Intelligen, E-23071 Jaen, Spain
[3] DeMontfort Univ, Leicester Sch Pharm, Leicester LE1 7RH, England
[4] Univ Jaen, Dept Hlth Sci, Expt & Clin Physiopathol Res Grp CVI 1039, E-23071 Jaen, Spain
来源
LIFE-BASEL | 2025年 / 15卷 / 02期
关键词
breast cancer; serum biomarkers; explainable AI; oxytocin; early diagnosis; peptide hormones; IRAP; progesterone; REGULATING AMINOPEPTIDASE ACTIVITIES; OXYTOCIN RECEPTOR EXPRESSION; POST-MENOPAUSAL WOMEN; PROGESTERONE-RECEPTORS; MENDELIAN RANDOMIZATION; GENETIC EPIDEMIOLOGY; AT(4) RECEPTOR; TUMOR-GROWTH; IRON; RISK;
D O I
10.3390/life15020211
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study explored the application of explainable machine learning models to enhance breast cancer diagnosis using serum biomarkers, contrary to many studies that focus on medical images and demographic data. The primary objective was to develop models that are not only accurate but also provide insights into the factors driving predictions, addressing the need for trustworthy AI in healthcare. Several classification models were evaluated, including OneR, JRIP, the FURIA, J48, the ADTree, and the Random Forest, all of which are known for their explainability. The dataset included a variety of biomarkers, such as electrolytes, metal ions, marker proteins, enzymes, lipid profiles, peptide hormones, steroid hormones, and hormone receptors. The Random Forest model achieved the highest accuracy at 99.401%, followed closely by JRIP, the FURIA, and the ADTree at 98.802%. OneR and J48 achieved 98.204% accuracy. Notably, the models identified oxytocin as a key predictive biomarker, with most models featuring it in their rules. Other significant parameters included GnRH, beta-endorphin, vasopressin, IRAP, and APB, as well as factors like iron, cholinesterase, the total protein, progesterone, 5-nucleotidase, and the BMI, which are considered clinically relevant to breast cancer pathogenesis. This study discusses the roles of the identified parameters in cancer development, thus underscoring the potential of explainable machine learning models for enhancing early breast cancer diagnosis by focusing on explainability and the use of serum biomarkers.The combination of both can lead to improved early detection and personalized treatments, emphasizing the potential of these methods in clinical settings. The identified markers also provide additional research and therapeutic targets for breast cancer pathogenesis and a deep understanding of their interactions, advancing personalized approaches to breast cancer management.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] Machine Learning to Diagnose Breast Cancer
    Ismaeel, Ghassan Ahmad
    PRZEGLAD ELEKTROTECHNICZNY, 2023, 99 (01): : 10 - 12
  • [2] MACHINE LEARNING BASE METHODS FOR BREAST CANCER DIAGNOSE
    Deng Yang
    Yang Yujun
    Qiu Laixiang
    Zhou Wang
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [3] Comparison of Deep Learning and Machine Learning Algorithms to Diagnose and Predict Breast Cancer
    Ghazalnaz Sharifonnasabi, F.
    Makhdoom, Iman
    Lecture Notes in Networks and Systems, 2024, 839 : 31 - 43
  • [4] Use of machine learning to diagnose breast cancer from raw electrical impedance tomography data
    Korjenevsky, A. V.
    BIOMEDICAL ENGINEERING, 2024, 58 (03) : 208 - 212
  • [5] Research on Logistic Regression Algorithm of Breast Cancer Diagnose Data by Machine Learning
    Lei, Liu
    2018 INTERNATIONAL CONFERENCE ON ROBOTS & INTELLIGENT SYSTEM (ICRIS 2018), 2018, : 157 - 160
  • [6] Automated Breast Cancer Detection using Machine Learning Techniques by Extracting Different Feature Extracting Strategies
    Hussain, Lal
    Aziz, Wajid
    Saeed, Sharjil
    Rathore, Saima
    Rafique, Muhammad
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (IEEE TRUSTCOM) / 12TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (IEEE BIGDATASE), 2018, : 327 - 331
  • [7] Ensemble Machine Learning Models for Breast Cancer Identification
    Dritsas, Elias
    Trigka, Maria
    Mylonas, Phivos
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS. AIAI 2023 IFIP WG 12.5 INTERNATIONAL WORKSHOPS, 2023, 677 : 303 - 311
  • [8] Application of Machine Learning Models to the Detection of Breast Cancer
    Binsaif, Nasser
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [9] Breast Cancer Prediction using Machine Learning Models
    Iparraguirre-Villanueva, Orlando
    Epifania-Huerta, Andres
    Torres-Ceclen, Carmen
    Ruiz-Alvarado, John
    Cabanillas-Carbonell, Michael
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (02) : 610 - 620
  • [10] Machine learning models in breast cancer survival prediction
    Montazeri, Mitra
    Montazeri, Mohadeseh
    Montazeri, Mahdieh
    Beigzadeh, Amin
    TECHNOLOGY AND HEALTH CARE, 2016, 24 (01) : 31 - 42