A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction

被引:0
|
作者
Alamoodi, A. H. [1 ,2 ,3 ]
Zughoul, Omar [4 ]
David, Dianese [5 ]
Garfan, Salem [5 ]
Pamucar, Dragan [6 ,7 ,8 ]
Albahri, O. S. [9 ,10 ]
Albahri, A. S. [11 ,12 ]
Yussof, Salman [1 ,13 ]
Sharaf, Iman Mohamad [14 ]
机构
[1] Univ Tenaga Nas, Inst Informat & Comp Energy, Kajang, Malaysia
[2] Appl Sci Private Univ, Appl Sci Res Ctr, Amman, Jordan
[3] Middle East Univ, MEU Res Unit, Amman, Jordan
[4] Ahmed bin Mohammed Mil Coll, Informat Syst & Comp Sci Dept, Al Shahaniya, Qatar
[5] Univ Pendidikan Sultan Idris UPSI, Fac Comp & Meta Technol FKMT, Perak, Malaysia
[6] Szechenyi Istvan Univ, Gyor, Hungary
[7] Yuan Ze Univ, Dept Ind Engn & Management, Taoyuan 320315, Taiwan
[8] Western Caspian Univ, Dept Mech & Math, Baku, Azerbaijan
[9] Australian Tech & Management Coll, Melbourne, Australia
[10] Mazaya Univ Coll, Comp Tech Engn Dept, Nasiriyah, Iraq
[11] Imam Jaafar Al Sadiq Univ, Tech Coll, Baghdad, Iraq
[12] Iraqi Commiss Comp & Informat ICCI, Baghdad, Iraq
[13] Univ Tenaga Nas, Coll Comp & Informat, Dept Comp, Kajang, Malaysia
[14] Higher Technol Inst, Dept Basic Sci, Tenth Of Ramadan City, Egypt
关键词
FWZIC; MAIRCA; p; q-QROFS; Medical Relation Extraction; Clinical Concept Extraction; MODEL;
D O I
10.1007/s10916-024-02090-y
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Artificial intelligence (AI) has become a crucial element of modern technology, especially in the healthcare sector, which is apparent given the continuous development of large language models (LLMs), which are utilized in various domains, including medical beings. However, when it comes to using these LLMs for the medical domain, there's a need for an evaluation platform to determine their suitability and drive future development efforts. Towards that end, this study aims to address this concern by developing a comprehensive Multi-Criteria Decision Making (MCDM) approach that is specifically designed to evaluate medical LLMs. The success of AI, particularly LLMs, in the healthcare domain, depends on their efficacy, safety, and ethical compliance. Therefore, it is essential to have a robust evaluation framework for their integration into medical contexts. This study proposes using the Fuzzy-Weighted Zero-InConsistency (FWZIC) method extended to p, q-quasirung orthopair fuzzy set (p, q-QROFS) for weighing evaluation criteria. This extension enables the handling of uncertainties inherent in medical decision-making processes. The approach accommodates the imprecise and multifaceted nature of real-world medical data and criteria by incorporating fuzzy logic principles. The MultiAtributive Ideal-Real Comparative Analysis (MAIRCA) method is employed for the assessment of medical LLMs utilized in the case study of this research. The results of this research revealed that "Medical Relation Extraction" criteria with its sub-levels had more importance with (0.504) than "Clinical Concept Extraction" with (0.495). For the LLMs evaluated, out of 6 alternatives, (A4) "GatorTron S 10B" had the 1st rank as compared to (A1) "GatorTron 90B" had the 6th rank. The implications of this study extend beyond academic discourse, directly impacting healthcare practices and patient outcomes. The proposed framework can help healthcare professionals make more informed decisions regarding the adoption and utilization of LLMs in medical settings.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] JaMIE: A Pipeline Japanese Medical Information Extraction System with Novel Relation Annotation
    Cheng, Fei
    Yada, Shuntaro
    Tanaka, Ribeka
    Aramaki, Eiji
    Kurohashi, Sadao
    2022 Language Resources and Evaluation Conference, LREC 2022, 2022, : 3724 - 3731
  • [22] JaMIE: A Pipeline Japanese Medical Information Extraction System with Novel Relation Annotation
    Cheng, Fei
    Yada, Shuntaro
    Tanaka, Ribeka
    Aramaki, Eiji
    Kurohashi, Sadao
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3724 - 3731
  • [23] Fusion of medical images based on salient features extraction by PSO optimized fuzzy logic in NSST domain
    Gao, Yuan
    Ma, Shiwei
    Liu, Jingjing
    Liu, Yanyan
    Zhang, Xianxia
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 69
  • [24] A Novel NLP-FUZZY System Prototype for Information Extraction from Medical Guidelines
    Fazlic, Lejla Begic
    Hallawa, Ahmed
    Schmeink, Anke
    Peine, Arne
    Martin, Lukas
    Dartmann, Guido
    2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1025 - 1030
  • [25] A Novel Fuzzy Logic-Based Medical Expert System for Diagnosis of Chronic Kidney Disease
    Singla, Jimmy
    Kaur, Balwinder
    Prashar, Deepak
    Jha, Sudan
    Joshi, Gyanendra Prasad
    Park, Kyungyun
    Tariq, Usman
    Seo, Changho
    MOBILE INFORMATION SYSTEMS, 2020, 2020
  • [26] A novel fuzzy logic-based image steganography method to ensure medical data security
    Karalcis, R.
    Guler, J.
    Capraz, I.
    Bilir, E.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 67 : 172 - 183
  • [27] Rule-Based Method for Automatic Medical Concept Extraction from Unstructured Clinical Text
    Sahu, Ruchi
    RECENT FINDINGS IN INTELLIGENT COMPUTING TECHNIQUES, VOL 3, 2018, 709 : 261 - 267
  • [28] A novel Pythagorean fuzzy combined compromise solution framework for the assessment of medical waste treatment technology
    Liu, Peide
    Rani, Pratibha
    Mishra, Arunodaya Raj
    JOURNAL OF CLEANER PRODUCTION, 2021, 292
  • [29] A Novel Framework for Improving Pulse-Coupled Neural Networks With Fuzzy Connectedness for Medical Image Segmentation
    Bai, Peirui
    Yang, Kai
    Min, Xiaolin
    Guo, Ziyang
    Li, Chang
    Fu, Yingxia
    Han, Chao
    Lu, Xiang
    Liu, Qingyi
    IEEE ACCESS, 2020, 8 : 138129 - 138140
  • [30] A Novel Two-Stage Unsupervised Fault Recognition Framework Combining Feature Extraction and Fuzzy Clustering for Collaborative AIoT
    Hu, Xufeng
    Li, Yibin
    Jia, Lei
    Qiu, Meikang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (02) : 1291 - 1300