A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction

Cited by: 0
Authors
Alamoodi, A. H. [1, 2, 3]
Zughoul, Omar [4]
David, Dianese [5]
Garfan, Salem [5]
Pamucar, Dragan [6, 7, 8]
Albahri, O. S. [9, 10]
Albahri, A. S. [11, 12]
Yussof, Salman [1, 13]
Sharaf, Iman Mohamad [14]
Affiliations
[1] Univ Tenaga Nas, Inst Informat & Comp Energy, Kajang, Malaysia
[2] Appl Sci Private Univ, Appl Sci Res Ctr, Amman, Jordan
[3] Middle East Univ, MEU Res Unit, Amman, Jordan
[4] Ahmed bin Mohammed Mil Coll, Informat Syst & Comp Sci Dept, Al Shahaniya, Qatar
[5] Univ Pendidikan Sultan Idris UPSI, Fac Comp & Meta Technol FKMT, Perak, Malaysia
[6] Szechenyi Istvan Univ, Gyor, Hungary
[7] Yuan Ze Univ, Dept Ind Engn & Management, Taoyuan 320315, Taiwan
[8] Western Caspian Univ, Dept Mech & Math, Baku, Azerbaijan
[9] Australian Tech & Management Coll, Melbourne, Australia
[10] Mazaya Univ Coll, Comp Tech Engn Dept, Nasiriyah, Iraq
[11] Imam Jaafar Al Sadiq Univ, Tech Coll, Baghdad, Iraq
[12] Iraqi Commiss Comp & Informat ICCI, Baghdad, Iraq
[13] Univ Tenaga Nas, Coll Comp & Informat, Dept Comp, Kajang, Malaysia
[14] Higher Technol Inst, Dept Basic Sci, Tenth Of Ramadan City, Egypt
Keywords
FWZIC; MAIRCA; p, q-QROFS; Medical Relation Extraction; Clinical Concept Extraction; MODEL
DOI
10.1007/s10916-024-02090-y
Chinese Library Classification (CLC) number
R19 [Health care organization and services (health services administration)]
Abstract
Artificial intelligence (AI) has become a crucial element of modern technology, especially in healthcare, as shown by the continuous development of large language models (LLMs) that are used in various domains, including medicine. However, using these LLMs in the medical domain requires an evaluation platform to determine their suitability and to guide future development efforts. To that end, this study develops a comprehensive Multi-Criteria Decision Making (MCDM) approach designed specifically to evaluate medical LLMs. The success of AI, particularly LLMs, in healthcare depends on efficacy, safety, and ethical compliance, so a robust evaluation framework is essential for integrating these models into medical contexts. This study proposes using the Fuzzy-Weighted Zero-InConsistency (FWZIC) method, extended to the p, q-quasirung orthopair fuzzy set (p, q-QROFS), to weight the evaluation criteria. This extension enables the handling of uncertainties inherent in medical decision-making. By incorporating fuzzy logic principles, the approach accommodates the imprecise and multifaceted nature of real-world medical data and criteria. The MultiAttributive Ideal-Real Comparative Analysis (MAIRCA) method is then used to assess the medical LLMs in the case study. The results show that the "Medical Relation Extraction" criterion, with its sub-levels, carried more importance (0.504) than "Clinical Concept Extraction" (0.495). Of the 6 LLM alternatives evaluated, (A4) "GatorTron S 10B" ranked 1st, whereas (A1) "GatorTron 90B" ranked 6th. The implications of this study extend beyond academic discourse, directly impacting healthcare practices and patient outcomes. The proposed framework can help healthcare professionals make more informed decisions regarding the adoption and utilization of LLMs in medical settings.
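As a rough illustration of the ranking stage named in the abstract, the following is a minimal sketch of the standard crisp MAIRCA procedure: equal alternative preferences, a theoretical matrix, a real matrix built by min-max normalization, and a total gap per alternative, where the smallest gap ranks first. It is not the authors' implementation and does not reproduce the p, q-QROFS extension of FWZIC. The function name `mairca_rank` and all scores in the example are hypothetical; only the two weights (0.504 and 0.495) come from the abstract, and they are used here as if the two top-level criteria were the only ones.

```python
def mairca_rank(matrix, weights, benefit):
    """Crisp MAIRCA sketch: return (total gaps, alternative indices best-first).

    matrix  -- rows are alternatives, columns are criteria (raw scores)
    weights -- one weight per criterion
    benefit -- True where higher is better, False for cost criteria
    """
    m = len(matrix)
    n = len(weights)
    preference = 1.0 / m  # every alternative is equally preferred a priori

    gaps = []
    for row in matrix:
        total_gap = 0.0
        for j in range(n):
            column = [r[j] for r in matrix]
            lo, hi = min(column), max(column)
            theoretical = preference * weights[j]
            if hi == lo:
                ratio = 1.0  # criterion does not discriminate; no gap
            elif benefit[j]:
                ratio = (row[j] - lo) / (hi - lo)
            else:
                ratio = (row[j] - hi) / (lo - hi)
            real = theoretical * ratio
            total_gap += theoretical - real  # gap between ideal and real
        gaps.append(total_gap)

    # Smaller total gap means closer to the ideal, so it ranks first.
    order = sorted(range(m), key=lambda i: gaps[i])
    return gaps, order


# Hypothetical scores for three made-up alternatives on two criteria
# (relation extraction, concept extraction). Not data from the paper.
scores = [
    [0.9, 0.8],
    [0.7, 0.9],
    [0.5, 0.5],
]
gaps, order = mairca_rank(scores, [0.504, 0.495], [True, True])
print(order)
```

In the study itself the criteria have sub-levels and the weights come from expert input processed through FWZIC, so a faithful reproduction would need the full weight vector and the actual decision matrix.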
Pages: 12