A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction

被引:0
|
作者
Alamoodi, A. H. [1 ,2 ,3 ]
Zughoul, Omar [4 ]
David, Dianese [5 ]
Garfan, Salem [5 ]
Pamucar, Dragan [6 ,7 ,8 ]
Albahri, O. S. [9 ,10 ]
Albahri, A. S. [11 ,12 ]
Yussof, Salman [1 ,13 ]
Sharaf, Iman Mohamad [14 ]
机构
[1] Univ Tenaga Nas, Inst Informat & Comp Energy, Kajang, Malaysia
[2] Appl Sci Private Univ, Appl Sci Res Ctr, Amman, Jordan
[3] Middle East Univ, MEU Res Unit, Amman, Jordan
[4] Ahmed bin Mohammed Mil Coll, Informat Syst & Comp Sci Dept, Al Shahaniya, Qatar
[5] Univ Pendidikan Sultan Idris UPSI, Fac Comp & Meta Technol FKMT, Perak, Malaysia
[6] Szechenyi Istvan Univ, Gyor, Hungary
[7] Yuan Ze Univ, Dept Ind Engn & Management, Taoyuan 320315, Taiwan
[8] Western Caspian Univ, Dept Mech & Math, Baku, Azerbaijan
[9] Australian Tech & Management Coll, Melbourne, Australia
[10] Mazaya Univ Coll, Comp Tech Engn Dept, Nasiriyah, Iraq
[11] Imam Jaafar Al Sadiq Univ, Tech Coll, Baghdad, Iraq
[12] Iraqi Commiss Comp & Informat ICCI, Baghdad, Iraq
[13] Univ Tenaga Nas, Coll Comp & Informat, Dept Comp, Kajang, Malaysia
[14] Higher Technol Inst, Dept Basic Sci, Tenth Of Ramadan City, Egypt
关键词
FWZIC; MAIRCA; p; q-QROFS; Medical Relation Extraction; Clinical Concept Extraction; MODEL;
D O I
10.1007/s10916-024-02090-y
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Artificial intelligence (AI) has become a crucial element of modern technology, especially in the healthcare sector, which is apparent given the continuous development of large language models (LLMs), which are utilized in various domains, including medical beings. However, when it comes to using these LLMs for the medical domain, there's a need for an evaluation platform to determine their suitability and drive future development efforts. Towards that end, this study aims to address this concern by developing a comprehensive Multi-Criteria Decision Making (MCDM) approach that is specifically designed to evaluate medical LLMs. The success of AI, particularly LLMs, in the healthcare domain, depends on their efficacy, safety, and ethical compliance. Therefore, it is essential to have a robust evaluation framework for their integration into medical contexts. This study proposes using the Fuzzy-Weighted Zero-InConsistency (FWZIC) method extended to p, q-quasirung orthopair fuzzy set (p, q-QROFS) for weighing evaluation criteria. This extension enables the handling of uncertainties inherent in medical decision-making processes. The approach accommodates the imprecise and multifaceted nature of real-world medical data and criteria by incorporating fuzzy logic principles. The MultiAtributive Ideal-Real Comparative Analysis (MAIRCA) method is employed for the assessment of medical LLMs utilized in the case study of this research. The results of this research revealed that "Medical Relation Extraction" criteria with its sub-levels had more importance with (0.504) than "Clinical Concept Extraction" with (0.495). For the LLMs evaluated, out of 6 alternatives, (A4) "GatorTron S 10B" had the 1st rank as compared to (A1) "GatorTron 90B" had the 6th rank. The implications of this study extend beyond academic discourse, directly impacting healthcare practices and patient outcomes. The proposed framework can help healthcare professionals make more informed decisions regarding the adoption and utilization of LLMs in medical settings.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Clinical named entity recognition and relation extraction using natural language processing of medical free text: A systematic review
    Navarro, David Fraile
    Ijaz, Kiran
    Rezazadegan, Dana
    Rahimi-Ardabili, Hania
    Dras, Mark
    Coiera, Enrico
    Berkovsky, Shlomo
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 177
  • [42] Design and Transition of an Emergency E-Learning Pathology Course for Medical Students-Evaluation of a Novel Course Concept
    Holzmann-Littig, Christopher
    Jedlicska, Nana
    Wijnen-Meijer, Marjo
    Liesche-Starnecker, Friederike
    Schmidt-Baese, Karen
    Renders, Lutz
    Weimann, Katja
    Konukiewitz, Bjoern
    Schlegel, Juergen
    EUROPEAN JOURNAL OF INVESTIGATION IN HEALTH PSYCHOLOGY AND EDUCATION, 2023, 13 (01) : 112 - 129
  • [43] Hybrid Diagnosis Models for Autism Patients Based on Medical and Sociodemographic Features Using Machine Learning and Multicriteria Decision-Making (MCDM) Techniques: An Evaluation and Benchmarking Framework
    Alqaysi M.E.
    Albahri A.S.
    Hamid R.A.
    Computational and Mathematical Methods in Medicine, 2022, 2022
  • [44] Circular economy of medical waste: novel intelligent medical waste management framework based on extension linear Diophantine fuzzy FDOSM and neural network approach (vol 30, pg 60473, 2023)
    Chew, XinYing
    Khaw, Khai Wah
    Alnoor, Alhamzah
    Ferasso, Marcos
    Al Halbusi, Hussam
    Muhsen, Yousif Raad
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2023, 30 (24) : 66428 - 66428
  • [46] Design, Implementation and Evaluation of a Distance Learning Framework to Expedite Medical Education during COVID-19 pandemic: A Proof-of-Concept Study
    Azar, Aida J.
    Khamis, Amar Hassan
    Naidoo, Nerissa
    Lindsbro, Marjam
    Boukhaled, Juliana Helena
    Gonuguntla, Suneetha
    Davis, David
    Banerjee, Yajnavalka
    JOURNAL OF MEDICAL EDUCATION AND CURRICULAR DEVELOPMENT, 2021, 8
  • [47] HD Constellation: A novel patient-centred approach to enriching clinical datasets by combining medical record chart review and Enroll-HD
    Belendiuk, K.
    Sacksteder, J. P.
    Sanii, X.
    Neacy, E.
    Levey, J.
    Korner, E.
    Houghton, R.
    Yohrling, G.
    MOVEMENT DISORDERS, 2020, 35 : S94 - S94
  • [48] HD Constellation: a Novel Patient-Centered Approach to Enriching Clinical Datasets by Combining Medical Record Chart Review and Enroll-HD
    Belendiuk, Katherine
    Sacksteder, J. P.
    Sanii, Xerxes
    Neacy, Eileen
    Levey, Jamie
    Korner, Eli
    Houghton, Richard
    Yohrling, George
    NEUROTHERAPEUTICS, 2020, 17 (SUPPL 1) : 4 - 4
  • [49] Extraction of 3D Femur Neck Trabecular Bone Architecture from Clinical CT Images in Osteoporotic Evaluation: a Novel Framework
    Sapthagirivasan, V.
    Anburajan, M.
    Janarthanam, S.
    JOURNAL OF MEDICAL SYSTEMS, 2015, 39 (08)
  • [50] Extraction of 3D Femur Neck Trabecular Bone Architecture from Clinical CT Images in Osteoporotic Evaluation: a Novel Framework
    V. Sapthagirivasan
    M. Anburajan
    S. Janarthanam
    Journal of Medical Systems, 2015, 39