Enhancing Orthopedic Knowledge Assessments: The Performance of Specialized Generative Language Model Optimization

被引:0
|
作者
Zhou, Hong [1 ,2 ]
Wang, Hong-lin [1 ,2 ]
Duan, Yu-yu [2 ,3 ]
Yan, Zi-neng [1 ,2 ]
Luo, Rui [1 ,2 ]
Lv, Xiang-xin [1 ,2 ]
Xie, Yi [1 ,2 ]
Zhang, Jia-yao [1 ,2 ]
Yang, Jia-ming [1 ,2 ]
Xue, Ming-di [1 ,2 ]
Fang, Ying [1 ,2 ]
Lu, Lin [2 ,4 ]
Liu, Peng-ran [1 ,2 ]
Ye, Zhe-wei [1 ,2 ]
机构
[1] Huazhong Univ Sci & Technol, Union Hosp, Tongji Med Coll, Dept Orthoped Surg, Wuhan 430022, Peoples R China
[2] Huazhong Univ Sci & Technol, Union Hosp, Tongji Med Coll, Lab Intelligent Med, Wuhan 430022, Peoples R China
[3] Hubei Univ Chinese Med, Coll Chinese Med, Wuhan 433065, Peoples R China
[4] Wuhan Univ, Dept Orthoped, Renmin Hosp, Wuhan 433060, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
artificial intelligence; large language models; generative articial intelligence; orthopedics; CLINICAL-PRACTICE GUIDELINE; AMERICAN ACADEMY; HIP-FRACTURES; MANAGEMENT;
D O I
10.1007/s11596-024-2929-4
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
ObjectiveThis study aimed to evaluate and compare the effectiveness of knowledge base-optimized and unoptimized large language models (LLMs) in the field of orthopedics to explore optimization strategies for the application of LLMs in specific fields.MethodsThis research constructed a specialized knowledge base using clinical guidelines from the American Academy of Orthopaedic Surgeons (AAOS) and authoritative orthopedic publications. A total of 30 orthopedic-related questions covering aspects such as anatomical knowledge, disease diagnosis, fracture classification, treatment options, and surgical techniques were input into both the knowledge base-optimized and unoptimized versions of the GPT-4, ChatGLM, and Spark LLM, with their generated responses recorded. The overall quality, accuracy, and comprehensiveness of these responses were evaluated by 3 experienced orthopedic surgeons.ResultsCompared with their unoptimized LLMs, the optimized version of GPT-4 showed improvements of 15.3% in overall quality, 12.5% in accuracy, and 12.8% in comprehensiveness; ChatGLM showed improvements of 24.8%, 16.1%, and 19.6%, respectively; and Spark LLM showed improvements of 6.5%, 14.5%, and 24.7%, respectively.ConclusionThe optimization of knowledge bases significantly enhances the quality, accuracy, and comprehensiveness of the responses provided by the 3 models in the orthopedic field. Therefore, knowledge base optimization is an effective method for improving the performance of LLMs in specific fields.
引用
收藏
页码:1001 / 1005
页数:5
相关论文
共 50 条
  • [1] GenKP: generative knowledge prompts for enhancing large language models
    Li, Xinbai
    Peng, Shaowen
    Yada, Shuntaro
    Wakamiya, Shoko
    Aramaki, Eiji
    APPLIED INTELLIGENCE, 2025, 55 (06)
  • [2] GenKP: generative knowledge prompts for enhancing large language modelsGenKP: generative knowledge prompts for enhancing large language modelsX. Li et al.
    Xinbai Li
    Shaowen Peng
    Shuntaro Yada
    Shoko Wakamiya
    Eiji Aramaki
    Applied Intelligence, 2025, 55 (7)
  • [3] Medical language model specialized in extracting cardiac knowledge
    Gwon, Hansle
    Seo, Jiahn
    Park, Seohyun
    Kim, Young-Hak
    Jun, Tae Joon
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [4] CodeKGC: Code Language Model for Generative Knowledge Graph Construction
    Bi, Zhen
    Chen, Jing
    Jiang, Yinuo
    Xiong, Feiyu
    Guo, Wei
    Chen, Huajun
    Zhang, Ningyu
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (03)
  • [5] A partition-based optimization model and its performance benchmark for Generative Anatomy Modeling Language
    Demirel, Doga
    Cetinsaya, Berk
    Halic, Tansel
    Kockara, Sinan
    Reiners, Dirk
    Ahmadi, Shahryar
    Arikatla, Sreekanth
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 119
  • [6] A partition-based optimization model and its performance benchmark for Generative Anatomy Modeling Language
    Department of Computer Science, Florida Polytechnic University, Lakeland
    FL, United States
    不详
    FL, United States
    不详
    AR, United States
    不详
    AR, United States
    不详
    NC, United States
    Comput. Biol. Med., 2020,
  • [7] Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information
    Chepurova, Alla
    Bulatov, Aydar
    Kuratov, Yuri
    Burtsev, Mikhail
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 5306 - 5316
  • [8] Enhancing Generative Class Incremental Learning Performance With a Model Forgetting Approach
    Togo, Taro
    Togo, Ren
    Maeda, Keisuke
    Ogawa, Takahiro
    Haseyama, Miki
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2025, 6 : 203 - 212
  • [9] KGGLM: A Generative Language Model for Generalizable Knowledge Graph Representation Learning in Recommendation
    Balloccu, Giacomo
    Boratto, Ludovico
    Fenu, Gianni
    Marras, Mirko
    Soccol, Alessandro
    PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 1079 - 1084
  • [10] PU-GEN: Enhancing generative commonsense reasoning for language models with human-centered knowledge
    Seo, Jaehyung
    Oh, Dongsuk
    Eo, Sugyeong
    Park, Chanjun
    Yang, Kisu
    Moon, Hyeonseok
    Park, Kinam
    Lim, Heuiseok
    KNOWLEDGE-BASED SYSTEMS, 2022, 256