Enhancing Orthopedic Knowledge Assessments: The Performance of Specialized Generative Language Model Optimization

Cited by: 0
Authors
Zhou, Hong [1, 2]
Wang, Hong-lin [1, 2]
Duan, Yu-yu [2, 3]
Yan, Zi-neng [1, 2]
Luo, Rui [1, 2]
Lv, Xiang-xin [1, 2]
Xie, Yi [1, 2]
Zhang, Jia-yao [1, 2]
Yang, Jia-ming [1, 2]
Xue, Ming-di [1, 2]
Fang, Ying [1, 2]
Lu, Lin [2, 4]
Liu, Peng-ran [1, 2]
Ye, Zhe-wei [1, 2]
Affiliations
[1] Huazhong Univ Sci & Technol, Union Hosp, Tongji Med Coll, Dept Orthoped Surg, Wuhan 430022, Peoples R China
[2] Huazhong Univ Sci & Technol, Union Hosp, Tongji Med Coll, Lab Intelligent Med, Wuhan 430022, Peoples R China
[3] Hubei Univ Chinese Med, Coll Chinese Med, Wuhan 433065, Peoples R China
[4] Wuhan Univ, Dept Orthoped, Renmin Hosp, Wuhan 433060, Peoples R China
Source
CURRENT MEDICAL SCIENCE | 2024
Funding
National Natural Science Foundation of China
Keywords
artificial intelligence; large language models; generative artificial intelligence; orthopedics; CLINICAL-PRACTICE GUIDELINE; AMERICAN ACADEMY; HIP-FRACTURES; MANAGEMENT
DOI
10.1007/s11596-024-2929-4
Chinese Library Classification (CLC) number
R-3 [Medical research methods]; R3 [Basic medicine]
Discipline classification code
1001
Abstract
Objective: This study aimed to evaluate and compare the effectiveness of knowledge base-optimized and unoptimized large language models (LLMs) in the field of orthopedics, in order to explore optimization strategies for applying LLMs in specific fields.
Methods: A specialized knowledge base was constructed from clinical guidelines of the American Academy of Orthopaedic Surgeons (AAOS) and authoritative orthopedic publications. A total of 30 orthopedic questions covering anatomical knowledge, disease diagnosis, fracture classification, treatment options, and surgical techniques were input into both the knowledge base-optimized and unoptimized versions of GPT-4, ChatGLM, and Spark LLM, and the generated responses were recorded. Three experienced orthopedic surgeons evaluated the overall quality, accuracy, and comprehensiveness of these responses.
Results: Compared with its unoptimized counterpart, the optimized version of GPT-4 showed improvements of 15.3% in overall quality, 12.5% in accuracy, and 12.8% in comprehensiveness; ChatGLM showed improvements of 24.8%, 16.1%, and 19.6%, respectively; and Spark LLM showed improvements of 6.5%, 14.5%, and 24.7%, respectively.
Conclusion: Knowledge base optimization significantly enhances the quality, accuracy, and comprehensiveness of the responses provided by the 3 models in the orthopedic field. Knowledge base optimization is therefore an effective method for improving the performance of LLMs in specific fields.
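The abstract does not state how the percentage improvements were derived. One plausible reading, offered here only as an assumption, is the relative change in mean rater score between the unoptimized and optimized versions of a model. The sketch below illustrates that calculation; the function name `relative_improvement` and the scores are hypothetical placeholders, not data from the study.

```python
from statistics import mean


def relative_improvement(baseline_scores, optimized_scores):
    """Percent change in mean score from baseline to optimized.

    Assumption: improvement is measured as relative change in the mean
    rater score. The source abstract does not confirm this definition.
    """
    base = mean(baseline_scores)
    opt = mean(optimized_scores)
    if base == 0:
        raise ValueError("baseline mean is zero; relative change is undefined")
    return (opt - base) / base * 100.0


# Hypothetical placeholder ratings, NOT taken from the paper.
baseline = [3.0, 4.0, 3.0, 4.0]   # mean 3.5
optimized = [4.0, 4.0, 4.0, 4.0]  # mean 4.0

improvement = relative_improvement(baseline, optimized)
print(f"Relative improvement: {improvement:.1f}%")
```

If the study instead reported absolute differences in percentage points, the subtraction would be kept and the division by the baseline mean dropped.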
Pages: 1001-1005
Page count: 5