Performance evaluation of large language models with chain-of-thought reasoning ability in clinical laboratory case interpretation

被引:0
|
作者
Yang, He S. [1 ]
Li, Jieli [2 ]
Yi, Xin [1 ,3 ]
Wang, Fei [4 ]
机构
[1] Weill Cornell Med, Dept Pathol & Lab Med, 525 E 68th St,F707, New York, NY 10065 USA
[2] Ohio State Univ, Wexner Med Ctr, Dept Pathol, Columbus, OH USA
[3] Houston Methodist Hosp, Dept Pathol & Genom Med, Houston, TX USA
[4] Weill Cornell Med, Dept Populat Hlth Sci, New York, NY USA
关键词
large language models; chain-of-thought; retrieval augmented generation; AI Chatbot; laboratory medicine;
D O I
10.1515/cclm-2025-0055
中图分类号
R446 [实验室诊断]; R-33 [实验医学、医学实验];
学科分类号
1001 ;
摘要
引用
收藏
页数:3
相关论文
共 50 条
  • [31] Chain of Thought Utilization in Large Language Models and Application in Nephrology
    Miao, Jing
    Thongprayoon, Charat
    Suppadungsuk, Supawadee
    Krisanapan, Pajaree
    Radhakrishnan, Yeshwanter
    Cheungpasitporn, Wisit
    MEDICINA-LITHUANIA, 2024, 60 (01):
  • [32] CoF-CoT: Enhancing Large Language Models with Coarse-to-Fine Chain-of-Thought Prompting for Multi-domain NLU Tasks
    Nguyen, Hoang H.
    Liu, Ye
    Zhang, Chenwei
    Zhang, Tao
    Yu, Philip S.
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 12109 - 12119
  • [33] Chain of Logic: Rule-Based Reasoning with Large Language Models
    Servantez, Sergio
    Barrow, Joe
    Hammond, Kristian
    Jain, Rajiv
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 2721 - 2733
  • [34] On Implementing Case-Based Reasoning with Large Language Models
    Wilkerson, Kaitlynne
    Leake, David
    CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2024, 2024, 14775 : 404 - 417
  • [35] NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models via Complexity Classes
    Fan, Lizhou
    Hua, Wenyue
    Li, Lingyao
    Ling, Haoyang
    Zhang, Yongfeng
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 4092 - 4114
  • [36] Triplet-based contrastive method enhances the reasoning ability of large language models
    Chen, Hongwei
    Zhu, Jiahui
    Wang, Wei
    Zhu, Yuan
    Xi, Liya
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (04):
  • [37] An Evaluation of Reasoning Capabilities of Large Language Models in Financial Sentiment Analysis
    Du, Kelvin
    Xing, Frank
    Mao, Rui
    Cambria, Erik
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 189 - 194
  • [38] TIMEBENCH: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
    Chu, Zheng
    Chen, Jingchang
    Chen, Qianglong
    Yu, Weijiang
    Wang, Haotian
    Liu, Ming
    Qin, Bing
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 1204 - 1228
  • [39] INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair
    Wang, Hanbin
    Liu, Zhenghao
    Wang, Shuo
    Cui, Ganqu
    Ding, Ning
    Liu, Zhiyuan
    Yu, Ge
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 2081 - 2107
  • [40] CoTEL-D3X: A chain-of-thought enhanced large language model for drug-drug interaction triplet extraction
    Hu, Haotian
    Yang, Alex Jie
    Deng, Sanhong
    Wang, Dongbo
    Song, Min
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 273