Performance evaluation of large language models with chain-of-thought reasoning ability in clinical laboratory case interpretation

被引:0
|
作者
Yang, He S. [1 ]
Li, Jieli [2 ]
Yi, Xin [1 ,3 ]
Wang, Fei [4 ]
机构
[1] Weill Cornell Med, Dept Pathol & Lab Med, 525 E 68th St,F707, New York, NY 10065 USA
[2] Ohio State Univ, Wexner Med Ctr, Dept Pathol, Columbus, OH USA
[3] Houston Methodist Hosp, Dept Pathol & Genom Med, Houston, TX USA
[4] Weill Cornell Med, Dept Populat Hlth Sci, New York, NY USA
关键词
large language models; chain-of-thought; retrieval augmented generation; AI Chatbot; laboratory medicine;
D O I
10.1515/cclm-2025-0055
中图分类号
R446 [实验室诊断]; R-33 [实验医学、医学实验];
学科分类号
1001 ;
摘要
引用
收藏
页数:3
相关论文
共 50 条
  • [1] Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
    Wei, Jason
    Wang, Xuezhi
    Schuurmans, Dale
    Bosma, Maarten
    Ichter, Brian
    Xia, Fei
    Chi, Ed H.
    Le, Quoc V.
    Zhou, Denny
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [2] Chain-of-Thought Reasoning in Tabular Language Models
    Zheng, Mingyu
    Hao, Yang
    Jiang, Wenbin
    Lin, Zheng
    Lyu, Yajuan
    She, Qiaoqiao
    Wang, Weiping
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 11006 - 11019
  • [3] On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning
    Nowak, Franz
    Svete, Anej
    Butoi, Alexandra
    Cotterell, Ryan
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 12510 - 12548
  • [4] Active Prompting with Chain-of-Thought for Large Language Models
    Diao, Shizhe
    Wang, Pengcheng
    Lin, Yong
    Pan, Rui
    Liu, Xiang
    Zhang, Tong
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 1330 - 1350
  • [5] Moderating New Waves of Online Hate with Chain-of-Thought Reasoning in Large Language Models
    Vishwamitra, Nishant
    Guo, Keyan
    Romit, Farhan Tajwar
    Ondracek, Isabelle
    Cheng, Long
    Zhao, Ziming
    Hu, Hongxin
    45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024, 2024, : 788 - 806
  • [6] ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models
    Chen, Zhipeng
    Zhou, Kun
    Zhang, Beichen
    Gong, Zheng
    Zhao, Wayne Xin
    Wen, Ji-Rong
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 14777 - 14790
  • [7] Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models
    Wang, Lei
    Xu, Wanyu
    Lan, Yihuai
    Hu, Zhiqiang
    Lan, Yunshi
    Lee, Roy Ka-Wei
    Lim, Ee-Peng
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 2609 - 2634
  • [8] Chain-of-Thought Improves Text Generation with Citations in Large Language Models
    Ji, Bin
    Liu, Huijun
    Du, Mingzhe
    Ng, See-Kiong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 18345 - 18353
  • [9] DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models
    Zheng, Ge
    Yang, Bin
    Tang, Jiajin
    Zhou, Hong-Yu
    Yang, Sibei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] Multi-Modal Latent Space Learning for Chain-of-Thought Reasoning in Language Models
    He, Liqi
    Li, Zuchao
    Cai, Xiantao
    Wang, Ping
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 18180 - 18187