Performance evaluation of large language models with chain-of-thought reasoning ability in clinical laboratory case interpretation

被引:0
|
作者
Yang, He S. [1 ]
Li, Jieli [2 ]
Yi, Xin [1 ,3 ]
Wang, Fei [4 ]
机构
[1] Weill Cornell Med, Dept Pathol & Lab Med, 525 E 68th St,F707, New York, NY 10065 USA
[2] Ohio State Univ, Wexner Med Ctr, Dept Pathol, Columbus, OH USA
[3] Houston Methodist Hosp, Dept Pathol & Genom Med, Houston, TX USA
[4] Weill Cornell Med, Dept Populat Hlth Sci, New York, NY USA
关键词
large language models; chain-of-thought; retrieval augmented generation; AI Chatbot; laboratory medicine;
D O I
10.1515/cclm-2025-0055
中图分类号
R446 [实验室诊断]; R-33 [实验医学、医学实验];
学科分类号
1001 ;
摘要
引用
收藏
页数:3
相关论文
共 50 条
  • [21] Vietnamese Elementary Math Reasoning Using Large Language Model with Refined Translation and Dense-Retrieved Chain-of-Thought
    Nguyen-Khang Le
    Dieu-Hien Nguyen
    Dinh-Truong Do
    Chau Nguyen
    Minh Le Nguyen
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, JSAI-ISAI 2024, 2024, 14741 : 260 - 268
  • [22] DeCoT: Debiasing Chain-of-Thought for Knowledge-Intensive Tasks in Large Language Models via Causal Intervention
    Wu, Junda
    Yu, Tong
    Chen, Xiang
    Wang, Haoliang
    Rossi, Ryan A.
    Kim, Sungchul
    Rao, Anup
    McAuley, Julian
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 14073 - 14087
  • [23] T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Large Language Model Signals for Science Question Answering
    Wang, Lei
    Hu, Yi
    He, Jiabang
    Xu, Xing
    Liu, Ning
    Liu, Hui
    Shen, Heng Tao
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19162 - 19170
  • [24] Rationality of Thought Improves Reasoning in Large Language Models
    Gou, Tian
    Zhang, Boyao
    Sun, Zhenglie
    Wang, Jing
    Liu, Fang
    Wang, Yangang
    Wang, Jue
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2024, 2024, 14887 : 343 - 358
  • [25] LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models
    Parmar, Mihir
    Patel, Nisarg
    Varshney, Neeraj
    Nakamura, Mutsumi
    Luo, Man
    Mashetty, Santosh
    Mitra, Arindam
    Baral, Chitta
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 13679 - 13707
  • [26] Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding
    Fan, Caoyun
    Tian, Jidong
    Li, Yitian
    Chen, Wenqing
    He, Hao
    Jin, Yaohui
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 14774 - 14785
  • [27] Exploring Reversal Mathematical Reasoning Ability for Large Language Models
    Guo, Pei
    You, Wangjie
    Li, Juntao
    Yan, Bowen
    Zhang, Min
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13671 - 13685
  • [28] Towards Analysis and Interpretation of Large Language Models for Arithmetic Reasoning
    Akter, Mst Shapna
    Shahriar, Hossain
    Cuzzocrea, Alfredo
    2024 11TH IEEE SWISS CONFERENCE ON DATA SCIENCE, SDS 2024, 2024, : 267 - 270
  • [29] KG-CoT: Chain-of-Thought Prompting of Large Language Models over Knowledge Graphs for Knowledge-Aware Question Answering
    Zhao, Ruilin
    Zhao, Feng
    Wang, Long
    Wang, Xianzhi
    Xu, Guandong
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 6642 - 6650
  • [30] Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting
    Turpin, Miles
    Michael, Julian
    Perez, Ethan
    Bowman, Samuel R.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,