Over-Reasoning and Redundant Calculation of Large Language Models

Cited by: 0
Authors
Chiang, Cheng-Han [1 ]
Lee, Hung-yi [1 ]
Affiliations
[1] Natl Taiwan Univ, Taipei, Taiwan
Keywords
DOI: Not available
Chinese Library Classification: TP18 [Artificial Intelligence Theory]
Subject Classification Codes: 081104; 0812; 0835; 1405
Abstract
Large language models (LLMs) can solve problems step by step. While this chain-of-thought (CoT) reasoning boosts LLMs' performance, it is unclear whether LLMs know when to use CoT and whether that reasoning is always necessary to answer the question. This paper shows that LLMs tend to generate redundant calculations and reasoning on a manually constructed math QA dataset, GSM8K-Zero. GSM8K-Zero is constructed such that its questions can be answered without any calculation, yet LLMs, including the Llama-2 models and Claude-2, tend to produce lengthy and unnecessary calculations when answering them. We also conduct experiments to explain why LLMs generate redundant calculations and reasoning. GSM8K-Zero is publicly available at https://github.com/d223302/Over-Reasoning-of-LLMs and https://huggingface.co/datasets/dcml0714/GSM8K-Zero.
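The abstract states that GSM8K-Zero is hosted on the Hugging Face Hub. Below is a minimal sketch (not from the paper) of how one might load the dataset and inspect a few records with the `datasets` library; the dataset ID is taken from the link above, while the available splits and field names are assumptions about the schema and should be checked against the printed records.

```python
# Minimal sketch: load GSM8K-Zero from the Hugging Face Hub and inspect a few
# records. The dataset ID comes from the paper's link; the split and field
# names are not documented here, so we print raw records to reveal the schema.
from datasets import load_dataset

dataset = load_dataset("dcml0714/GSM8K-Zero")

# Take whichever split exists (e.g., "train" or "test") rather than hard-coding one.
split_name, split = next(iter(dataset.items()))
print(f"Split: {split_name}, {len(split)} examples")

for example in split.select(range(3)):
    print(example)  # each record should contain a question answerable without calculation
```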
Pages: 161-169
Number of pages: 9
Related Papers (50 total)
  • [21] Dynamic Voting for Efficient Reasoning in Large Language Models
    Xue, Mingfeng
    Liu, Dayiheng
    Lei, Wenqiang
    Ren, Xingzhang
    Yang, Baosong
    Xie, Jun
    Zhang, Yidan
    Peng, Dezhong
    Lv, Jiancheng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 3085 - 3104
  • [22] Reasoning with large language models for medical question answering
    Lucas, Mary M.
    Yang, Justin
    Pomeroy, Jon K.
    Yang, Christopher C.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (09)
  • [23] Rationality of Thought Improves Reasoning in Large Language Models
    Gou, Tian
    Zhang, Boyao
    Sun, Zhenglie
    Wang, Jing
    Liu, Fang
    Wang, Yangang
    Wang, Jue
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2024, 2024, 14887 : 343 - 358
  • [24] NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
    Zhou, Gengze
    Hong, Yicong
    Wu, Qi
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7641 - 7649
  • [25] IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models
    You, Haoxuan
    Sun, Rui
    Wang, Zhecan
    Chen, Long
    Wang, Gengyu
    Ayyubi, Hammad A.
    Chang, Kai-Wei
    Chang, Shih-Fu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 11289 - 11303
  • [26] Towards Analysis and Interpretation of Large Language Models for Arithmetic Reasoning
    Akter, Mst Shapna
    Shahriar, Hossain
    Cuzzocrea, Alfredo
    2024 11TH IEEE SWISS CONFERENCE ON DATA SCIENCE, SDS 2024, 2024, : 267 - 270
  • [27] On Implementing Case-Based Reasoning with Large Language Models
    Wilkerson, Kaitlynne
    Leake, David
    CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2024, 2024, 14775 : 404 - 417
  • [28] Reasoning with Large Language Models on Graph Tasks: The Influence of Temperature
    Wang, Yiming
    Zhang, Ziyang
    Chen, Hanwei
    Shen, Huayi
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 630 - 634
  • [29] Exploring Reversal Mathematical Reasoning Ability for Large Language Models
    Guo, Pei
    You, Wangjie
    Li, Juntao
    Yan, Bowen
    Zhang, Min
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13671 - 13685
  • [30] Understanding Social Reasoning in Language Models with Language Models
    Gandhi, Kanishk
    Franken, J. -Philipp
    Gerstenberg, Tobias
    Goodman, Noah D.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,