Over-Reasoning and Redundant Calculation of Large Language Models

被引：0

作者：

Chiang, Cheng-Han ^{[1
]}

Lee, Hung-yi ^{[1
]}

机构：

[1] Natl Taiwan Univ, Taipei, Taiwan

来源：

PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2: SHORT PAPERS | 2024年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Large language models (LLMs) can solve problems step-by-step. While this chain-of-thought (CoT) reasoning boosts LLMs' performance, it is unclear if LLMs know when to use CoT and whether those CoT are always necessary to answer the question. This paper shows that LLMs tend to generate redundant calculations and reasoning on a manually constructed math QA dataset, GSM8K-Zero. GSM8K-Zero is constructed such that the questions can be answered without any calculations, but LLMs, including Llama-2 models and Claude-2, tend to generate lengthy and unnecessary calculations to answer the questions. We also conduct experiments to explain why LLMs generate redundant calculations and reasonings. GSM8K-Zero is publicly available at https://github.com/d223302/Over-Reasoning-of- LLMs and https://huggingface.co/datasets/dcml0714/GSM8K-Zero.

引用

页码：161 / 169

页数：9

共 50 条

[1] ThinkSum: Probabilistic reasoning over sets using large language models
Ozturkler, Batu
Malkin, Nikolay
Wang, Zhen
Jojic, Nebojsa
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 1216 - 1239
[2] Large Language Models Are Reasoning Teachers
Ho, Namgyu
Schmid, Laura
Yun, Se-Young
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14852 - 14882
[3] KnowledgeNavigator: leveraging large language models for enhanced reasoning over knowledge graph
Guo, Tiezheng
Yang, Qingwen
Wang, Chen
Liu, Yanyi
Li, Pan
Tang, Jiawei
Li, Dapeng
Wen, Yingyou
COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (05) : 7063 - 7076
[4] Towards Reasoning in Large Language Models: A Survey
Huang, Jie
Chang, Kevin Chen-Chuan
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 1049 - 1065
[5] Conversations on reasoning: Large language models in diagnosis
Restrepo, Daniel
Rodman, Adam
Abdulnour, Raja-Elie
JOURNAL OF HOSPITAL MEDICINE, 2024, 19 (08) : 731 - 735
[6] Emergent analogical reasoning in large language models
Taylor Webb
Keith J. Holyoak
Hongjing Lu
Nature Human Behaviour, 2023, 7 : 1526 - 1541
[7] Large Language Models are Visual Reasoning Coordinators
Chen, Liangyu
Li, Bo
Shen, Sheng
Yang, Jingkang
Li, Chunyuan
Keutzer, Kurt
Darrell, Trevor
Liu, Ziwei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[8] Inductive reasoning in humans and large language models
Han, Simon Jerome
Ransom, Keith J.
Perfors, Andrew
Kemp, Charles
COGNITIVE SYSTEMS RESEARCH, 2024, 83
[9] Conditional and Modal Reasoning in Large Language Models
Holliday, Wesley H.
Mandelkern, Matthew
Zhang, Cedegao E.
EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, 2024, : 3800 - 3821
[10] Emergent analogical reasoning in large language models
Webb, Taylor
Holyoak, Keith J.
Lu, Hongjing
NATURE HUMAN BEHAVIOUR, 2023, 7 (09) : 1526 - 1541

← 1 2 3 4 5 →