Large Language Models are Versatile Decomposers: Decomposing Evidence and Questions for Table-based Reasoning

被引：11

作者：

Ye, Yunhu ^{[1
,4
]}

Hui, Binyuan ^{[2
]}

Yang, Min ^{[3
]}

Li, Binhua ^{[2
]}

Huang, Fei ^{[2
]}

Li, Yongbin ^{[2
]}

机构：

[1] Univ Sci & Technol China, Hefei, Peoples R China

[2] DAMO Acad, Alibaba Grp, Hangzhou, Peoples R China

[3] Chinese Acad Sci, SIAT, Shenzhen, Peoples R China

[4] Chinese Acad Sci, Shenzhen Inst Adv Technol SIAT, Shenzhen, Peoples R China

来源：

PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023 | 2023年

关键词：

Table-based reasoning; Large language models; Pre-trained language models;

D O I：

10.1145/3539618.3591708

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Table-based reasoning has shown remarkable progress in a wide range of table-based tasks. It is a challenging task, which requires reasoning over both free-form natural language (NL) questions and (semi-)structured tabular data. However, previous table-based reasoning solutions usually suffer from significant performance degradation on "huge" evidence (tables). In addition, most existing methods struggle to reason over complex questions since the essential information is scattered in different places. To alleviate the above challenges, we exploit large language models (LLMs) as decomposers for effective table-based reasoning, which (i) decompose huge evidence (a huge table) into sub-evidence (a small table) to mitigate the interference of useless information for table reasoning, and (ii) decompose a complex question into simpler sub-questions for text reasoning. First, we use a powerful LLM to decompose the evidence involved in the current question into the sub-evidence that retains the relevant information and excludes the remaining irrelevant information from the "huge" evidence. Second, we propose a novel "parsing-execution-filling" strategy to decompose a complex question into simper step-by-step sub-questions by generating intermediate SQL queries as a bridge to produce numerical and logical sub-questions with a powerful LLM. Finally, we leverage the decomposed sub-evidence and sub-questions to get the final answer with a few in-context prompting examples. Extensive experiments on three benchmark datasets (TabFact, WikiTableQuestion, and FetaQA) demonstrate that our method achieves significantly better results than competitive baselines for table-based reasoning. Notably, our method outperforms human performance for the first time on the TabFact dataset. In addition to impressive overall performance, our method also has the advantage of interpretability, where the returned results are to some extent tractable with the generated sub-evidence and sub-questions. For reproducibility, we release our source code and data at: https://github.com/AlibabaResearch/DAMO-ConvAI.

引用

页码：174 / 184

页数：11

共 50 条

[1] Are Large Language Models Table-based Fact-Checkers?
Zhang, Hanwen
Si, Qingyi
Fu, Peng
Lin, Zheng
Wang, Weiping
PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 3086 - 3091
[2] A survey of table reasoning with large language models
Zhang, Xuanliang
Wang, Dingzirui
Dou, Longxu
Zhu, Qingfu
Che, Wanxiang
FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (09)
[3] IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models
You, Haoxuan
Sun, Rui
Wang, Zhecan
Chen, Long
Wang, Gengyu
Ayyubi, Hammad A.
Chang, Kai-Wei
Chang, Shih-Fu
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 11289 - 11303
[4] RERG: Reinforced evidence reasoning with graph neural network for table-based fact verification
Zhao, Guangzhen
Yang, Peng
Yao, Yu
APPLIED INTELLIGENCE, 2023, 53 (10) : 12308 - 12323
[5] RERG: Reinforced evidence reasoning with graph neural network for table-based fact verification
Guangzhen Zhao
Peng Yang
Yu Yao
Applied Intelligence, 2023, 53 : 12308 - 12323
[6] Table-based Knowledge Representations for Industrial Feature Models
Felfernig, Alexander
Ortner, Bettina
Le, Viet-Man
26TH ACM INTERNATIONAL SYSTEMS AND SOFTWARE PRODUCT LINE CONFERENCE, SPLC 2022, VOL B, 2022, : 245 - 248
[7] SimSync: A Table-based Constraint Processing Language for Synchronization Control
Huang Ke
Li Zhihong
Koo, Benjamin
Wang Tianju
2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS, PROCEEDINGS, 2009, : 691 - 695
[8] TabMoE: A General Framework for Diverse Table-Based Reasoning with Mixture-of-Experts
Wu, Jie
Hou, Mengshu
MATHEMATICS, 2024, 12 (19)
[9] Large Language Models Are Reasoning Teachers
Ho, Namgyu
Schmid, Laura
Yun, Se-Young
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14852 - 14882
[10] On Implementing Case-Based Reasoning with Large Language Models
Wilkerson, Kaitlynne
Leake, David
CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2024, 2024, 14775 : 404 - 417

← 1 2 3 4 5 →