SATLM: Satisfiability-Aided Language Models Using Declarative Prompting

Cited: 0
Authors
Ye, Xi [1]
Chen, Qiaochu [1]
Dillig, Isil [1]
Durrett, Greg [1]
Affiliation
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
Funding: National Science Foundation (US)
DOI: not available
Chinese Library Classification: TP18 [Artificial Intelligence Theory]
Discipline codes: 081104; 0812; 0835; 1405
Abstract
Prior work has combined chain-of-thought prompting in large language models (LLMs) with programmatic representations to perform effective and transparent reasoning. While such an approach works well for tasks that only require forward reasoning (e.g., straightforward arithmetic), it is less effective for constraint solving problems that require more sophisticated planning and search. In this paper, we propose a new satisfiability-aided language modeling (SATLM) approach for improving the reasoning capabilities of LLMs. We use an LLM to generate a declarative task specification rather than an imperative program and leverage an off-the-shelf automated theorem prover to derive the final answer. This approach has two key advantages. The declarative specification is closer to the problem description than the reasoning steps are, so the LLM can parse it out of the description more accurately. Furthermore, by offloading the actual reasoning task to an automated theorem prover, our approach can guarantee the correctness of the answer with respect to the parsed specification and avoid planning errors in the solving process. We evaluate SATLM on 8 different datasets and show that it consistently outperforms program-aided LMs in the imperative paradigm. In particular, SATLM outperforms program-aided LMs by 23% on a challenging subset of the GSM arithmetic reasoning dataset; SATLM also achieves a new SoTA on LSAT and BOARDGAMEQA, surpassing previous models that are trained on the respective training sets.
Pages: 33