SATLM: Satisfiability-Aided Language Models Using Declarative Prompting

被引：0

作者：

Ye, Xi ^{[1
]}

Chen, Qiaochu ^{[1
]}

Dillig, Isil ^{[1
]}

Durrett, Greg ^{[1
]}

机构：

[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Prior work has combined chain-of-thought prompting in large language models (LLMs) with programmatic representations to perform effective and transparent reasoning. While such an approach works well for tasks that only require forward reasoning (e.g., straightforward arithmetic), it is less effective for constraint solving problems that require more sophisticated planning and search. In this paper, we propose a new satisfiability-aided language modeling (SATLM) approach for improving the reasoning capabilities of LLMs. We use an LLM to generate a declarative task specification rather than an imperative program and leverage an off-the-shelf automated theorem prover to derive the final answer. This approach has two key advantages. The declarative specification is closer to the problem description than the reasoning steps are, so the LLM can parse it out of the description more accurately. Furthermore, by offloading the actual reasoning task to an automated theorem prover, our approach can guarantee the correctness of the answer with respect to the parsed specification and avoid planning errors in the solving process. We evaluate SATLM on 8 different datasets and show that it consistently outperforms program-aided LMs in the imperative paradigm. In particular, SATLM outperforms program-aided LMs by 23% on a challenging subset of the GSM arithmetic reasoning dataset; SATLM also achieves a new SoTA on LSAT and BOARDGAMEQA, surpassing previous models that are trained on the respective training sets.(1)

引用

页数：33

共 50 条

[41] Multi-Modal Attribute Prompting for Vision-Language Models
Liu, Xin
Wu, Jiamin
Yang, Wenfei
Zhou, Xu
Zhang, Tianzhu
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) : 11579 - 11591
[42] PAM: Prompting Audio-Language Models for Audio Quality Assessment
Deshmukh, Soham
Alharthi, Dareen
Elizalde, Benjamin
Gamper, Hannes
Al Ismail, Mahmoud
Singh, Rita
Raj, Bhiksha
Wang, Huaming
INTERSPEECH 2024, 2024, : 3320 - 3324
[43] On Hardware Security Bug Code Fixes by Prompting Large Language Models
Ahmad, Baleegh
Thakur, Shailja
Tan, Benjamin
Karri, Ramesh
Pearce, Hammond
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 4043 - 4057
[44] Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Wei, Jason
Wang, Xuezhi
Schuurmans, Dale
Bosma, Maarten
Ichter, Brian
Xia, Fei
Chi, Ed H.
Le, Quoc V.
Zhou, Denny
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[45] The Application of Probing Prompting Learning Models in Mastering Foreign Language Vocabulary
Usman, Misnawaty
Adys, Himala Praptami
Rosmaladewi, Rosmaladewi
Asnur, Muhammad Nur Ashar
IJOLE-INTERNATIONAL JOURNAL OF LANGUAGE EDUCATION, 2023, 7 (04): : 702 - 710
[46] Correction: Standardized nomenclature for litigational legal prompting in generative language models
Aditya Sivakumar
Ben Gelman
Robert Simmons
Discover Artificial Intelligence, 4 (1):
[47] A Communication Theory Perspective on Prompting Engineering Methods for Large Language Models
Song, Yuan-Feng
He, Yuan-Qin
Zhao, Xue-Fang
Gu, Han-Lin
Jiang, Di
Yang, Hai-Jun
Fan, Li-Xin
Journal of Computer Science and Technology, 2024, 39 (04) : 984 - 1004
[48] PromptSMILES: prompting for scaffold decoration and fragment linking in chemical language models
Thomas, Morgan
Ahmad, Mazen
Tresadern, Gary
de Fabritiis, Gianni
JOURNAL OF CHEMINFORMATICS, 2024, 16 (01):
[49] Predicting Design conflicts using Declarative Language Approach
Cheng, Yuan
He, Fazhi
Lv, Xiao
PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 268 - 273
[50] Query decomposition using the XML declarative description language
Thuy, LTT
Duong, DD
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, PT 2, 2005, 3481 : 1066 - 1075

← 1 2 3 4 5 →