SATLM: Satisfiability-Aided Language Models Using Declarative Prompting

被引:0
|
作者
Ye, Xi [1 ]
Chen, Qiaochu [1 ]
Dillig, Isil [1 ]
Durrett, Greg [1 ]
机构
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prior work has combined chain-of-thought prompting in large language models (LLMs) with programmatic representations to perform effective and transparent reasoning. While such an approach works well for tasks that only require forward reasoning (e.g., straightforward arithmetic), it is less effective for constraint solving problems that require more sophisticated planning and search. In this paper, we propose a new satisfiability-aided language modeling (SATLM) approach for improving the reasoning capabilities of LLMs. We use an LLM to generate a declarative task specification rather than an imperative program and leverage an off-the-shelf automated theorem prover to derive the final answer. This approach has two key advantages. The declarative specification is closer to the problem description than the reasoning steps are, so the LLM can parse it out of the description more accurately. Furthermore, by offloading the actual reasoning task to an automated theorem prover, our approach can guarantee the correctness of the answer with respect to the parsed specification and avoid planning errors in the solving process. We evaluate SATLM on 8 different datasets and show that it consistently outperforms program-aided LMs in the imperative paradigm. In particular, SATLM outperforms program-aided LMs by 23% on a challenging subset of the GSM arithmetic reasoning dataset; SATLM also achieves a new SoTA on LSAT and BOARDGAMEQA, surpassing previous models that are trained on the respective training sets.(1)
引用
收藏
页数:33
相关论文
共 50 条
  • [41] Multi-Modal Attribute Prompting for Vision-Language Models
    Liu, Xin
    Wu, Jiamin
    Yang, Wenfei
    Zhou, Xu
    Zhang, Tianzhu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) : 11579 - 11591
  • [42] PAM: Prompting Audio-Language Models for Audio Quality Assessment
    Deshmukh, Soham
    Alharthi, Dareen
    Elizalde, Benjamin
    Gamper, Hannes
    Al Ismail, Mahmoud
    Singh, Rita
    Raj, Bhiksha
    Wang, Huaming
    INTERSPEECH 2024, 2024, : 3320 - 3324
  • [43] On Hardware Security Bug Code Fixes by Prompting Large Language Models
    Ahmad, Baleegh
    Thakur, Shailja
    Tan, Benjamin
    Karri, Ramesh
    Pearce, Hammond
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 4043 - 4057
  • [44] Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
    Wei, Jason
    Wang, Xuezhi
    Schuurmans, Dale
    Bosma, Maarten
    Ichter, Brian
    Xia, Fei
    Chi, Ed H.
    Le, Quoc V.
    Zhou, Denny
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [45] The Application of Probing Prompting Learning Models in Mastering Foreign Language Vocabulary
    Usman, Misnawaty
    Adys, Himala Praptami
    Rosmaladewi, Rosmaladewi
    Asnur, Muhammad Nur Ashar
    IJOLE-INTERNATIONAL JOURNAL OF LANGUAGE EDUCATION, 2023, 7 (04): : 702 - 710
  • [46] Correction: Standardized nomenclature for litigational legal prompting in generative language models
    Aditya Sivakumar
    Ben Gelman
    Robert Simmons
    Discover Artificial Intelligence, 4 (1):
  • [47] A Communication Theory Perspective on Prompting Engineering Methods for Large Language Models
    Song, Yuan-Feng
    He, Yuan-Qin
    Zhao, Xue-Fang
    Gu, Han-Lin
    Jiang, Di
    Yang, Hai-Jun
    Fan, Li-Xin
    Journal of Computer Science and Technology, 2024, 39 (04) : 984 - 1004
  • [48] PromptSMILES: prompting for scaffold decoration and fragment linking in chemical language models
    Thomas, Morgan
    Ahmad, Mazen
    Tresadern, Gary
    de Fabritiis, Gianni
    JOURNAL OF CHEMINFORMATICS, 2024, 16 (01):
  • [49] Predicting Design conflicts using Declarative Language Approach
    Cheng, Yuan
    He, Fazhi
    Lv, Xiao
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 268 - 273
  • [50] Query decomposition using the XML declarative description language
    Thuy, LTT
    Duong, DD
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, PT 2, 2005, 3481 : 1066 - 1075