SATLM: Satisfiability-Aided Language Models Using Declarative Prompting

被引:0
|
作者
Ye, Xi [1 ]
Chen, Qiaochu [1 ]
Dillig, Isil [1 ]
Durrett, Greg [1 ]
机构
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prior work has combined chain-of-thought prompting in large language models (LLMs) with programmatic representations to perform effective and transparent reasoning. While such an approach works well for tasks that only require forward reasoning (e.g., straightforward arithmetic), it is less effective for constraint solving problems that require more sophisticated planning and search. In this paper, we propose a new satisfiability-aided language modeling (SATLM) approach for improving the reasoning capabilities of LLMs. We use an LLM to generate a declarative task specification rather than an imperative program and leverage an off-the-shelf automated theorem prover to derive the final answer. This approach has two key advantages. The declarative specification is closer to the problem description than the reasoning steps are, so the LLM can parse it out of the description more accurately. Furthermore, by offloading the actual reasoning task to an automated theorem prover, our approach can guarantee the correctness of the answer with respect to the parsed specification and avoid planning errors in the solving process. We evaluate SATLM on 8 different datasets and show that it consistently outperforms program-aided LMs in the imperative paradigm. In particular, SATLM outperforms program-aided LMs by 23% on a challenging subset of the GSM arithmetic reasoning dataset; SATLM also achieves a new SoTA on LSAT and BOARDGAMEQA, surpassing previous models that are trained on the respective training sets.(1)
引用
收藏
页数:33
相关论文
共 50 条
  • [21] Toward a Declarative Language to Generate Explorable Sets of Models
    Le Calvar, Theo
    Chhel, Fabien
    Jouault, Frederic
    Saubion, Frederic
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 1837 - 1844
  • [22] Analyzing Declarative Deployment Code with Large Language Models
    Lanciano, Giacomo
    Stein, Manuel
    Hilt, Volker
    Cucinotta, Tommaso
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, CLOSER 2023, 2023, : 289 - 296
  • [23] Grammar Prompting for Domain-Specific Language Generation with Large Language Models
    Wang, Bailin
    Wang, Zi
    Wang, Xuezhi
    Cao, Yuan
    Saurous, Rif A.
    Kim, Yoon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [24] SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks
    Chang, Kai-Wei
    Wu, Haibin
    Wang, Yu-Kai
    Wu, Yuan-Kuei
    Shen, Hua
    Tseng, Wei-Cheng
    Kang, Iu-Thing
    Li, Shang-Wen
    Lee, Hung-Yi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3730 - 3744
  • [25] Improve Performance of Fine-tuning Language Models with Prompting
    Yang, Zijian Gyozo
    Ligeti-Nagy, Noenn
    INFOCOMMUNICATIONS JOURNAL, 2023, 15 : 62 - 68
  • [26] How to Optimize Prompting for Large Language Models in Clinical Research
    Lee, Jeong Hyun
    Shin, Jaeseung
    KOREAN JOURNAL OF RADIOLOGY, 2024, 25 (10) : 869 - 873
  • [27] Standardized nomenclature for litigational legal prompting in generative language models
    Sivakumar A.
    Gelman B.
    Simmons R.
    Discover Artificial Intelligence, 2024, 4 (01):
  • [28] Prompting large language models for inner gains in radiology studies
    India, Partha Pratim Ray
    CLINICAL IMAGING, 2025, 120
  • [29] Active Prompting with Chain-of-Thought for Large Language Models
    Diao, Shizhe
    Wang, Pengcheng
    Lin, Yong
    Pan, Rui
    Liu, Xiang
    Zhang, Tong
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 1330 - 1350
  • [30] PIQARD System for Experimenting and Testing Language Models with Prompting Strategies
    Korcz, Marcin
    Plaskowski, Dawid
    Politycki, Mateusz
    Stefanowski, Jerzy
    Terentowicz, Alex
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VII, 2023, 14175 : 320 - 323