On Conditional and Compositional Language Model Differentiable Prompting

被引：0

作者：

Pilault, Jonathan ^{[1
]}

Liu, Can ^{[2
]}

Bansal, Mohit ^{[3
]}

Dreyer, Markus ^{[2
]}

机构：

[1] Polytech Montreal, Mila Quebec AI Inst, Montreal, PQ, Canada

[2] Amazon Alexa, Seattle, WA USA

[3] Univ North Carolina Chapel Hill, Chapel Hill, NC USA

来源：

PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Prompts have been shown to be an effective method to adapt a frozen Pretrained Language Model (PLM) to perform well on downstream tasks. Prompts can be represented by a human-engineered word sequence or by a learned continuous embedding. In this work, we investigate conditional and compositional differentiable prompting. We propose a new model, Prompt Production System (PROPS), which learns to transform task instructions or input metadata, into continuous prompts that elicit task-specific outputs from the PLM. Our model uses a modular network structure based on our neural formulation of Production Systems, which allows the model to learn discrete rules - neural functions that learn to specialize in transforming particular prompt input patterns, making it suitable for compositional transfer learning and few-shot learning. We present extensive empirical and theoretical analysis and show that PROPS consistently surpasses other PLM adaptation techniques, and often improves upon fully fine-tuned models, on compositional generalization tasks, controllable summarization and multilingual translation, while needing fewer trainable parameters.

引用

页码：4136 / 4144

页数：9

共 50 条

[1] Reasoning with Language Model Prompting: A Survey
Qiao, Shuofei
Ou, Yixin
Zhang, Ningyu
Chen, Xiang
Yao, Yunzhi
Deng, Shumin
Tan, Chuanqi
Huang, Fei
Chen, Huajun
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 5368 - 5393
[2] Compositional Prompting Video-language Models to Understand Procedure in Instructional Videos
Hu, Guyue
He, Bin
Zhang, Hanwang
MACHINE INTELLIGENCE RESEARCH, 2023, 20 (02) : 249 - 262
[3] A Prompting Framework to Enhance Language Model Output
Ratnayake, Himath
Wang, Can
ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT II, 2024, 14472 : 66 - 81
[4] ALLIES: Prompting Large Language Model with Beam Search
Sun, Hao
Liu, Xiao
Gong, Yeyun
Zhang, Yan
Jiang, Daxin
Yang, Linjun
Duan, Nan
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 3794 - 3805
[5] A Comparison of Prompting by Exclusion and Delayed Prompting during Conditional Discrimination Training
Cariveau, Tom
Ellington, Paige
Brown, Alexandria
Platt, Delanie F.
EDUCATION AND TREATMENT OF CHILDREN, 2023, 46 (02) : 107 - 119
[6] Topic Compositional Neural Language Model
Wang, Wenlin
Gan, Zhe
Wang, Wenqi
Shen, Dinghan
Huang, Jiaji
Ping, Wei
Satheesh, Sanjeev
Carin, Lawrence
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
[7] A Comparison of Prompting by Exclusion and Delayed Prompting during Conditional Discrimination Training
Tom Cariveau
Paige Ellington
Alexandria Brown
Delanie F. Platt
Education and Treatment of Children, 2023, 46 : 107 - 119
[8] Mind the Biases: Quantifying Cognitive Biases in Language Model Prompting
Lin, Ruixi
Ng, Hwee Tou
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 5269 - 5281
[9] Hierarchical Prompting Assists Large Language Model on Web Navigation
Sridhar, Abishek
Lo, Chi-Fan
Xu, Frank F.
Zhu, Hao
Zhou, Shuyan
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 10217 - 10244
[10] A Differentiable Language Model Adversarial Attack on Text Classifiers
Fursov, Ivan
Zaytsev, Alexey
Burnyshev, Pavel
Dmitrieva, Ekaterina
Klyuchnikov, Nikita
Kravchenko, Andrey
Artemova, Ekaterina
Komleva, Evgenia
Burnaev, Evgeny
IEEE ACCESS, 2022, 10 : 17966 - 17976

← 1 2 3 4 5 →