On Conditional and Compositional Language Model Differentiable Prompting

被引:0
|
作者
Pilault, Jonathan [1 ]
Liu, Can [2 ]
Bansal, Mohit [3 ]
Dreyer, Markus [2 ]
机构
[1] Polytech Montreal, Mila Quebec AI Inst, Montreal, PQ, Canada
[2] Amazon Alexa, Seattle, WA USA
[3] Univ North Carolina Chapel Hill, Chapel Hill, NC USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prompts have been shown to be an effective method to adapt a frozen Pretrained Language Model (PLM) to perform well on downstream tasks. Prompts can be represented by a human-engineered word sequence or by a learned continuous embedding. In this work, we investigate conditional and compositional differentiable prompting. We propose a new model, Prompt Production System (PROPS), which learns to transform task instructions or input metadata, into continuous prompts that elicit task-specific outputs from the PLM. Our model uses a modular network structure based on our neural formulation of Production Systems, which allows the model to learn discrete rules - neural functions that learn to specialize in transforming particular prompt input patterns, making it suitable for compositional transfer learning and few-shot learning. We present extensive empirical and theoretical analysis and show that PROPS consistently surpasses other PLM adaptation techniques, and often improves upon fully fine-tuned models, on compositional generalization tasks, controllable summarization and multilingual translation, while needing fewer trainable parameters.
引用
收藏
页码:4136 / 4144
页数:9
相关论文
共 50 条
  • [1] Reasoning with Language Model Prompting: A Survey
    Qiao, Shuofei
    Ou, Yixin
    Zhang, Ningyu
    Chen, Xiang
    Yao, Yunzhi
    Deng, Shumin
    Tan, Chuanqi
    Huang, Fei
    Chen, Huajun
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 5368 - 5393
  • [2] Compositional Prompting Video-language Models to Understand Procedure in Instructional Videos
    Hu, Guyue
    He, Bin
    Zhang, Hanwang
    MACHINE INTELLIGENCE RESEARCH, 2023, 20 (02) : 249 - 262
  • [3] A Prompting Framework to Enhance Language Model Output
    Ratnayake, Himath
    Wang, Can
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT II, 2024, 14472 : 66 - 81
  • [4] ALLIES: Prompting Large Language Model with Beam Search
    Sun, Hao
    Liu, Xiao
    Gong, Yeyun
    Zhang, Yan
    Jiang, Daxin
    Yang, Linjun
    Duan, Nan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 3794 - 3805
  • [5] A Comparison of Prompting by Exclusion and Delayed Prompting during Conditional Discrimination Training
    Cariveau, Tom
    Ellington, Paige
    Brown, Alexandria
    Platt, Delanie F.
    EDUCATION AND TREATMENT OF CHILDREN, 2023, 46 (02) : 107 - 119
  • [6] Topic Compositional Neural Language Model
    Wang, Wenlin
    Gan, Zhe
    Wang, Wenqi
    Shen, Dinghan
    Huang, Jiaji
    Ping, Wei
    Satheesh, Sanjeev
    Carin, Lawrence
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [7] A Comparison of Prompting by Exclusion and Delayed Prompting during Conditional Discrimination Training
    Tom Cariveau
    Paige Ellington
    Alexandria Brown
    Delanie F. Platt
    Education and Treatment of Children, 2023, 46 : 107 - 119
  • [8] Mind the Biases: Quantifying Cognitive Biases in Language Model Prompting
    Lin, Ruixi
    Ng, Hwee Tou
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 5269 - 5281
  • [9] Hierarchical Prompting Assists Large Language Model on Web Navigation
    Sridhar, Abishek
    Lo, Chi-Fan
    Xu, Frank F.
    Zhu, Hao
    Zhou, Shuyan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 10217 - 10244
  • [10] A Differentiable Language Model Adversarial Attack on Text Classifiers
    Fursov, Ivan
    Zaytsev, Alexey
    Burnyshev, Pavel
    Dmitrieva, Ekaterina
    Klyuchnikov, Nikita
    Kravchenko, Andrey
    Artemova, Ekaterina
    Komleva, Evgenia
    Burnaev, Evgeny
    IEEE ACCESS, 2022, 10 : 17966 - 17976