Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models

Cited by: 0
Authors
Logan, Robert L. [1 ]
Balazevic, Ivana [2 ,4 ]
Wallace, Eric [3 ]
Petroni, Fabio [4 ]
Singh, Sameer [1 ]
Riedel, Sebastian [4 ,5 ]
Affiliations
[1] UC Irvine, Irvine, CA 92697 USA
[2] DeepMind, London, England
[3] Univ Calif Berkeley, Berkeley, CA USA
[4] Facebook AI Res, Menlo Pk, CA USA
[5] UCL, London, England
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Prompting language models (LMs) with training examples and task descriptions has been seen as critical to recent successes in few-shot learning. In this work, we show that finetuning LMs in the few-shot setting can considerably reduce the need for prompt engineering. In fact, one can use null prompts, prompts that contain neither task-specific templates nor training examples, and achieve accuracy competitive with manually tuned prompts across a wide range of tasks. While finetuning LMs does introduce new parameters for each downstream task, we show that this memory overhead can be substantially reduced: finetuning only the bias terms can achieve comparable or better accuracy than standard finetuning while updating only 0.1% of the parameters. All in all, we recommend finetuning LMs for few-shot learning as it is more accurate, has relatively stable performance across different prompts, and can be made nearly as efficient as using frozen LMs.
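The two ideas in the abstract, null prompts and bias-only finetuning, can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the paper's code: the `[MASK]` placeholder string, the example sentiment template, the function names, and the toy parameter sizes. The sketch only shows what a template-free input looks like and how small the share of bias parameters is in a single linear layer.

```python
from typing import Dict

# Illustrative sketch only. MASK, the template wording, and all function
# names are hypothetical; real tokenizers define their own mask token.
MASK = "[MASK]"


def template_prompt(text: str) -> str:
    """A manually written, task-specific template (sentiment-style example)."""
    return f"{text} Overall, it was {MASK}."


def null_prompt(text: str) -> str:
    """A null prompt: the input followed by the mask token, with no
    task-specific template and no training examples."""
    return f"{text} {MASK}"


def trainable_fraction(param_sizes: Dict[str, int]) -> float:
    """Fraction of parameters that are updated when only bias terms train.

    param_sizes maps a parameter name to its number of elements; names
    ending in "bias" are treated as bias terms (an assumed convention).
    """
    bias = sum(n for name, n in param_sizes.items() if name.endswith("bias"))
    return bias / sum(param_sizes.values())


# Toy example (hypothetical sizes): one 768x768 linear layer with a bias.
sizes = {"layer.weight": 768 * 768, "layer.bias": 768}
print(template_prompt("A gripping film."))
print(null_prompt("A gripping film."))
print(f"bias-only trainable share: {trainable_fraction(sizes):.2%}")
```

For this toy layer the bias vector is 1 of every 769 parameters, about 0.13%, which is the same order of magnitude as the 0.1% figure quoted in the abstract; the exact share in a real model depends on its architecture.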
Pages: 2824-2835
Page count: 12