Learning to Prompt for Continual Learning

被引:252
|
作者
Wang, Zifeng [1 ]
Zhang, Zizhao [2 ]
Lee, Hen Yu [2 ]
Zhang, Han [3 ]
Sun, Ruoxi [2 ]
Ren, Xiaoqi [2 ]
Su, Guolong [3 ]
Perot, Vincent [3 ]
Dy, Jennifer [1 ]
Pfister, Tomas [2 ]
机构
[1] Northeastern Univ, Boston, MA 02115 USA
[2] Google Cloud AI, Sunnyvale, CA USA
[3] Google Res, Sunnyvale, CA USA
关键词
SYSTEMS;
D O I
10.1109/CVPR52688.2022.00024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The mainstream paradigm behind continual learning has been to adapt the model parameters to non-stationary data distributions, where catastrophic forgetting is the central challenge. Typical methods rely on a rehearsal buffer or known task identity at test time to retrieve learned knowledge and address forgetting, while this work presents a new paradigm for continual learning that aims to train a more succinct memory system without accessing task identity at test time. Our method learns to dynamically prompt (L2P) a pre-trained model to learn tasks sequentially under different task transitions. In our proposed framework, prompts are small learnable parameters, which are maintained in a memory space. The objective is to optimize prompts to instruct the model prediction and explicitly manage task-invariant and task-specific knowledge while maintaining model plasticity. We conduct comprehensive experiments under popular image classification benchmarks with different challenging continual learning settings, where L2P consistently outperforms prior state-ofthe-art methods. Surprisingly, L2P achieves competitive results against rehearsal-based methods even without a rehearsal buffer and is directly applicable to challenging taskagnostic continual learning. Source code is available at https:// github.com/ google- research/l2p.
引用
收藏
页码:139 / 149
页数:11
相关论文
共 50 条
  • [21] Heterogeneous Continual Learning
    Madaan, Divyam
    Yin, Hongxu
    Byeon, Wonmin
    Kautz, Jan
    Molchanov, Pavlo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15985 - 15995
  • [22] Residual Continual Learning
    Lee, Janghyeon
    Joo, Donggyu
    Hong, Hyeong Gwon
    Kim, Junmo
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 4553 - 4560
  • [23] Reinforced Continual Learning
    Xu, Ju
    Zhu, Zhanxing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [24] Flashback for Continual Learning
    Mahmoodi, Leila
    Harandi, Mehrtash
    Moghadam, Peyman
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3426 - 3435
  • [25] Kernel Continual Learning
    Derakhshani, Mohammad Mahdi
    Zhen, Xiantong
    Shao, Ling
    Snoek, Cees G. M.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [26] Open-world continual learning: Unifying novelty detection and continual learning
    Kim, Gyuhak
    Xiao, Changnan
    Konishi, Tatsuya
    Ke, Zixuan
    Liu, Bing
    ARTIFICIAL INTELLIGENCE, 2025, 338
  • [27] Continual compression model for online continual learning
    Ye, Fei
    Bors, Adrian G.
    APPLIED SOFT COMPUTING, 2024, 167
  • [28] Decouple Before Interact: Multi-Modal Prompt Learning for Continual Visual Question Answering
    Qian, Zi
    Wang, Xin
    Duan, Xuguang
    Qin, Pengda
    Li, Yuhong
    Zhu, Wenwu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2941 - 2950
  • [29] Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality
    Wang, Liyuan
    Xie, Jingyi
    Zhang, Xingxing
    Huang, Mingyi
    Su, Hang
    Zhu, Jun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [30] Dynamic learning rates for continual unsupervised learning
    David Fernandez-Rodriguez, Jose
    Jose Palomo, Esteban
    Miguel Ortiz-De-Lazcano-Lobato, Juan
    Ramos-Jimenez, Gonzalo
    Lopez-Rubio, Ezequiel
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2023, 30 (03) : 257 - 273