Bilateral Memory Consolidation for Continual Learning

被引:2
|
作者
Nie, Xing [1 ,2 ]
Xu, Shixiong [1 ,2 ]
Liu, Xiyan [3 ]
Meng, Gaofeng [1 ,2 ,4 ]
Huo, Chunlei [1 ,2 ]
Xiang, Shiming [1 ,2 ]
机构
[1] Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[3] Baidu Inc, Beijing, Peoples R China
[4] Chinese Acad Sci, HK Inst Sci & Innovat, Ctr Artificial Intelligence & Robot, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.01538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Humans are proficient at continuously acquiring and integrating new knowledge. By contrast, deep models forget catastrophically, especially when tackling highly long task sequences. Inspired by the way our brains constantly rewrite and consolidate past recollections, we propose a novel Bilateral Memory Consolidation (BiMeCo) framework that focuses on enhancing memory interaction capabilities. Specifically, BiMeCo explicitly decouples model parameters into short-term memory module and long-term memory module, responsible for representation ability of the model and generalization over all learned tasks, respectively. BiMeCo encourages dynamic interactions between two memory modules by knowledge distillation and momentum-based updating for forming generic knowledge to prevent forgetting. The proposed BiMeCo is parameter-efficient and can be integrated into existing methods seamlessly. Extensive experiments on challenging benchmarks show that BiMeCo significantly improves the performance of existing continual learning methods. For example, combined with the state-of-the-art method CwD [55], BiMeCo brings in significant gains of around 2% to 6% while using 2x fewer parameters on CIFAR-100 under ResNet-18.
引用
收藏
页码:16026 / 16035
页数:10
相关论文
共 50 条
  • [31] Continual learning via region-aware memory
    Kai Zhao
    Zhenyong Fu
    Jian Yang
    Applied Intelligence, 2023, 53 : 8389 - 8401
  • [32] Closed-Loop Memory GAN for Continual Learning
    Rios, Amanda
    Itti, Laurent
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3332 - 3338
  • [33] Saliency-Augmented Memory Completion for Continual Learning
    Bai, Guangji
    Ling, Chen
    Gao, Yuyang
    Zhao, Liang
    PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 244 - 252
  • [34] Continual Learning of Fault Prediction for Turbofan Engines using Deep Learning with Elastic Weight Consolidation
    Maschler, Benjamin
    Vietz, Hannes
    Jazdi, Nasser
    Weyrich, Michael
    2020 25TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2020, : 959 - 966
  • [35] Projected Latent Distillation for Data-Agnostic Consolidation in distributed continual learning
    Carta, Antonio
    Cossu, Andrea
    Lomonaco, Vincenzo
    Bacciu, Davide
    van de Weijer, Joost
    NEUROCOMPUTING, 2024, 598
  • [36] Schematic memory persistence and transience for efficient and robust continual learning
    Gao, Yuyang
    Ascoli, Giorgio A.
    Zhao, Liang
    NEURAL NETWORKS, 2021, 144 : 49 - 60
  • [37] Continual learning for seizure prediction via memory projection strategy
    Shi, Yufei
    Tang, Shishi
    Li, Yuxuan
    He, Zhipeng
    Tang, Shengsheng
    Wang, Ruixuan
    Zheng, Weishi
    Chen, Ziyi
    Zhou, Yi
    Computers in Biology and Medicine, 2024, 181
  • [38] Dynamic Memory-Based Continual Learning with Generating and Screening
    Tao, Siying
    Huang, Jinyang
    Zhang, Xiang
    Sun, Xiao
    Gu, Yu
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III, 2023, 14256 : 365 - 376
  • [39] Consolidation of learning and memory in Parkinson's disease
    Sharp, M. E.
    Duncan, K.
    Foerde, K.
    Kahane, R.
    Shohamy, D.
    MOVEMENT DISORDERS, 2016, 31 : S479 - S480
  • [40] Distributionally Robust Memory Evolution With Generalized Divergence for Continual Learning
    Wang, Zhenyi
    Shen, Li
    Duan, Tiehang
    Suo, Qiuling
    Fang, Le
    Liu, Wei
    Gao, Mingchen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14337 - 14352