Online Continual Learning via the Meta-learning update with Multi-scale Knowledge Distillation and Data Augmentation

被引:3
|
作者
Han, Ya-nan [1 ]
Liu, Jian-wei [1 ]
机构
[1] China Univ Petr, Coll Informat Sci & Engn, Dept Automat, Beijing, Peoples R China
关键词
Continual learning; The stability-plasticity dilemma; Meta-learning; Knowledge distillation; Data augmentation; NEURAL-NETWORKS;
D O I
10.1016/j.engappai.2022.104966
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Continual learning aims to rapidly and continually learn the current task from a sequence of tasks, using the knowledge obtained in the past, while performing well on prior tasks. A key challenge in this setting is the stability-plasticity dilemma existing in current and previous tasks, i.e., a high-stability network is weak to learn new knowledge in an effort to maintain previous knowledge. Correspondingly, a high-plasticity network can easily forget old tasks while dealing with well on the new task. Compared to other kinds of methods, the methods based on experience replay have shown great advantages to overcome catastrophic forgetting. One common limitation of this method is the data imbalance between the previous and current tasks, which would further aggravate forgetting. Moreover, how to effectively address the stability-plasticity dilemma in this setting is also an urgent problem to be solved. In this paper, we overcome these challenges by proposing a novel framework called Meta-learning update via Multi-scale Knowledge Distillation and Data Augmentation (MMKDDA). Specifically, we apply multi-scale knowledge distillation to grasp the evolution of long-range and short-range spatial relationships at different feature levels to alleviate the problem of data imbalance. Besides, our method mixes the samples from the episodic memory and current task in the online continual training procedure, thus alleviating the side influence due to the change of probability distribution. Moreover, we optimize our model via the meta-learning update by resorting to the number of tasks seen previously, which is helpful to keep a better balance between stability and plasticity. Finally, our extensive experiments on four benchmark datasets show the effectiveness of the proposed MMKDDA framework against other popular baselines, and ablation studies are also conducted to further analyze the role of each component in our framework.
引用
下载
收藏
页数:17
相关论文
共 50 条
  • [31] Automated Data Pre-processing via Meta-learning
    Bilalli, Besim
    Abello, Alberto
    Aluja-Banet, Tomas
    Wrembel, Robert
    MODEL AND DATA ENGINEERING, 2016, 9893 : 194 - 208
  • [32] Multi-Scale Contourlet Knowledge Guide Learning Segmentation
    Liu, Mengkun
    Jiao, Licheng
    Liu, Xu
    Li, Lingling
    Liu, Fang
    Yang, Shuyuan
    Wang, Shuang
    Hou, Biao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4831 - 4845
  • [33] Selecting Related Knowledge via Efficient Channel Attention for Online Continual Learning
    Han, Ya-nan
    Liu, Jian-we
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [34] Multi-scale online learning: Theory and applications to online auctions and pricing
    Bubeck, Sébastien
    Devanur, Nikhil R.
    Huang, Zhiyi
    Niazadeh, Rad
    Journal of Machine Learning Research, 2019, 20
  • [35] Meta-Learning without Data via Unconditional Diffusion Models
    Wei Y.
    Hu Z.
    Shen L.
    Wang Z.
    Li L.
    Li Y.
    Yuan C.
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (11) : 1 - 1
  • [36] Online continual learning via the knowledge invariant and spread-out properties
    Han, Ya-nan
    Liu, Jian-wei
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [37] Multi-scale Online Learning: Theory and Applications to Online Auctions and Pricing
    Bubeck, Sebastien
    Devanur, Nikhil R.
    Huang, Zhiyi
    Niazadeh, Rad
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20 : 1 - 37
  • [38] MEDA: Meta-Learning with Data Augmentation for Few-Shot Text Classification
    Sun, Pengfei
    Ouyang, Yawen
    Zhang, Wenming
    Dai, Xin-yu
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3929 - 3935
  • [39] Multi-scale object retrieval via learning on graph from multimodal data
    Zhang, Yongsheng
    Yamamoto, Tsuyoshi
    Dobashi, Yoshinori
    NEUROCOMPUTING, 2016, 207 : 684 - 692
  • [40] Cross-data Automatic Feature Engineering via Meta-learning and Reinforcement Learning
    Zhang, Jianyu
    Hao, Jianye
    Fogelman-Soulie, Francoise
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I, 2020, 12084 : 818 - 829