Constrained Few-shot Class-incremental Learning

Citations: 61
Authors
Hersche, Michael [1 ,2 ]
Karunaratne, Geethan [1 ,2 ]
Cherubini, Giovanni [1 ]
Benini, Luca [2 ]
Sebastian, Abu [1 ]
Rahimi, Abbas [1 ]
Affiliations
[1] IBM Res Zurich, Zurich, Switzerland
[2] Swiss Fed Inst Technol, Zurich, Switzerland
Source
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2022
DOI
10.1109/CVPR52688.2022.00885
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Continually learning new classes from fresh data without forgetting previous knowledge of old classes is a challenging research problem. Moreover, such learning must respect memory and computational constraints: (i) training samples are limited to only a few per class, (ii) the computational cost of learning a novel class remains constant, and (iii) the memory footprint of the model grows at most linearly with the number of classes observed. To meet these constraints, we propose C-FSCIL, which is architecturally composed of a frozen meta-learned feature extractor, a trainable fixed-size fully connected layer, and a rewritable, dynamically growing memory that stores as many vectors as the number of encountered classes. C-FSCIL provides three update modes that offer a trade-off between accuracy and the compute-memory cost of learning novel classes. C-FSCIL exploits hyperdimensional embedding, which makes it possible to continually express many more classes than the fixed dimensions in the vector space, with minimal interference. The quality of the class vector representations is further improved by aligning them quasi-orthogonally to each other by means of novel loss functions. Experiments on the CIFAR100, mini-ImageNet, and Omniglot datasets show that C-FSCIL outperforms the baselines with remarkable accuracy and compression. It also scales up to the largest problem size ever tried in this few-shot setting by learning 423 novel classes on top of 1200 base classes with less than 1.6% accuracy drop. Our code is available at https://github.com/IBM/constrained-FSCIL.
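The memory idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: there is no meta-learned feature extractor, no fully connected layer, and none of the paper's loss functions. Random bipolar vectors stand in for embeddings, and the names `PrototypeMemory`, `random_bipolar`, and `noisy_copy`, as well as the values of `DIM`, `NUM_CLASSES`, and `SHOTS`, are assumptions made for illustration. The sketch only shows that a memory holding one averaged vector per class grows linearly with the number of classes, that adding a class costs the same regardless of how many are already stored, and that random high-dimensional vectors are nearly orthogonal, so more classes than dimensions can coexist with little interference.

```python
import math
import random


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


class PrototypeMemory:
    """Hypothetical explicit memory: one stored vector per class seen so far."""

    def __init__(self):
        self.prototypes = {}  # class label -> prototype vector

    def learn_class(self, label, embeddings):
        """Average the few-shot embeddings of one class into a single prototype.

        The cost depends only on the number of shots and the dimension,
        not on how many classes are already stored.
        """
        n = len(embeddings)
        self.prototypes[label] = [sum(col) / n for col in zip(*embeddings)]

    def classify(self, embedding):
        """Return the label whose prototype is most similar to the query."""
        return max(
            self.prototypes,
            key=lambda c: cosine(self.prototypes[c], embedding),
        )


def random_bipolar(dim, rng):
    """Stand-in for a feature extractor output: a random {-1, +1} vector."""
    return [rng.choice((-1.0, 1.0)) for _ in range(dim)]


def noisy_copy(vector, flip_prob, rng):
    """Simulate a sample of a class by flipping a fraction of components."""
    return [-x if rng.random() < flip_prob else x for x in vector]


rng = random.Random(0)
DIM = 512          # illustrative dimension (assumption, not from the paper)
NUM_CLASSES = 600  # deliberately more classes than dimensions
SHOTS = 5          # a few training samples per class

centers = [random_bipolar(DIM, rng) for _ in range(NUM_CLASSES)]
memory = PrototypeMemory()
for label, center in enumerate(centers):
    shots = [noisy_copy(center, 0.1, rng) for _ in range(SHOTS)]
    memory.learn_class(label, shots)

# Largest similarity (in magnitude) between class 0 and any other class center.
max_abs_sim = max(abs(cosine(centers[0], c)) for c in centers[1:])

# Classify a fresh noisy sample drawn from class 42.
query = noisy_copy(centers[42], 0.1, rng)
predicted = memory.classify(query)
```

For random bipolar vectors of dimension d, the cosine similarity between two independent vectors has mean 0 and standard deviation 1/sqrt(d), which is why `max_abs_sim` stays small here while the query remains far closer to its own class prototype than to any other.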
Pages: 9047-9057
Page count: 11
Related papers
50 in total
  • [41] Geometer: Graph Few-Shot Class-Incremental Learning via Prototype Representation
    Lu, Bin
    Gan, Xiaoying
    Yang, Lina
    Zhang, Weinan
    Fu, Luoyi
    Wang, Xinbing
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 1152 - 1161
  • [42] Language-Inspired Relation Transfer for Few-Shot Class-Incremental Learning
    Zhao, Yifan
    Li, Jia
    Song, Zeyin
    Tian, Yonghong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (02) : 1089 - 1102
  • [43] Synthesized Feature based Few-Shot Class-Incremental Learning on a Mixture of Subspaces
    Cheraghian, Ali
    Rahman, Shafin
    Ramasinghe, Sameera
    Fang, Pengfei
    Simon, Christian
    Petersson, Lars
    Harandi, Mehrtash
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8641 - 8650
  • [44] Self-Promoted Prototype Refinement for Few-Shot Class-Incremental Learning
    Zhu, Kai
    Cao, Yang
    Zhai, Wei
    Cheng, Jie
    Zha, Zheng-Jun
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6797 - 6806
  • [45] Semantic-visual Guided Transformer for Few-shot Class-incremental Learning
    Qiu, Wenhao
    Fu, Sichao
    Zhang, Jingyi
    Lei, Chengxiang
    Peng, Qinmu
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2885 - 2890
  • [46] Semantic-aware Knowledge Distillation for Few-Shot Class-Incremental Learning
    Cheraghian, Ali
    Rahman, Shafin
    Fang, Pengfei
    Roy, Soumava Kumar
    Petersson, Lars
    Harandi, Mehrtash
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2534 - 2543
  • [47] Few-Shot Class-Incremental Learning by Sampling Multi-Phase Tasks
    Zhou, Da-Wei
    Ye, Han-Jia
    Ma, Liang
    Xie, Di
    Pu, Shiliang
    Zhan, De-Chuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 12816 - 12831
  • [48] DyCR: A Dynamic Clustering and Recovering Network for Few-Shot Class-Incremental Learning
    Pan, Zicheng
    Yu, Xiaohan
    Zhang, Miaohua
    Zhang, Weichuan
    Gao, Yongsheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 14
  • [49] Few-Shot Class-Incremental Learning from an Open-Set Perspective
    Peng, Can
    Zhao, Kun
    Wang, Tianren
    Li, Meng
    Lovell, Brian C.
    COMPUTER VISION, ECCV 2022, PT XXV, 2022, 13685 : 382 - 397
  • [50] Sharpness-aware gradient guidance for few-shot class-incremental learning
    Chen, Runhang
    Jing, Xiao-Yuan
    Wu, Fei
    Chen, Haowen
    KNOWLEDGE-BASED SYSTEMS, 2024, 299