PAMK: Prototype Augmented Multi-Teacher Knowledge Transfer Network for Continual Zero-Shot Learning

被引:0
|
作者
Lu, Junxin [1 ]
Sun, Shiliang [1 ,2 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Continual zero-shot learning; catastrophic forgetting; negative transfer; multi-teacher; prototype augmentation;
D O I
10.1109/TIP.2024.3403053
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Continual zero-shot learning (CZSL) aims to develop a model that accumulates historical knowledge to recognize unseen tasks, while eliminating catastrophic forgetting for seen tasks when learning new tasks. However, existing CZSL methods, while mitigating catastrophic forgetting for old tasks, often lead to negative transfer problem for new tasks by over-focusing on accumulating old knowledge and neglecting the plasticity of the model for learning new tasks. To tackle these problems, we propose PAMK, a prototype augmented multi-teacher knowledge transfer network that strikes a trade-off between recognition stability for old tasks and generalization plasticity for new tasks. PAMK consists of a prototype augmented contrastive generation (PACG) module and a multi-teacher knowledge transfer (MKT) module. To reduce the cumulative semantic decay of the class representation embedding and mitigate catastrophic forgetting, we propose a continual prototype augmentation strategy based on relevance scores in PACG. Furthermore, by introducing the prototype augmented semantic-visual contrastive loss, PACG promotes intra-class compactness for all classes across all tasks. MKT effectively accumulates semantic knowledge learned from old tasks to recognize new tasks via the proposed multi-teacher knowledge transfer, eliminating the negative transfer problem. Extensive experiments on various CZSL settings demonstrate the superior performance of PAMK compared to state-of-the-art methods. In particular, in the practical task-free CZSL setting, PAMK achieves impressive gains of 3.28%, 3.09% and 3.71% in mean harmonic accuracy on the CUB, AWA1, and AWA2 datasets, respectively.
引用
收藏
页码:3353 / 3368
页数:16
相关论文
共 50 条
  • [1] Relational Knowledge Transfer for Zero-Shot Learning
    Wang, Donghui
    Li, Yanan
    Lin, Yuetan
    Zhuang, Yueting
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2145 - 2151
  • [2] Prototype-Augmented Self-Supervised Generative Network for Generalized Zero-Shot Learning
    Wu, Jiamin
    Zhang, Tianzhu
    Zha, Zheng-Jun
    Luo, Jiebo
    Zhang, Yongdong
    Wu, Feng
    [J]. IEEE Transactions on Image Processing, 2024, 33 : 1938 - 1951
  • [3] Prototype-Augmented Self-Supervised Generative Network for Generalized Zero-Shot Learning
    Wu, Jiamin
    Zhang, Tianzhu
    Zha, Zheng-Jun
    Luo, Jiebo
    Zhang, Yongdong
    Wu, Feng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1938 - 1951
  • [4] HAPZSL: A hybrid attention prototype network for knowledge graph zero-shot relational learning
    Li, Xuewei
    Ma, Jinming
    Yu, Jian
    Xu, Tianyi
    Zhao, Mankun
    Liu, Hongwei
    Yu, Mei
    Yu, Ruiguo
    [J]. NEUROCOMPUTING, 2022, 508 : 324 - 336
  • [5] Prototype rectification for zero-shot learning
    Yi, Yuanyuan
    Zeng, Guolei
    Ren, Bocheng
    Yang, Laurence T.
    Chai, Bin
    Li, Yuxin
    [J]. PATTERN RECOGNITION, 2024, 156
  • [6] Dual Progressive Prototype Network for Generalized Zero-Shot Learning
    Wang, Chaoqun
    Mina, Shaobo
    Chenl, Xuejin
    Sun, Xiaoyan
    Li, Houqiang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [7] Zero-shot Learning via Recurrent Knowledge Transfer
    Zhao, Bo
    Sun, Xinwei
    Hong, Xiaopeng
    Yao, Yuan
    Wang, Yizhou
    [J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1308 - 1317
  • [8] A Multi-Teacher Policy Distillation Framework for Enhancing Zero-Shot Generalization of Autonomous Driving Policies
    Yang, Jiachen
    Zhang, Jipeng
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (07) : 9734 - 9746
  • [9] Residual-Prototype Generating Network for Generalized Zero-Shot Learning
    Zhang, Zeqing
    Li, Xiaofan
    Ma, Tai
    Gao, Zuodong
    Li, Cuihua
    Lin, Weiwei
    [J]. MATHEMATICS, 2022, 10 (19)
  • [10] Domain-Aware Prototype Network for Generalized Zero-Shot Learning
    Hu, Yongli
    Feng, Lincong
    Jiang, Huajie
    Liu, Mengting
    Yin, Baocai
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3180 - 3191