Dynamic collaborative learning with heterogeneous knowledge transfer for long-tailed visual recognition

Cited by: 1
Authors
Zhou, Hao [1 ]
Luo, Tingjin [2 ]
He, Yongming [3 ]
Affiliations
[1] Naval Univ Engn, Dept Operat Res & Planning, Wuhan, Hubei, Peoples R China
[2] Natl Univ Def Technol, Coll Sci, Changsha, Hunan, Peoples R China
[3] Natl Univ Def Technol, Coll Syst Engn, Changsha, Hunan, Peoples R China
Keywords
Long-tailed recognition; Heterogeneous knowledge transfer; Dynamic adaptive sinusoidal weight; Multi-experts collaboration; Inference uncertainty and complexity;
DOI
10.1016/j.inffus.2024.102734
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Long-tailed visual recognition with deep convolutional neural networks remains a challenging task. Although multi-expert models, a mainstream approach, achieve state-of-the-art (SOTA) accuracy on this problem, the uncertainty in network learning and the complexity of fusion inference constrain their performance and practicality. To remedy this, we propose a novel dynamic collaborative learning with heterogeneous knowledge transfer model (DCHKT), in which experts with different expertise collaborate to make predictions. DCHKT consists of two core components: dynamic adaptive weight adjustment and heterogeneous knowledge transfer learning. First, the dynamic adaptive weight adjustment shifts the focus of model training between the global expert and the domain experts via a dynamic adaptive weight. By modulating the trade-off between feature learning and classifier learning, it enhances the discriminative ability of each expert and alleviates the uncertainty in model learning. Then, heterogeneous knowledge transfer learning measures the distribution differences between the fused logits of the multiple experts and the predicted logits of each expert with a different specialty. This achieves message passing between experts and enhances the consistency of the ensemble prediction across model training and inference, promoting collaboration among the experts. Finally, extensive experimental results on public long-tailed datasets (CIFAR-LT, ImageNet-LT, Places-LT and iNaturalist2018) demonstrate the effectiveness and superiority of DCHKT.
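The abstract does not give formulas for either component, so the following Python sketch is only an illustration of the two ideas under stated assumptions, not the authors' implementation. It assumes a sine-shaped weight that rises from 0 to 1 over training (suggested only by the keyword "Dynamic adaptive sinusoidal weight"), assumes the fused logits are a plain average of the experts' logits, and assumes the "distribution difference" is a KL divergence from the fused prediction to each expert's prediction. The names `sinusoidal_weight` and `transfer_loss` are hypothetical.

```python
import math


def sinusoidal_weight(epoch, total_epochs):
    """Hypothetical schedule: rises smoothly from 0 to 1 along a quarter sine wave."""
    return math.sin(0.5 * math.pi * epoch / total_epochs)


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def transfer_loss(expert_logits):
    """Mean KL(fused || expert) over experts.

    Assumption: the fused logits are the element-wise average of the
    experts' logits; the paper may fuse them differently.
    """
    n = len(expert_logits)
    num_classes = len(expert_logits[0])
    fused = [sum(e[c] for e in expert_logits) / n for c in range(num_classes)]
    p_fused = softmax(fused)
    return sum(kl_divergence(p_fused, softmax(e)) for e in expert_logits) / n


if __name__ == "__main__":
    # Two hypothetical experts that disagree on a 3-class problem.
    experts = [[2.0, 0.0, 0.0], [0.0, 2.0, 0.0]]
    for epoch in (0, 50, 100):
        w = sinusoidal_weight(epoch, 100)
        print(epoch, round(w, 3), round(w * transfer_loss(experts), 4))
```

In this sketch the transfer term is zero when all experts agree and grows as their predictions diverge, so scaling it by the schedule gradually increases the pressure on the experts to agree as training proceeds.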
Pages: 12
Related papers
50 in total
  • [1] Nested Collaborative Learning for Long-Tailed Visual Recognition
    Li, Jun
    Tan, Zichang
    Wan, Jun
    Lei, Zhen
    Guo, Guodong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6939 - 6948
  • [2] bt-vMF Contrastive and Collaborative Learning for Long-Tailed Visual Recognition
    Du, Jinhao
    Luo, Guibo
    Zhu, Yuesheng
    Bai, Zhiqiang
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 573 - 577
  • [3] NCL++: Nested Collaborative Learning for long-tailed visual recognition
    Tan, Zichang
    Li, Jun
    Du, Jinhao
    Wan, Jun
    Lei, Zhen
    Guo, Guodong
    PATTERN RECOGNITION, 2024, 147
  • [4] Towards Effective Collaborative Learning in Long-Tailed Recognition
    Xu, Zhengzhuo
    Chai, Zenghao
    Xu, Chengyin
    Yuan, Chun
    Yang, Haiqin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3754 - 3764
  • [5] Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation
    Jin, Yan
    Li, Mengke
    Lu, Yang
    Cheung, Yiu-ming
    Wang, Hanzi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23695 - 23704
  • [6] Probabilistic Contrastive Learning for Long-Tailed Visual Recognition
    Du, Chaoqun
    Wang, Yulin
    Song, Shiji
    Huang, Gao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 5890 - 5904
  • [7] Balanced Contrastive Learning for Long-Tailed Visual Recognition
    Zhu, Jianggang
    Wang, Zheng
    Chen, Jingjing
    Chen, Yi-Ping Phoebe
    Jiang, Yu-Gang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6898 - 6907
  • [8] Exploring the auxiliary learning for long-tailed visual recognition
    Zhang, Junjie
    Liu, Lingqiao
    Wang, Peng
    Zhang, Jian
    NEUROCOMPUTING, 2021, 449 : 303 - 314
  • [9] Dynamic Learnable Logit Adjustment for Long-Tailed Visual Recognition
    Zhang, Enhao
    Geng, Chuanxing
    Li, Chaohua
    Chen, Songcan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 7986 - 7997
  • [10] Dynamic prior probability network for long-tailed visual recognition
    Zhou, Xuesong
    Sun, Jiaqi
    Zhai, Junhai
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 268