A knowledge-guide hierarchical learning method for long-tailed image classification

Cited by: 16
Authors
Chen, Qiong [1]
Liu, Qingfa [1]
Lin, Enlu [1]
Affiliations
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
Keywords
Imbalanced data; Long-tailed distribution; Image classification;
DOI
10.1016/j.neucom.2021.07.008
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep visual recognition methods have achieved excellent performance on artificially constructed image datasets where the data distribution is balanced. However, in real-world scenarios, the data distribution is usually extremely imbalanced and exhibits a long-tailed distribution in which each head class has more samples than each tail class. Many efficient deep learning methods fail to work well in this setting, i.e., they perform well on the head classes but poorly on the tail classes. In this paper, we propose a two-layer Hierarchical Learning Long-Tailed Recognition (HL-LTR) algorithm, which transforms the long-tailed problem into a hierarchical classification problem by constructing a hierarchical superclass tree in which each layer corresponds to a recognition task. In the first layer of the tree, the degree of data imbalance is greatly reduced. The recognition task of the second layer is the original long-tailed recognition problem. HL-LTR is trained top-down: the knowledge learned by the first layer is transferred to the classes of the second layer and guides the feature learning of the second layer through an attention mechanism module and knowledge distillation. Compared with directly solving the most difficult long-tailed recognition task, HL-LTR achieves better performance owing to its progressive, easy-to-difficult learning and its effective knowledge transfer strategy. (C) 2021 Elsevier B.V. All rights reserved.
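
The abstract's claim that imbalance is lower at the first (superclass) layer can be illustrated with a small Python sketch. This is not the authors' implementation: the class names, sample counts, class-to-superclass mapping, and the helper functions `imbalance_ratio` and `superclass_counts` are all hypothetical, and the abstract does not say how HL-LTR actually builds its superclass tree. The sketch only shows that summing class sizes within superclasses can shrink the ratio between the largest and smallest category.

```python
from collections import defaultdict


def imbalance_ratio(counts):
    """Ratio of the largest to the smallest category size."""
    sizes = list(counts.values())
    return max(sizes) / min(sizes)


def superclass_counts(class_counts, class_to_super):
    """Sum per-class sample counts into per-superclass counts."""
    totals = defaultdict(int)
    for cls, n in class_counts.items():
        totals[class_to_super[cls]] += n
    return dict(totals)


# Hypothetical long-tailed class sizes (illustrative, not from the paper).
class_counts = {"cat": 1000, "dog": 500, "lynx": 20,
                "wolf": 10, "car": 800, "truck": 40}

# Hypothetical two-layer tree: each class belongs to one superclass.
class_to_super = {"cat": "feline", "lynx": "feline",
                  "dog": "canine", "wolf": "canine",
                  "car": "vehicle", "truck": "vehicle"}

# Layer 1 task: classify among superclasses.
layer1 = superclass_counts(class_counts, class_to_super)

print(imbalance_ratio(class_counts))  # 100.0 (second layer, original classes)
print(imbalance_ratio(layer1))        # 2.0   (first layer, superclasses)
```

In this toy setting the second-layer task keeps the original 100:1 imbalance, while the first-layer task sees only 2:1, which matches the easy-to-difficult ordering the abstract describes. How much the ratio drops in practice depends on how the superclasses are formed.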
Pages: 408-418
Page count: 11
Related papers
50 in total
  • [41] Probability Guided Loss for Long-Tailed Multi-Label Image Classification
    Lin, Dekun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1577 - 1585
  • [42] Effect of Stage Training for Long-Tailed Multi-Label Image Classification
    Yamagishi, Yosuke
    Hanaoka, Shohei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2713 - 2720
  • [43] Inverse Image Frequency for Long-Tailed Image Recognition
    Alexandridis, Konstantinos Panagiotis
    Luo, Shan
    Nguyen, Anh
    Deng, Jiankang
    Zafeiriou, Stefanos
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5721 - 5736
  • [44] A Novel Semi-Supervised Long-Tailed Learning Framework With Spatial Neighborhood Information for Hyperspectral Image Classification
    Feng, Yining
    Song, Ruoxi
    Ni, Weihan
    Zhu, Junheng
    Wang, Xianghai
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [45] Hierarchical Equalization Loss for Long-Tailed Instance Segmentation
    Zhao, Yaochi
    Chen, Sen
    Liu, Shiguang
    Hu, Zhuhua
    Xia, Jingwen
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6943 - 6955
  • [46] Learning Only When It Matters: Cost-Aware Long-Tailed Classification
    He, Yu-Cheng
    Ding, Yao-Xiang
    Ye, Han-Jia
    Zhou, Zhi-Hua
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 12411 - 12420
  • [47] Hierarchical long-tailed classification based on multi-granularity knowledge transfer driven by multi-scale feature fusion
    Zhao, Wei
    Zhao, Hong
    PATTERN RECOGNITION, 2024, 145
  • [48] Exploring Contrastive Learning for Long-Tailed Multi-label Text Classification
    Audibert, Alexandre
    Gauffre, Aurelien
    Amini, Massih-Reza
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT VII, ECML PKDD 2024, 2024, 14947 : 245 - 261
  • [49] Dynamic collaborative learning with heterogeneous knowledge transfer for long-tailed visual recognition
    Zhou, Hao
    Luo, Tingjin
    He, Yongming
    INFORMATION FUSION, 2025, 115
  • [50] Learning Multi-Expert Distribution Calibration for Long-Tailed Video Classification
    Hu, Yufan
    Gao, Junyu
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 555 - 567