Robust Fine-Grained Visual Recognition With Neighbor-Attention Label Correction

被引:0
|
作者
Mao, Shunan [1 ]
Zhang, Shiliang [1 ]
机构
[1] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing 100871, Peoples R China
关键词
Training; Noise measurement; Visualization; Optimization; Task analysis; Feature extraction; Annotations; Noisy label; meta learning; image retrieval; person re-id; semantic segmentation; PERSON REIDENTIFICATION; MEMORY;
D O I
10.1109/TIP.2024.3378461
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing deep learning methods for fine-grained visual recognition often rely on large-scale, well-annotated training data. Obtaining fine-grained annotations in the wild typically requires concentration and expertise, such as fine category annotation for species recognition, instance annotation for person re-identification (re-id) and dense annotation for segmentation, which inevitably leads to label noise. This paper aims to tackle label noise in deep model training for fine-grained visual recognition. We propose a Neighbor-Attention Label Correction (NALC) model to correct labels during the training stage. NALC samples a training batch and a validation batch from the training set. It hence leverages a meta-learning framework to correct labels in the training batch based on the validation batch. To enhance the optimization efficiency, we introduce a novel nested optimization algorithm for the meta-learning framework. The proposed training procedure consistently improves label accuracy in the training batch, consequently enhancing the learned image representation. Experimental results demonstrate that our method significantly increases label accuracy from 70% to over 98% and outperforms recent approaches by up to 13.4% in mean Average Precision (mAP) on various fine-grained image retrieval (FGIR) tasks, including instance retrieval on CUB200 and person re-id on Market1501. We also demonstrate the efficacy of NALC on noisy semantic segmentation datasets generated from Cityscapes, where it achieves a significant 7.8% improvement in mIOU score. NALC also exhibits robustness to different types of noise, including simulated noise such as Asymmetric, Pair-Flip, and Pattern noise, as well as practical noisy labels generated by tracklets and clustering.
引用
收藏
页码:2614 / 2626
页数:13
相关论文
共 50 条
  • [1] ProtoSimi: label correction for fine-grained visual categorization
    Shen, Jialiang
    Yao, Yu
    Huang, Shaoli
    Wang, Zhiyong
    Zhang, Jing
    Wang, Ruxing
    Yu, Jun
    Liu, Tongliang
    [J]. MACHINE LEARNING, 2024, 113 (04) : 1903 - 1920
  • [2] ProtoSimi: label correction for fine-grained visual categorization
    Jialiang Shen
    Yu Yao
    Shaoli Huang
    Zhiyong Wang
    Jing Zhang
    Ruxing Wang
    Jun Yu
    Tongliang Liu
    [J]. Machine Learning, 2024, 113 : 1903 - 1920
  • [3] Meta label associated loss for fine-grained visual recognition
    Li, Yanchao
    Xiao, Fu
    Li, Hao
    Li, Qun
    Yu, Shui
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (06)
  • [4] Meta label associated loss for fine-grained visual recognition
    Yanchao LI
    Fu XIAO
    Hao LI
    Qun LI
    Shui YU
    [J]. Science China(Information Sciences), 2024, 67 (06) : 230 - 247
  • [5] A Fine-Grained Visual Attention Approach for Fingerspelling Recognition in the Wild
    Gajurel, Kamala
    Zhong, Cuncong
    Wang, Guanghui
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 3266 - 3271
  • [6] A Streamlined Attention Mechanism for Image Classification and Fine-Grained Visual Recognition
    Dakshayani Himabindu, D.
    Praveen Kumar, S.
    [J]. Mendel, 2021, 27 (02) : 59 - 67
  • [7] Robust fine-grained visual recognition with images based on internet of things
    Cai, Zhenhuang
    Yan, Shuai
    Huang, Dan
    [J]. COMPUTATIONAL INTELLIGENCE, 2024, 40 (02)
  • [8] A weakly supervised spatial group attention network for fine-grained visual recognition
    Xie, Jiangjian
    Zhong, Yujie
    Zhang, Junguo
    Zhang, Changchun
    Schuller, Bjoern W.
    [J]. APPLIED INTELLIGENCE, 2023, 53 (20) : 23301 - 23315
  • [9] Attention-Guided Spatial Transformer Networks for Fine-Grained Visual Recognition
    Liu, Dichao
    Wang, Yu
    Kato, Jien
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (12) : 2577 - 2586
  • [10] A weakly supervised spatial group attention network for fine-grained visual recognition
    Jiangjian Xie
    Yujie Zhong
    Junguo Zhang
    Changchun Zhang
    Björn W Schuller
    [J]. Applied Intelligence, 2023, 53 : 23301 - 23315