Robust Fine-Grained Visual Recognition With Neighbor-Attention Label Correction

被引：0

作者：

Mao, Shunan ^{[1
]}

Zhang, Shiliang ^{[1
]}

机构：

[1] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing 100871, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024年 / 33卷

关键词：

Training; Noise measurement; Visualization; Optimization; Task analysis; Feature extraction; Annotations; Noisy label; meta learning; image retrieval; person re-id; semantic segmentation; PERSON REIDENTIFICATION; MEMORY;

D O I：

10.1109/TIP.2024.3378461

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Existing deep learning methods for fine-grained visual recognition often rely on large-scale, well-annotated training data. Obtaining fine-grained annotations in the wild typically requires concentration and expertise, such as fine category annotation for species recognition, instance annotation for person re-identification (re-id) and dense annotation for segmentation, which inevitably leads to label noise. This paper aims to tackle label noise in deep model training for fine-grained visual recognition. We propose a Neighbor-Attention Label Correction (NALC) model to correct labels during the training stage. NALC samples a training batch and a validation batch from the training set. It hence leverages a meta-learning framework to correct labels in the training batch based on the validation batch. To enhance the optimization efficiency, we introduce a novel nested optimization algorithm for the meta-learning framework. The proposed training procedure consistently improves label accuracy in the training batch, consequently enhancing the learned image representation. Experimental results demonstrate that our method significantly increases label accuracy from 70% to over 98% and outperforms recent approaches by up to 13.4% in mean Average Precision (mAP) on various fine-grained image retrieval (FGIR) tasks, including instance retrieval on CUB200 and person re-id on Market1501. We also demonstrate the efficacy of NALC on noisy semantic segmentation datasets generated from Cityscapes, where it achieves a significant 7.8% improvement in mIOU score. NALC also exhibits robustness to different types of noise, including simulated noise such as Asymmetric, Pair-Flip, and Pattern noise, as well as practical noisy labels generated by tracklets and clustering.

引用

页码：2614 / 2626

页数：13

共 50 条

[1] ProtoSimi: label correction for fine-grained visual categorization
Shen, Jialiang
Yao, Yu
Huang, Shaoli
Wang, Zhiyong
Zhang, Jing
Wang, Ruxing
Yu, Jun
Liu, Tongliang
[J]. MACHINE LEARNING, 2024, 113 (04) : 1903 - 1920
[2] ProtoSimi: label correction for fine-grained visual categorization
Jialiang Shen
Yu Yao
Shaoli Huang
Zhiyong Wang
Jing Zhang
Ruxing Wang
Jun Yu
Tongliang Liu
[J]. Machine Learning, 2024, 113 : 1903 - 1920
[3] Meta label associated loss for fine-grained visual recognition
Li, Yanchao
Xiao, Fu
Li, Hao
Li, Qun
Yu, Shui
[J]. SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (06)
[4] Meta label associated loss for fine-grained visual recognition
Yanchao LI
Fu XIAO
Hao LI
Qun LI
Shui YU
[J]. Science China(Information Sciences), 2024, 67 (06) : 230 - 247
[5] A Fine-Grained Visual Attention Approach for Fingerspelling Recognition in the Wild
Gajurel, Kamala
Zhong, Cuncong
Wang, Guanghui
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 3266 - 3271
[6] A Streamlined Attention Mechanism for Image Classification and Fine-Grained Visual Recognition
Dakshayani Himabindu, D.
Praveen Kumar, S.
[J]. Mendel, 2021, 27 (02) : 59 - 67
[7] Robust fine-grained visual recognition with images based on internet of things
Cai, Zhenhuang
Yan, Shuai
Huang, Dan
[J]. COMPUTATIONAL INTELLIGENCE, 2024, 40 (02)
[8] A weakly supervised spatial group attention network for fine-grained visual recognition
Xie, Jiangjian
Zhong, Yujie
Zhang, Junguo
Zhang, Changchun
Schuller, Bjoern W.
[J]. APPLIED INTELLIGENCE, 2023, 53 (20) : 23301 - 23315
[9] Attention-Guided Spatial Transformer Networks for Fine-Grained Visual Recognition
Liu, Dichao
Wang, Yu
Kato, Jien
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (12) : 2577 - 2586
[10] A weakly supervised spatial group attention network for fine-grained visual recognition
Jiangjian Xie
Yujie Zhong
Junguo Zhang
Changchun Zhang
Björn W Schuller
[J]. Applied Intelligence, 2023, 53 : 23301 - 23315

← 1 2 3 4 5 →