Multi-proxy feature learning for robust fine-grained visual recognition

Authors
Mao, Shunan [1]
Wang, Yaowei [2]
Wang, Xiaoyu [3]
Zhang, Shiliang [1]
Affiliations
[1] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518038, Peoples R China
[3] Intellifusion, Shenzhen 518000, Peoples R China
Keywords
Fine-grained visual recognition; Noisy label; Long tail; Proxy learning
DOI
10.1016/j.patcog.2023.109779
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Visual representations for fine-grained visual recognition can be learned by forcing all samples of the same category into a uniform representation. This strict training objective performs well in the closed-set setting but does not suit data in the wild, which contains noisy annotations and long-tailed distributions; e.g., it may lead to a feature space biased toward head categories. This paper tackles this challenge by pursuing a more balanced and discriminative feature space: it first retains intra-class variances to isolate noise, then eliminates intra-class variances to improve visual recognition performance. We propose the Compact Memory Updater to maintain a memory bank, which memorizes proxy features that represent multiple typical appearances of each category in the training set. The Proxy-based Feature Enhancement then leverages the proxy features to ensure that samples of the same category have similar features. Iteratively running these two modules boosts the robustness and discriminative power of the learnt representation, and hence facilitates various fine-grained visual recognition tasks, including person re-identification (re-id), image classification, and retrieval. Extensive experiments on noisy and long-tailed training sets show that this Multi-Proxy Feature Learning (MPFL) framework achieves promising performance. For instance, on a training set with 90% one-shot categories, MPFL outperforms the recent long-tailed person re-id method LEAP-AF by 16.9% in rank-1 accuracy.
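The abstract gives no implementation details, so the following is only a rough illustrative sketch of the general idea of keeping several proxy features per category and then pulling sample features toward them. It is not the authors' Compact Memory Updater or Proxy-based Feature Enhancement. The class name `ProxyMemory`, the cosine-similarity assignment rule, and the parameters `max_proxies`, `new_proxy_threshold`, `momentum`, and `weight` are all assumptions made for illustration.

```python
import math


def normalize(v):
    """Return v scaled to unit L2 norm (a copy of v if its norm is zero)."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v] if n > 0 else list(v)


def cosine(a, b):
    """Cosine similarity between two vectors."""
    return sum(x * y for x, y in zip(normalize(a), normalize(b)))


class ProxyMemory:
    """Hypothetical memory bank holding up to `max_proxies` proxies per label.

    This is an assumed design, not the method described in the paper.
    """

    def __init__(self, max_proxies=3, new_proxy_threshold=0.5, momentum=0.9):
        self.max_proxies = max_proxies
        self.new_proxy_threshold = new_proxy_threshold
        self.momentum = momentum
        self.bank = {}  # label -> list of unit-norm proxy vectors

    def update(self, feature, label):
        """Assign a feature to a proxy of its label; return the proxy index."""
        f = normalize(feature)
        proxies = self.bank.setdefault(label, [])
        if not proxies:
            proxies.append(f)
            return 0
        sims = [cosine(f, p) for p in proxies]
        best = max(range(len(proxies)), key=lambda i: sims[i])
        # A dissimilar sample opens a new proxy, so intra-class variance
        # (including possible label noise) is kept apart rather than averaged.
        if sims[best] < self.new_proxy_threshold and len(proxies) < self.max_proxies:
            proxies.append(f)
            return len(proxies) - 1
        # Otherwise blend the sample into its nearest proxy with momentum.
        m = self.momentum
        blended = [m * p + (1 - m) * x for p, x in zip(proxies[best], f)]
        proxies[best] = normalize(blended)
        return best

    def enhance(self, feature, label, weight=0.5):
        """Pull a feature toward the mean of its label's proxies."""
        proxies = self.bank[label]
        center = [sum(col) / len(proxies) for col in zip(*proxies)]
        f = normalize(feature)
        return normalize([(1 - weight) * x + weight * c for x, c in zip(f, center)])


# Usage with toy 2-D features for a single hypothetical label "a".
memory = ProxyMemory(max_proxies=2)
for feat in ([1.0, 0.0], [0.0, 1.0], [0.9, 0.1]):
    memory.update(feat, "a")
enhanced = memory.enhance([1.0, 0.0], "a")
```

In this sketch the update step preserves distinct appearances within a category, while the enhancement step reduces intra-class variance afterwards, mirroring the two-phase order the abstract describes at a high level.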
Pages: 14