Fine-Grained Recognition With Learnable Semantic Data Augmentation

被引:8
|
作者
Pu, Yifan [1 ]
Han, Yizeng [1 ]
Wang, Yulin [1 ]
Feng, Junlan [2 ]
Deng, Chao [2 ]
Huang, Gao [1 ]
机构
[1] Tsinghua Univ, Dept Automat, BNRist, Beijing 100084, Peoples R China
[2] China Mobile Res Inst, Beijing 100053, Peoples R China
关键词
Fine-grained recognition; data augmentation; meta-learning; deep learning; CLASSIFICATION; IMAGE;
D O I
10.1109/TIP.2024.3364500
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained image recognition is a longstanding computer vision challenge that focuses on differentiating objects belonging to multiple subordinate categories within the same meta-category. Since images belonging to the same meta-category usually share similar visual appearances, mining discriminative visual cues is the key to distinguishing fine-grained categories. Although commonly used image-level data augmentation techniques have achieved great success in generic image classification problems, they are rarely applied in fine-grained scenarios, because their random editing-region behavior is prone to destroy the discriminative visual cues residing in the subtle regions. In this paper, we propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem. Specifically, we produce diversified augmented samples by translating image features along semantically meaningful directions. The semantic directions are estimated with a covariance prediction network, which predicts a sample-wise covariance matrix to adapt to the large intra-class variation inherent in fine-grained images. Furthermore, the covariance prediction network is jointly optimized with the classification network in a meta-learning manner to alleviate the degenerate solution problem. Experiments on four competitive fine-grained recognition benchmarks (CUB-200-2011, Stanford Cars, FGVC Aircrafts, NABirds) demonstrate that our method significantly improves the generalization performance on several popular classification networks (e.g., ResNets, DenseNets, EfficientNets, RegNets and ViT). Combined with a recently proposed method, our semantic data augmentation approach achieves state-of-the-art performance on the CUB-200-2011 dataset. Source code is available at https://github.com/LeapLabTHU/LearnableISDA.
引用
收藏
页码:3130 / 3144
页数:15
相关论文
共 50 条
  • [41] Fine-Grained Entity Typing via Label Noise Reduction and Data Augmentation
    Li, Haoyang
    Lin, Xueling
    Chen, Lei
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT I, 2021, 12681 : 356 - 374
  • [42] Robust learning from noisy web data for fine-Grained recognition
    Cai, Zhenhuang
    Xie, Guo-Sen
    Huang, Xingguo
    Huang, Dan
    Yao, Yazhou
    Tang, Zhenmin
    PATTERN RECOGNITION, 2023, 134
  • [43] Human Action Recognition Using Deep Data: A Fine-Grained Study
    Rao, D. Surendra
    Potturu, Sudharsana Rao
    Bhagyaraju, V
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (06): : 97 - 108
  • [44] Saliency for fine-grained object recognition in domains with scarce training data
    Figueroa Flores, Carola
    Gonzalez-Garcia, Abel
    van de Weijer, Joost
    Raducanu, Bogdan
    PATTERN RECOGNITION, 2019, 94 : 62 - 73
  • [45] Data for Image Recognition Tasks: An Efficient Tool for Fine-Grained Annotations
    Filax, Marco
    Gonschorek, Tim
    Ortmeier, Frank
    ICPRAM: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2019, : 900 - 907
  • [46] Fine-Grained Text Classification Based on Label Augmentation
    Guo, Ruiqiang
    Yang, Shilong
    Jia, Xiaowen
    Wei, Qianqiang
    Computer Engineering and Applications, 60 (21): : 134 - 141
  • [47] SPDA-CNN: Unifying Semantic Part Detection and Abstraction for Fine-grained Recognition
    Zhang, Han
    Xu, Tao
    Elhoseiny, Mohamed
    Huang, Xiaolei
    Zhang, Shaoting
    Elgammal, Ahmed
    Metaxas, Dimitris
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1143 - 1152
  • [48] Fine-grained Activities Recognition with Coarse-grained Labeled Multi-modal Data
    Hu, Zhizhang
    Yu, Tong
    Zhang, Yue
    Pan, Shijia
    UBICOMP/ISWC '20 ADJUNCT: PROCEEDINGS OF THE 2020 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2020 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2020, : 644 - 649
  • [49] Fine-Grained Facial Expression Recognition in the Wild
    Liang, Liqian
    Lang, Congyan
    Li, Yidong
    Feng, Songhe
    Zhao, Jian
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 482 - 494
  • [50] Fine-Grained Named Entity Recognition for Sinhala
    Azeez, Rameela
    Ranathunga, Surangika
    MERCON 2020: 6TH INTERNATIONAL MULTIDISCIPLINARY MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON), 2020, : 295 - 300