Fine-Grained Recognition With Learnable Semantic Data Augmentation

Cited by: 8
Authors
Pu, Yifan [1]
Han, Yizeng [1]
Wang, Yulin [1]
Feng, Junlan [2]
Deng, Chao [2]
Huang, Gao [1]
Affiliations
[1] Tsinghua Univ, Dept Automat, BNRist, Beijing 100084, Peoples R China
[2] China Mobile Res Inst, Beijing 100053, Peoples R China
Keywords
Fine-grained recognition; data augmentation; meta-learning; deep learning; CLASSIFICATION; IMAGE
DOI
10.1109/TIP.2024.3364500
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Fine-grained image recognition is a longstanding computer vision challenge that focuses on differentiating objects belonging to multiple subordinate categories within the same meta-category. Since images belonging to the same meta-category usually share similar visual appearances, mining discriminative visual cues is the key to distinguishing fine-grained categories. Although commonly used image-level data augmentation techniques have achieved great success in generic image classification problems, they are rarely applied in fine-grained scenarios, because they edit randomly chosen regions and are therefore prone to destroying the discriminative visual cues residing in subtle regions. In this paper, we propose diversifying the training data at the feature level to alleviate the discriminative region loss problem. Specifically, we produce diversified augmented samples by translating image features along semantically meaningful directions. The semantic directions are estimated with a covariance prediction network, which predicts a sample-wise covariance matrix to adapt to the large intra-class variation inherent in fine-grained images. Furthermore, the covariance prediction network is jointly optimized with the classification network in a meta-learning manner to alleviate the degenerate solution problem. Experiments on four competitive fine-grained recognition benchmarks (CUB-200-2011, Stanford Cars, FGVC Aircrafts, NABirds) demonstrate that our method significantly improves the generalization performance of several popular classification networks (e.g., ResNets, DenseNets, EfficientNets, RegNets and ViT). Combined with a recently proposed method, our semantic data augmentation approach achieves state-of-the-art performance on the CUB-200-2011 dataset. Source code is available at https://github.com/LeapLabTHU/LearnableISDA.
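The feature-level augmentation step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a diagonal covariance, takes the per-sample variances as a plain input instead of predicting them with a network, and leaves out the meta-learning optimization entirely. The function name `augment_features` and the parameters `strength` and `seed` are hypothetical names chosen for illustration.

```python
import math
import random


def augment_features(features, variances, strength=0.5, seed=None):
    """Translate each feature vector along a random direction.

    Each direction is drawn from a zero-mean Gaussian whose (diagonal)
    covariance is specific to that sample. In the paper's setting the
    covariance would come from a prediction network; here it is simply
    passed in, which is an assumption of this sketch.
    """
    if len(features) != len(variances):
        raise ValueError("need one variance vector per feature vector")
    rng = random.Random(seed)
    augmented = []
    for feat, var in zip(features, variances):
        if len(feat) != len(var):
            raise ValueError("feature and variance vectors must have equal length")
        if any(v < 0 for v in var):
            raise ValueError("variances must be non-negative")
        # Shift every dimension by Gaussian noise scaled to its own variance.
        augmented.append([
            x + math.sqrt(strength * v) * rng.gauss(0.0, 1.0)
            for x, v in zip(feat, var)
        ])
    return augmented


if __name__ == "__main__":
    feats = [[1.0, 2.0], [3.0, 4.0]]
    # Sample 0 varies only in dimension 0, sample 1 only in dimension 1.
    sample_vars = [[1.0, 0.0], [0.0, 1.0]]
    print(augment_features(feats, sample_vars, seed=0))
```

Dimensions with zero variance are left unchanged, so a sample-wise covariance lets each image vary only along the directions considered meaningful for that image.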
Pages: 3130 - 3144
Number of pages: 15