Fast Fine-Grained Image Classification via Weakly Supervised Discriminative Localization

被引:55
|
作者
He, Xiangteng [1 ]
Peng, Yuxin [1 ]
Zhao, Junjie [1 ]
机构
[1] Peking Univ, Inst Comp Sci & Technol, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
Fast fine-grained image classification; weakly supervised discriminative localization; multi-level attention; FEATURES; CNNS;
D O I
10.1109/TCSVT.2018.2834480
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Fine-grained image classification is to recognize hundreds of subcategories in each basic-level category. Existing methods employ discriminative localization to find the key distinctions between similar subcategories. However, they generally have two limitations: 1) discriminative localization relies on region proposal methods to hypothesize the locations of discriminative regions, which are time-consuming and the bottleneck of improving classification speed and 2) the training of discriminative localization depends on object or part annotations which are heavily labor-consuming and the obstacle of marching toward practical application. It is highly challenging to address the two limitations simultaneously, while existing methods only focus on one of them. Therefore, we propose a weakly supervised discriminative localization approach (WSDL) for fast fine-grained image classification to address the two limitations at the same time, and its main advantages are: 1) multi-level attention guided localization learning is proposed to localize discriminative regions with different focuses automatically, without using object and part annotations, avoiding the labor consumption. Different level attentions focus on different characteristics of the image, which are complementary and boost classification accuracy and 2) n-pathway end-to-end discriminative localization network is proposed to improve classification speed, which simultaneously localizes multiple different discriminative regions for one image to boost classification accuracy, and shares full-image convolutional features generated by a region proposal network to accelerate the process of generating region proposals as well as reduce the computation of convolutional operation. Both are jointly employed to simultaneously improve classification speed and eliminate dependence on object and part annotations. Comparing with state-of-the-art methods on two widely used fine-grained image classification data sets, our WSDL approach achieves the best accuracy and the efficiency of classification.
引用
收藏
页码:1394 / 1407
页数:14
相关论文
共 50 条
  • [41] Discriminative feature mining hashing for fine-grained image retrieval
    Lang, Wenxi
    Sun, Han
    Xu, Can
    Liu, Ningzhong
    Zhou, Huiyu
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [42] Weakly supervised fine-grained semantic segmentation via spatial correlation-guided learning
    Dong, Zihao
    Fang, Tiyu
    Li, Jinping
    Shao, Xiuli
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 236
  • [43] Image Classification With Tailored Fine-Grained Dictionaries
    Shu, Xiangbo
    Tang, Jinhui
    Qi, Guo-Jun
    Li, Zechao
    Jiang, Yu-Gang
    Yan, Shuicheng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (02) : 454 - 467
  • [44] Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN
    He, Xiangteng
    Peng, Yuxin
    Zhao, Junjie
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 627 - 635
  • [45] Weakly Supervised Fine-grained Recognition in a Segmentation-attention Network
    Yu, Nannan
    Zhang, Wenfeng
    Cai, Huanhuan
    [J]. ICMLC 2020: 2020 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2018, : 324 - 329
  • [46] Fine-grained visual classification via multilayer bilinear pooling with object localization
    Li, Ming
    Lei, Lin
    Sun, Hao
    Li, Xiao
    Kuang, Gangyao
    [J]. VISUAL COMPUTER, 2022, 38 (03): : 811 - 820
  • [47] Fine-grained visual classification via multilayer bilinear pooling with object localization
    Ming Li
    Lin Lei
    Hao Sun
    Xiao Li
    Gangyao Kuang
    [J]. The Visual Computer, 2022, 38 : 811 - 820
  • [48] Learning How to Zoom In: Weakly Supervised ROI-Based-DAM for Fine-Grained Visual Classification
    Chen, Wenjie
    Ran, Shuang
    Wang, Tian
    Cao, Lihong
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 118 - 130
  • [49] Weakly supervised instance attention for multisource fine-grained object recognition with an application to tree species classification
    Aygunes, Bulut
    Cinbis, Ramazan Gokberk
    Aksoy, Selim
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 176 : 262 - 274
  • [50] Improving classification with semi-supervised and fine-grained learning
    Lai, Danyu
    Tian, Wei
    Chen, Long
    [J]. PATTERN RECOGNITION, 2019, 88 : 547 - 556