Increasingly Specialized Generative Adversarial Network for fine-grained visual categorization

被引:2
|
作者
Lin, Zhongqi [1 ,2 ]
Gao, Wanlin [1 ,2 ]
Huang, Feng [3 ]
Jia, Jingdun [2 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] Minist Agr & Rural Affairs, Key Lab Agr Informatizat Standardizat, Beijing 100083, Peoples R China
[3] China Agr Univ, Coll Sci, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative adversarial networks; Fine-grained visual categorization; Laplacian residual; Patch proposal; Increasingly specialized perception; MODEL; SEGMENTATION; RECOGNITION;
D O I
10.1016/j.knosys.2021.107480
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization is challenging because the subordinate categories within an entrylevel category can only be distinguished by subtle discriminations. This necessitates to localize key (most discriminative) regions and extract domain-specific features alternately, since implicit to finegrained specialization is the existence of an entry-category visual shared among all classes. Existing methods predominantly implement fine-grained categorization independently, while neglecting that patch proposal and discrimination extraction are mutually correlated and can reinforce each other in an increasingly specialized manner. In this work, we concretize the above pipeline as an Increasing Specialized Generative Adversarial Network (IS-GAN), which recursively shapes a coarse-to-fine representation. It is a three-scale framework consisting of two highlights: a three-player expert GAN at each scale for feature extraction, and a Patch Proposal Network (PPN) between two adjacent scales for target positioning. To better anatomize pixel-to-pixel correlations at various octaves, the Gaussian pyramid and Laplacian pyramid descriptions are also integrated in each GAN. The PPN zooms the areas to shift the focus on the most representative regions by taking previous prediction of classifier as a reference, whilst a finer scale network receives an amplified attended region from previous scale. Overall, IS-GAN is driven by three focal losses from GANs and a converged object-level loss. Experiments demonstrate that IS-GAN can simultaneously (1) deliver competitive categorization performance among state-of the-arts, i.e., validation accuracy achieves 92.23% and testing accuracy achieves 90.27%, and (2) recover fine-grained textures with high Peak Signal-to-Noise Ratios (PSNRs) (32.937) and Structural Similarities (SSIMs) (0.8607) from hand-crafted and public benchmarks. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Increasingly Specialized Perception Network for Fine-Grained Visual Categorization of Butterfly Specimens
    Lin, Zhongqi
    Jia, Jingdun
    Gao, Wanlin
    Huang, Feng
    IEEE ACCESS, 2019, 7 : 123367 - 123392
  • [2] Alignment Enhancement Network for Fine-grained Visual Categorization
    Hu, Yutao
    Liu, Xuhui
    Zhang, Baochang
    Han, Jungong
    Cao, Xianbin
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)
  • [3] Multiresolution Discriminative Mixup Network for Fine-Grained Visual Categorization
    Xu, Kunran
    Lai, Rui
    Gu, Lin
    Li, Yishi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3488 - 3500
  • [4] Coupled Generative Adversarial Network for Continuous Fine-grained Action Segmentation
    Gammulle, Harshala
    Fernando, Tharindu
    Denman, Simon
    Sridharan, Sridha
    Fookes, Clinton
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 200 - 209
  • [5] Multiscale attention dynamic aware network for fine-grained visual categorization
    Ou, Jichu
    Li, Wanyi
    Huang, Jingmin
    Huang, Xiaojie
    Xie, Xuan
    ELECTRONICS LETTERS, 2023, 59 (01)
  • [6] PFNet: a novel part fusion network for fine-grained visual categorization
    Jingyun Liang
    Jinlin Guo
    Yanming Guo
    Songyang Lao
    Multimedia Tools and Applications, 2020, 79 : 33397 - 33416
  • [7] PFNet: a novel part fusion network for fine-grained visual categorization
    Liang, Jingyun
    Guo, Jinlin
    Guo, Yanming
    Lao, Songyang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 33397 - 33416
  • [8] Feathers Dataset for Fine-Grained Visual Categorization
    Belko, Alina
    Dobratulin, Konstantin
    Kuznetsov, Andrey
    THIRTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2020), 2021, 11605
  • [9] Fine-grained image inpainting with scale-enhanced generative adversarial network
    Liu, Weirong
    Cao, Chengrui
    Liu, Jie
    Ren, Chenwen
    Wei, Yulin
    Guo, Honglin
    PATTERN RECOGNITION LETTERS, 2021, 143 : 81 - 87
  • [10] Coarse-to-Fine Description for Fine-Grained Visual Categorization
    Yao, Hantao
    Zhang, Shiliang
    Zhang, Yongdong
    Li, Jintao
    Tian, Qi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (10) : 4858 - 4872