Increasingly Specialized Generative Adversarial Network for fine-grained visual categorization

被引:2
|
作者
Lin, Zhongqi [1 ,2 ]
Gao, Wanlin [1 ,2 ]
Huang, Feng [3 ]
Jia, Jingdun [2 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] Minist Agr & Rural Affairs, Key Lab Agr Informatizat Standardizat, Beijing 100083, Peoples R China
[3] China Agr Univ, Coll Sci, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative adversarial networks; Fine-grained visual categorization; Laplacian residual; Patch proposal; Increasingly specialized perception; MODEL; SEGMENTATION; RECOGNITION;
D O I
10.1016/j.knosys.2021.107480
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization is challenging because the subordinate categories within an entrylevel category can only be distinguished by subtle discriminations. This necessitates to localize key (most discriminative) regions and extract domain-specific features alternately, since implicit to finegrained specialization is the existence of an entry-category visual shared among all classes. Existing methods predominantly implement fine-grained categorization independently, while neglecting that patch proposal and discrimination extraction are mutually correlated and can reinforce each other in an increasingly specialized manner. In this work, we concretize the above pipeline as an Increasing Specialized Generative Adversarial Network (IS-GAN), which recursively shapes a coarse-to-fine representation. It is a three-scale framework consisting of two highlights: a three-player expert GAN at each scale for feature extraction, and a Patch Proposal Network (PPN) between two adjacent scales for target positioning. To better anatomize pixel-to-pixel correlations at various octaves, the Gaussian pyramid and Laplacian pyramid descriptions are also integrated in each GAN. The PPN zooms the areas to shift the focus on the most representative regions by taking previous prediction of classifier as a reference, whilst a finer scale network receives an amplified attended region from previous scale. Overall, IS-GAN is driven by three focal losses from GANs and a converged object-level loss. Experiments demonstrate that IS-GAN can simultaneously (1) deliver competitive categorization performance among state-of the-arts, i.e., validation accuracy achieves 92.23% and testing accuracy achieves 90.27%, and (2) recover fine-grained textures with high Peak Signal-to-Noise Ratios (PSNRs) (32.937) and Structural Similarities (SSIMs) (0.8607) from hand-crafted and public benchmarks. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:24
相关论文
共 50 条
  • [11] FINE-GRAINED VISUAL CATEGORIZATION WITH FINE-TUNED SEGMENTATION
    Li, Lingyun
    Guo, Yanqing
    Xie, Lingxi
    Kong, Xiangwei
    Tian, Qi
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2025 - 2029
  • [12] INCREASINGLY SPECIALIZED ENSEMBLE OF CONVOLUTIONAL NEURAL NETWORKS FOR FINE-GRAINED RECOGNITION
    Simonelli, Andrea
    Messelodi, Stefano
    De Natale, Francesco
    Bulo, Samuel Rota
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 594 - 598
  • [13] Evolving Convolutional Neural Network and Its Application in Fine-Grained Visual Categorization
    Xuan, Qi
    Xiao, Haoquan
    Fu, Chenbo
    Liu, Yi
    IEEE ACCESS, 2018, 6 : 31110 - 31116
  • [14] Squeezed Bilinear Pooling for Fine-Grained Visual Categorization
    Liao, Qiyu
    Wang, Dadong
    Holewa, Hamish
    Xu, Min
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 728 - 732
  • [15] Adaptive Triplet Model for Fine-Grained Visual Categorization
    Liang, Jingyun
    Guo, Jinlin
    Guo, Yanming
    Lao, Songyang
    IEEE ACCESS, 2018, 6 : 76776 - 76786
  • [16] ProtoSimi: label correction for fine-grained visual categorization
    Shen, Jialiang
    Yao, Yu
    Huang, Shaoli
    Wang, Zhiyong
    Zhang, Jing
    Wang, Ruxing
    Yu, Jun
    Liu, Tongliang
    MACHINE LEARNING, 2024, 113 (04) : 1903 - 1920
  • [17] Discriminative Suprasphere Embedding for Fine-Grained Visual Categorization
    Ye, Shuo
    Peng, Qinmu
    Sun, Wenju
    Xu, Jiamiao
    Wang, Yu
    You, Xinge
    Cheung, Yiu-Ming
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5092 - 5102
  • [18] Hierarchical Part Matching for Fine-Grained Visual Categorization
    Xie, Lingxi
    Tian, Qi
    Hong, Richang
    Yan, Shuicheng
    Zhang, Bo
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1641 - 1648
  • [19] Fine-Grained Semantic Image Synthesis with Object-Attention Generative Adversarial Network
    Wang, Min
    Lang, Congyan
    Liang, Liqian
    Feng, Songhe
    Wang, Tao
    Gao, Yutong
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (05)
  • [20] Fine-Grained Visual Categorization of Fasteners in Overhaul Processes
    Taheritanjani, Sajjad
    Haladjian, Juan
    Bruegge, Bernd
    CONFERENCE PROCEEDINGS OF 2019 5TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2019, : 241 - 248