A coarse-to-fine capsule network for fine-grained image categorization

Cited by: 7
Authors
Lin, Zhongqi [1, 2]
Jia, Jingdun [2]
Huang, Feng [3]
Gao, Wanlin [1, 2]
Affiliations
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] Minist Agr & Rural Affairs, Key Lab Agr Informatizat Standardizat, Beijing 100083, Peoples R China
[3] China Agr Univ, Coll Sci, Beijing 100083, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Capsule network (CapsNet); Fine-grained image classification; Coarse-to-fine attention; Increasingly specialized perception; MODEL;
DOI
10.1016/j.neucom.2021.05.032
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Fine-grained image categorization is challenging because the subordinate categories within an entry-level category can only be distinguished by subtle differences. This necessitates alternately localizing the key (most discriminative) regions and extracting domain-specific features. Existing methods predominantly treat these steps independently, ignoring that representation learning and foreground localization can reinforce each other iteratively. Building on the state-of-the-art performance of capsule encoding for abstract semantic representation, we formalize our pipeline as a coarse-to-fine capsule network (CTF-CapsNet). It consists of customized expert CapsNets, one at each perception scale, and region proposal networks (RPNs) between each pair of adjacent scales. Their mutually motivated self-optimization achieves increasingly specialized cross-utilization of object-level and component-level descriptions. The RPN zooms in on the most distinctive regions, using the information learned by the preceding expert CapsNet as a reference, while the finer-scale model takes as input an amplified attended patch from the previous scale. Overall, CTF-CapsNet is driven by three focal margin losses between the label predictions and the ground truth, and three regeneration losses between the original input images/feature maps and the reconstructed images. Experiments demonstrate that, without any prior knowledge or strongly supervised support (e.g., bounding-box/part annotations), CTF-CapsNet delivers categorization performance competitive with state-of-the-art methods: testing accuracy reaches 89.57%, 88.63%, 90.51%, and 91.53% on our hand-crafted rice growth image set and three public benchmarks (CUB Birds, Stanford Dogs, and Stanford Cars), respectively. © 2021 Elsevier B.V. All rights reserved.
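The loss structure described in the abstract (one classification loss plus one reconstruction loss per scale, summed over three scales) can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not define its "focal margin loss", so the sketch substitutes the plain CapsNet margin loss, and the function names, the mean-squared-error reconstruction term, and the default values (`m_pos`, `m_neg`, `down_weight`, `recon_weight`) are all assumptions chosen for illustration.

```python
def margin_loss(lengths, target, m_pos=0.9, m_neg=0.1, down_weight=0.5):
    """Plain CapsNet margin loss for one sample (stand-in for the
    paper's focal margin loss, which the abstract does not define).

    lengths: capsule output lengths, one per class, each in [0, 1]
    target:  index of the ground-truth class
    """
    total = 0.0
    for k, length in enumerate(lengths):
        if k == target:
            # Penalize a true-class capsule that is shorter than m_pos.
            total += max(0.0, m_pos - length) ** 2
        else:
            # Penalize a wrong-class capsule that is longer than m_neg.
            total += down_weight * max(0.0, length - m_neg) ** 2
    return total


def reconstruction_loss(original, reconstructed):
    """Mean squared error between a flattened input and its reconstruction
    (assumed form of the regeneration loss)."""
    return sum((o - r) ** 2 for o, r in zip(original, reconstructed)) / len(original)


def total_loss(scales, target, recon_weight=0.0005):
    """Sum the per-scale losses.

    scales: list of (lengths, original, reconstructed) tuples,
            one per perception scale (three in the abstract).
    recon_weight: hypothetical weight on the reconstruction term.
    """
    return sum(
        margin_loss(lengths, target)
        + recon_weight * reconstruction_loss(original, reconstructed)
        for lengths, original, reconstructed in scales
    )
```

Calling `total_loss` with three `(lengths, original, reconstructed)` tuples gives a single scalar combining all six terms; how the paper actually weights or schedules these terms is not stated in the abstract.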
Pages: 200-219
Page count: 20
Related papers
50 in total
  • [21] Local Alignments for Fine-Grained Categorization
    Gavves, Efstratios
    Fernando, Basura
    Snoek, Cees G. M.
    Smeulders, Arnold W. M.
    Tuytelaars, Tinne
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (02) : 191 - 212
  • [22] Attention-based cropping and erasing learning with coarse-to-fine refinement for fine-grained visual classification
    Chen, Jianpin
    Li, Heng
    Liang, Junlin
    Su, Xiaofan
    Zhai, Zhenzhen
    Chai, Xinyu
    NEUROCOMPUTING, 2022, 501 : 359 - 369
  • [23] A Saliency-based Weakly-supervised Network for Fine-Grained Image Categorization
    Han, Yawen
    Meng, Fang
    2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 270 - 274
  • [24] Multiresolution Discriminative Mixup Network for Fine-Grained Visual Categorization
    Xu, Kunran
    Lai, Rui
    Gu, Lin
    Li, Yishi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3488 - 3500
  • [25] Grafit: Learning fine-grained image representations with coarse labels
    Touvron, Hugo
    Sablayrolles, Alexandre
    Douze, Matthijs
    Cord, Matthieu
    Jegou, Herve
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 854 - 864
  • [26] FINE-GRAINED IMAGE CLASSIFICATION WITH COARSE AND FINE LABELS ON ONE-SHOT LEARNING
    Jiao, Qihan
    Liu, Zhi
    Li, Gongyang
    Ye, Linwei
    Wang, Yang
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,
  • [27] A fine-grained distinction of coarse graining
    Morita, Kohei
    EUROPEAN JOURNAL FOR PHILOSOPHY OF SCIENCE, 2023, 13 (01)
  • [28] A fine-grained distinction of coarse graining
    Kohei Morita
    European Journal for Philosophy of Science, 2023, 13
  • [29] FINE-GRAINED VISUAL CATEGORIZATION WITH FINE-TUNED SEGMENTATION
    Li, Lingyun
    Guo, Yanqing
    Xie, Lingxi
    Kong, Xiangwei
    Tian, Qi
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2025 - 2029
  • [30] Fine-grained Network Traffic Prediction from Coarse Data
    Rusek, Krzysztof
    Drton, Mathias
    AUSTRIAN JOURNAL OF STATISTICS, 2022, : 114 - 123