Refined probability distribution module for fine-grained visual categorization

被引:2
|
作者
Zhao, Peipei [1 ]
Miao, Qiguang [1 ]
Li, Hongsheng [2 ]
Liu, Ruyi [1 ]
Quan, Yining [1 ]
Song, Jianfeng [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Shaanxi, Peoples R China
[2] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
基金
中国博士后科学基金; 国家重点研发计划;
关键词
Image -to -image similarity scores; Batch random walk; Deep learning; Fine-grained visual categorization; PERSON REIDENTIFICATION;
D O I
10.1016/j.neucom.2022.10.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization is an important task in computer vision. Prior works on fine-grained visual categorization have paid much attention to addressing intra-class variation and inter-class similar-ity. However, they rarely study that task from the perspective of probability distribution. In this paper, we propose a novel refined probability distribution module based on deep convolutional neural network. Our module computes the probability of an image by fully utilizing the similarity information between images. Firstly, we use deep neural networks to obtain the initial probability distribution and extract fea-tures. Then, we build a network whose inputs are features for calculating image-to-image similarity scores. Finally, our module refines the initial probability distribution based on an effective batch random walk operation with similarity scores. Our module can be plugged into many deep convolutional neural networks. Experimental results show that our approach outperforms state-of-the-art methods on the CUB-200-2011, FGVC-Aircraft and Stanford Cars datasets respectively.CO 2022 Published by Elsevier B.V.
引用
收藏
页码:533 / 544
页数:12
相关论文
共 50 条
  • [41] VegFru: A Domain-Specific Dataset for Fine-grained Visual Categorization
    Hou, Saihui
    Feng, Yushan
    Wang, Zilei
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 541 - 549
  • [42] Fair Comparison: Quantifying Variance in Results for Fine-grained Visual Categorization
    Gwilliam, Matthew
    Teuscher, Adam
    Anderson, Connor
    Farrell, Ryan
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3308 - 3317
  • [43] Data-free Knowledge Distillation for Fine-grained Visual Categorization
    Shao, Renrong
    Zhang, Wei
    Yin, Jianhua
    Wang, Jun
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1515 - 1525
  • [44] DSP: Discriminative Spatial Part modeling for Fine-Grained Visual Categorization
    Yao, Hantao
    Zhang, Dongming
    Li, Jintao
    Zhou, Jianshe
    Zhang, Shiliang
    Zhang, Yongdong
    IMAGE AND VISION COMPUTING, 2017, 63 : 24 - 37
  • [45] Increasingly Specialized Generative Adversarial Network for fine-grained visual categorization
    Lin, Zhongqi
    Gao, Wanlin
    Huang, Feng
    Jia, Jingdun
    KNOWLEDGE-BASED SYSTEMS, 2021, 232
  • [46] Local Alignments for Fine-Grained Categorization
    Efstratios Gavves
    Basura Fernando
    Cees G. M. Snoek
    Arnold W. M. Smeulders
    Tinne Tuytelaars
    International Journal of Computer Vision, 2015, 111 : 191 - 212
  • [47] Local Alignments for Fine-Grained Categorization
    Gavves, Efstratios
    Fernando, Basura
    Snoek, Cees G. M.
    Smeulders, Arnold W. M.
    Tuytelaars, Tinne
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (02) : 191 - 212
  • [48] A Survey of Fine-Grained Image Categorization
    Zheng, Min
    Li, Qingyong
    Geng, Yangli-ao
    Yu, Haomin
    Wang, Jianzhu
    Gan, Jinrui
    Xue, Wenyuan
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 533 - 538
  • [49] Exploring part-aware segmentation for fine-grained visual categorization
    Cheng Pang
    Hongxun Yao
    Xiaoshuai Sun
    Sicheng Zhao
    Yanhao Zhang
    Multimedia Tools and Applications, 2018, 77 : 30291 - 30310
  • [50] PFNet: a novel part fusion network for fine-grained visual categorization
    Liang, Jingyun
    Guo, Jinlin
    Guo, Yanming
    Lao, Songyang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 33397 - 33416