Multilayer feature descriptors fusion CNN models for fine-grained visual recognition

被引:5
|
作者
Hou, Yong [1 ]
Luo, Hangzai [1 ]
Zhao, Wanqing [1 ]
Zhang, Xiang [1 ]
Wang, Jun [1 ]
Peng, Jinye [1 ]
机构
[1] Northwest Univ, Sch Informat Sci & Technol, Xian 710127, Shaanxi, Peoples R China
关键词
convolutional neural network; deep learning; dimensionality reduction; fine-grained image classification; multilayer feature descriptors;
D O I
10.1002/cav.1897
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Fine-grained image classification is a challenging topic in the field of computer vision. General models based on first-order local features cannot achieve acceptable performance because the features are not so efficient in capturing fine-grained difference. A bilinear convolutional neural network (CNN) model exhibits that a second-order statistical feature is more efficient in capturing fine-grained difference than a first-order local feature. However, this framework only considers the extraction of a second-order feature descriptor, using a single convolutional layer. The potential effective classification features of other convolutional layers are ignored, resulting in loss of recognition accuracy. In this paper, a multilayer feature descriptors fusion CNN model is proposed. It fully considers the second-order feature descriptors and the first-order local feature descriptor generated by different layers. Experimental verification was carried out on fine-grained classification benchmark data sets, CUB-200-2011, Stanford Cars, and FGVC-aircraft. Compared with the bilinear CNN model, the proposed method has improved accuracy by 0.8%, 1.1%, and 5.5%. Compared with the compact bilinear pooling model, there is an accuracy increase of 0.64%, 1.63%, and 1.45%, respectively. In addition, the proposed model effectively uses multiple 1x1 convolution kernels to reduce dimension. The experimental results show that the multilayer low-dimensional second-order feature descriptors fusion model has comparable recognition accuracy of the original model.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] A Two-Stage Approach for Fine-Grained Visual Recognition via Confidence Ranking and Fusion
    Sun, Kangbo
    Zhu, Jie
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (12) : 2693 - 2700
  • [42] Meta label associated loss for fine-grained visual recognition
    Li, Yanchao
    Xiao, Fu
    Li, Hao
    Li, Qun
    Yu, Shui
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (06)
  • [43] Leveraging Fine-Grained Labels to Regularize Fine-Grained Visual Classification
    Wu, Junfeng
    Yao, Li
    Liu, Bin
    Ding, Zheyuan
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION (ICCMS 2019) AND 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND APPLICATIONS (ICICA 2019), 2019, : 133 - 136
  • [44] Meta label associated loss for fine-grained visual recognition
    Yanchao LI
    Fu XIAO
    Hao LI
    Qun LI
    Shui YU
    [J]. Science China(Information Sciences), 2024, 67 (06) : 230 - 247
  • [45] Multi-View Active Fine-Grained Visual Recognition
    Du, Ruoyi
    Yu, Wenqing
    Wang, Heqing
    Lin, Ting-En
    Chang, Dongliang
    Ma, Zhanyu
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1568 - 1578
  • [46] On the Eigenvalues of Global Covariance Pooling for Fine-Grained Visual Recognition
    Song, Yue
    Sebe, Nicu
    Wang, Wei
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3554 - 3566
  • [47] Audio Visual Attribute Discovery for Fine-Grained Object Recognition
    Zhang, Hua
    Cao, Xiaochun
    Wang, Rui
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7542 - 7549
  • [48] A Fine-Grained Visual Attention Approach for Fingerspelling Recognition in the Wild
    Gajurel, Kamala
    Zhong, Cuncong
    Wang, Guanghui
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 3266 - 3271
  • [49] Multiple Granularity Descriptors for Fine-grained Categorization
    Wang, Dequan
    Shen, Zhiqiang
    Shao, Jie
    Zhang, Wei
    Xue, Xiangyang
    Zhang, Zheng
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2399 - 2406
  • [50] MFVT: Multilevel Feature Fusion Vision Transformer and RAMix Data Augmentation for Fine-Grained Visual Categorization
    Lv, Xinyao
    Xia, Hao
    Li, Na
    Li, Xudong
    Lan, Ruoming
    [J]. ELECTRONICS, 2022, 11 (21)