Fine-grained image classification based on TinyVit object location and graph convolution network

被引:0
|
作者
Zheng, Shijie [1 ]
Wang, Gaocai [1 ]
Yuan, Yujian [1 ]
Huang, Shuqiang [2 ]
机构
[1] Guangxi Univ, Sch Comp & Elect & Informat, Nanning 530004, Peoples R China
[2] Jinan Univ, Coll Cyber Secur, Guangzhou 510632, Peoples R China
基金
中国国家自然科学基金;
关键词
Fine-grained image classification; TinyVit; Object location; Spatial relationship feature learning; Graph convolution network;
D O I
10.1016/j.jvcir.2024.104120
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fine-grained image classification is a branch of image classification. Recently, vision transformer has made excellent progress in the field of image recognition. Its self -attention mechanism can extract very effective image feature information. However, feeding fixed -size image blocks into the network introduces additional noise, which is detrimental to extract discriminative features for fine-grained images. The vision transformer's network model is large, making it difficult to utilize in practice. Moreover, many of today's fine-grained image classification methods focus on mining discriminative features while ignoring the connections within the image. To address these problems, we propose a novel method based on the lightweight TinyVit backbone network. Our approach utilizes the self -attention weight values of TinyVit as a guide to construct an effective object location (OL) module that cuts and enlarges the object area, providing the network with the opportunity to concentrate on the local object. Additionally, we employ the graph convolutional network (GCN) to create a spatial relationship feature learning (SRFL) module that captures spatial context information between image blocks in TinyVit with the help of the transformer's self -attention weights. OL and SRFL collaborate to jointly guide the classification task. The experimental results show that the proposed method achieved competitive performance, with the second -highest classification faccuracy on both the CUB -200-2011 and NABirds datasets. When tested on the Stanford Dogs dataset, our approach outperformed many popular methods. Our code is uploaded on https://gith ub.com/hhhj1999/SRFL_OL.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Fine-Grained Image Classification Based on Target Acquisition and Feature Fusion
    Chu, Yan
    Wang, Zhengkui
    Wang, Lina
    Zhao, Qingchao
    Shan, Wen
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, 2021, 12817 : 209 - 221
  • [42] Research on plant seeds recognition based on fine-grained image classification
    Yuan, Min
    Dong, Yongkang
    Lu, Fuxiang
    Zhan, Kun
    Zhu, Liye
    Shen, Jiacheng
    Ren, Dingbang
    Hu, Xiaowen
    Lv, Ningning
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (05)
  • [43] Fine-grained image classification method based on hybrid attention module
    Lu, Weixiang
    Yang, Ying
    Yang, Lei
    [J]. FRONTIERS IN NEUROROBOTICS, 2024, 18
  • [44] Fine-grained Question-Answer sentiment classification with hierarchical graph attention network
    Zeng, Jiandian
    Liu, Tianyi
    Jia, Weijia
    Zhou, Jiantao
    [J]. NEUROCOMPUTING, 2021, 457 : 214 - 224
  • [45] Fine-Grained Image Classification for Crop Disease Based on Attention Mechanism
    Yang, Guofeng
    He, Yong
    Yang, Yong
    Xu, Beibei
    [J]. FRONTIERS IN PLANT SCIENCE, 2020, 11
  • [46] An Encoder-Decoder Convolution Network With Fine-Grained Spatial Information for Hyperspectral Images Classification
    Li, Zhongwei
    Guo, Fangming
    Li, Qi
    Ren, Guangbo
    Wang, Leiquan
    [J]. IEEE ACCESS, 2020, 8 : 33600 - 33608
  • [47] Practical fine-grained learning based anomaly classification for ECG image
    Cao, Qing
    Du, Nan
    Yu, Li
    Zuo, Ming
    Lin, Jingsheng
    Liu, Nathan
    Zhong, Erheng
    Liu, Zizhu
    Chen, Qiaoran
    Shen, Ying
    Chen, Kang
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 119
  • [48] Exploiting Temporal Information for DCNN-based Fine-Grained Object Classification
    Ge, ZongYuan
    McCool, Chris
    Sanderson, Conrad
    Wang, Peng
    Liu, Lingqiao
    Reid, Ian
    Corke, Peter
    [J]. 2016 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2016, : 442 - 447
  • [49] Fine-Grained Object Classification Based on Block Diagonal Feature and Ensemble Learning
    Xie, Xue
    Luan, Beiyi
    Song, Shiqing
    Wang, Jilin
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [50] Fine-Grained Semantic Image Synthesis with Object-Attention Generative Adversarial Network
    Wang, Min
    Lang, Congyan
    Liang, Liqian
    Feng, Songhe
    Wang, Tao
    Gao, Yutong
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (05)