Fine-grained image classification based on TinyVit object location and graph convolution network

被引：0

作者：

Zheng, Shijie ^{[1
]}

Wang, Gaocai ^{[1
]}

Yuan, Yujian ^{[1
]}

Huang, Shuqiang ^{[2
]}

机构：

[1] Guangxi Univ, Sch Comp & Elect & Informat, Nanning 530004, Peoples R China

[2] Jinan Univ, Coll Cyber Secur, Guangzhou 510632, Peoples R China

来源：

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2024年 / 100卷

基金：

中国国家自然科学基金;

关键词：

Fine-grained image classification; TinyVit; Object location; Spatial relationship feature learning; Graph convolution network;

D O I：

10.1016/j.jvcir.2024.104120

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Fine-grained image classification is a branch of image classification. Recently, vision transformer has made excellent progress in the field of image recognition. Its self -attention mechanism can extract very effective image feature information. However, feeding fixed -size image blocks into the network introduces additional noise, which is detrimental to extract discriminative features for fine-grained images. The vision transformer's network model is large, making it difficult to utilize in practice. Moreover, many of today's fine-grained image classification methods focus on mining discriminative features while ignoring the connections within the image. To address these problems, we propose a novel method based on the lightweight TinyVit backbone network. Our approach utilizes the self -attention weight values of TinyVit as a guide to construct an effective object location (OL) module that cuts and enlarges the object area, providing the network with the opportunity to concentrate on the local object. Additionally, we employ the graph convolutional network (GCN) to create a spatial relationship feature learning (SRFL) module that captures spatial context information between image blocks in TinyVit with the help of the transformer's self -attention weights. OL and SRFL collaborate to jointly guide the classification task. The experimental results show that the proposed method achieved competitive performance, with the second -highest classification faccuracy on both the CUB -200-2011 and NABirds datasets. When tested on the Stanford Dogs dataset, our approach outperformed many popular methods. Our code is uploaded on https://gith ub.com/hhhj1999/SRFL_OL.

引用

页数：11

共 50 条

[41] Fine-Grained Image Classification Based on Target Acquisition and Feature Fusion
Chu, Yan
Wang, Zhengkui
Wang, Lina
Zhao, Qingchao
Shan, Wen
[J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, 2021, 12817 : 209 - 221
[42] Research on plant seeds recognition based on fine-grained image classification
Yuan, Min
Dong, Yongkang
Lu, Fuxiang
Zhan, Kun
Zhu, Liye
Shen, Jiacheng
Ren, Dingbang
Hu, Xiaowen
Lv, Ningning
[J]. JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (05)
[43] Fine-grained image classification method based on hybrid attention module
Lu, Weixiang
Yang, Ying
Yang, Lei
[J]. FRONTIERS IN NEUROROBOTICS, 2024, 18
[44] Fine-grained Question-Answer sentiment classification with hierarchical graph attention network
Zeng, Jiandian
Liu, Tianyi
Jia, Weijia
Zhou, Jiantao
[J]. NEUROCOMPUTING, 2021, 457 : 214 - 224
[45] Fine-Grained Image Classification for Crop Disease Based on Attention Mechanism
Yang, Guofeng
He, Yong
Yang, Yong
Xu, Beibei
[J]. FRONTIERS IN PLANT SCIENCE, 2020, 11
[46] An Encoder-Decoder Convolution Network With Fine-Grained Spatial Information for Hyperspectral Images Classification
Li, Zhongwei
Guo, Fangming
Li, Qi
Ren, Guangbo
Wang, Leiquan
[J]. IEEE ACCESS, 2020, 8 : 33600 - 33608
[47] Practical fine-grained learning based anomaly classification for ECG image
Cao, Qing
Du, Nan
Yu, Li
Zuo, Ming
Lin, Jingsheng
Liu, Nathan
Zhong, Erheng
Liu, Zizhu
Chen, Qiaoran
Shen, Ying
Chen, Kang
[J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 119
[48] Exploiting Temporal Information for DCNN-based Fine-Grained Object Classification
Ge, ZongYuan
McCool, Chris
Sanderson, Conrad
Wang, Peng
Liu, Lingqiao
Reid, Ian
Corke, Peter
[J]. 2016 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2016, : 442 - 447
[49] Fine-Grained Object Classification Based on Block Diagonal Feature and Ensemble Learning
Xie, Xue
Luan, Beiyi
Song, Shiqing
Wang, Jilin
[J]. THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
[50] Fine-Grained Semantic Image Synthesis with Object-Attention Generative Adversarial Network
Wang, Min
Lang, Congyan
Liang, Liqian
Feng, Songhe
Wang, Tao
Gao, Yutong
[J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (05)

← 1 2 3 4 5 →