Few-Shot Image Classification Algorithm of Graph Neural Network Based on Swin Transformer

Times cited: 0

Authors:

Wang Kai [1]

Ren Jie [1]

Zhang Weichuan [2]

Affiliations:

[1] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Shaanxi, Peoples R China

[2] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Qld 4702, Australia

Source:

LASER & OPTOELECTRONICS PROGRESS | 2024 / Vol. 61 / No. 12

Keywords:

graph neural network; few-shot learning; image classification; Swin Transformer; dual metric learning;

DOI:

10.3788/LOP231596

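Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Subject classification code:

0808 ; 0809 ;

Abstract: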

In few-shot image classification, feature extraction modules based on convolutional neural networks struggle to capture long-range semantic information, and relying on a single measure of edge-feature similarity is limiting. To address both problems, this study presents a few-shot image classification method that uses a graph neural network based on the Swin Transformer. First, the Swin Transformer extracts image features, which serve as node features in the graph neural network. Next, the edge-feature similarity measurement module is extended with additional metrics, forming a dual-measurement module that calculates the similarity between node features; the resulting similarity is used as the edge-feature input of the graph neural network. Finally, the nodes and edges of the graph neural network are updated alternately to predict image class labels. On the 5-way 1-shot task, the proposed method reaches classification accuracies of 85.21%, 91.10%, and 91.08% on the Stanford Dogs, Stanford Cars, and CUB-200-2011 datasets, respectively, indicating strong performance in few-shot image classification.
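The pipeline described in the abstract (node features, dual-metric edge features, alternating node and edge updates) can be illustrated with a small sketch. This is not the authors' implementation. The abstract does not say which two metrics make up the dual-measurement module, so the pairing of cosine similarity with a Euclidean-distance-based similarity, the mixing weight `alpha`, the edge-weighted averaging used as the node update, and all function names are assumptions made for illustration. Short hand-written vectors stand in for Swin Transformer features, and nothing here is learned.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def distance_similarity(u, v):
    """Map Euclidean distance to (0, 1]; identical vectors give 1.0."""
    dist = math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
    return math.exp(-dist)


def dual_metric_edges(nodes, alpha=0.5):
    """Edge matrix mixing two similarity measures (assumed pairing, assumed weight)."""
    n = len(nodes)
    edges = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            cos01 = 0.5 * (cosine_similarity(nodes[i], nodes[j]) + 1.0)  # rescale to [0, 1]
            dist01 = distance_similarity(nodes[i], nodes[j])
            edges[i][j] = alpha * cos01 + (1.0 - alpha) * dist01
    return edges


def update_nodes(nodes, edges):
    """Replace each node by the edge-weighted average of all nodes (assumed update rule)."""
    dim = len(nodes[0])
    updated = []
    for row in edges:
        total = sum(row)  # positive, because every self-edge is close to 1
        updated.append(
            [sum(w * node[d] for w, node in zip(row, nodes)) / total for d in range(dim)]
        )
    return updated


def classify_query(nodes, labels, query_index, rounds=2, alpha=0.5):
    """Alternate edge and node updates, then copy the label of the best-connected support node."""
    for _ in range(rounds):
        edges = dual_metric_edges(nodes, alpha)
        nodes = update_nodes(nodes, edges)
    edges = dual_metric_edges(nodes, alpha)
    support = [j for j, label in enumerate(labels) if label is not None and j != query_index]
    best = max(support, key=lambda j: edges[query_index][j])
    return labels[best]


# Toy 2-way 1-shot episode: two labelled support vectors and one unlabelled query.
features = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1]]
labels = ["a", "b", None]
prediction = classify_query(features, labels, query_index=2)
```

In this toy episode the query lies close to the first support vector, so `prediction` takes that vector's label. A 5-way 1-shot episode, as evaluated in the abstract, would use five labelled support nodes instead of two.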


Pages: 9
