Few-Shot Image Classification Algorithm of Graph Neural Network Based on Swin Transformer

被引:0
|
作者
Wang Kai [1 ]
Ren Jie [1 ]
Zhang Weichuan [2 ]
机构
[1] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Shaanxi, Peoples R China
[2] Graiffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Qld 4702, Australia
关键词
graph neural network; few-shot learning; image classification; Swin Transformer; dual metric learning;
D O I
10.3788/LOP231596
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In few-shot image classification tasks, capturing remote semantic information in feature extraction modules based on convolutional neural network and single measure of edge-feature similarity are challenging. Therefore, in this study, we present a few-shot image classification method utilizing a graph neural network based on Swin Transformer. First, the Swin Transformer is used to extract image features, which are utilized as node features in the graph neural network. Next, the edge-feature similarity measurement module is improved by adding additional metrics, thus forming a dual-measurement module to calculate the similarity between the node features. The obtained similarity is used as the edge- feature input of the graph neural network. Finally, the nodes and edges of the graph neural network are alternately updated to predict image class labels. The classification accuracy of our proposed method for a 5-way 1-shot task on Stanford Dogs, Stanford Cars, and CUB-200-2011 datasets is calculated as 85. 21%, 91.10%, and 91.08%, respectively, thereby achieving significant results in few-shot image classification.
引用
收藏
页数:9
相关论文
共 30 条
  • [1] Rusu AA, 2019, Arxiv, DOI arXiv:1807.05960
  • [2] AutoAugment: Learning Augmentation Strategies from Data
    Cubuk, Ekin D.
    Zoph, Barret
    Mane, Dandelion
    Vasudevan, Vijay
    Le, Quoc V.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 113 - 123
  • [3] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [4] Doersch C, 2021, Arxiv, DOI [arXiv:2007.11498, 10.48550/arXiv.2007.11498]
  • [5] A Bayesian approach to unsupervised one-shot learning of object categories
    Fei-Fei, L
    Fergus, R
    Perona, P
    [J]. NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, : 1134 - 1141
  • [6] Finn C, 2017, PR MACH LEARN RES, V70
  • [7] Garcia V., 2017, arXiv
  • [8] Dynamic Few-Shot Visual Learning without Forgetting
    Gidaris, Spyros
    Komodakis, Nikos
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4367 - 4375
  • [9] Remote Sensing Image Classification Method Based on Fusion of CNN and Transformer
    Jin Chuan
    Tong Changqing
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (20)
  • [10] Edge-Labeling Graph Neural Network for Few-shot Learning
    Kim, Jongmin
    Kim, Taesup
    Kim, Sungwoong
    Yoo, Chang D.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11 - 20