High efficient framework for large-scale zero-shot image recognition

被引:0
|
作者
Zhang Z. [1 ,2 ]
Liu Q. [1 ,2 ]
Guo D. [3 ]
机构
[1] School of Microelectronics, Tianjin University, Tianjin
[2] Tianjin Key Laboratory of Imaging and Sensing Microelectronic Technology, Tianjin
[3] Tianjin Communication and Broadcasting Group Co.Ltd., Tianjin
关键词
deep learning; graph neural networks; knowledge graph;
D O I
10.19665/j.issn1001-2400.2022.06.013
中图分类号
学科分类号
摘要
For large-scale zero-shot image recognition tasks,because of a large number of classes,model training is difficult and training costs of the model are high.In order to solve those problems,this paper designs a high-efficient zero-shot learning framework,which improves the accuracy and generalization ability at low training costs.This framework designs the joint space,uses the image branch network and the semantic branch network to map different modal vectors to the joint space to complete model training and inference.In the image branch network,in order to change the distribution of image feature vectors,this paper uses the perceptron network to map image feature vectors to the joint space.In the semantic branch network,graph convolutional networks are used to map semantic vectors to the joint space.In addition,the loss function is designed to constrain the joint space,so that the discrimination of different classes in the joint space is increased,which is conducive to model training.Experimental results on the ImageNet show that on the "2-HOPS" test set,compared with existing methods without fine-tuning,the accuracy of our algorithm increases by 1.1%,and the training time decreases by 57.8%;compared with existing algorithms after fine-tuning,the accuracy of our algorithm saves 98.4% of training time without any loss of accuracy.Experimental results show that the method improves the model performance with low training costs. © 2022 Science Press. All rights reserved.
引用
收藏
页码:103 / 110
页数:7
相关论文
共 31 条
  • [1] SONG Jianfeng, MIAO Qiguang, WANG Chongxiao, Et al., Multi-Scale Single Object Tracking Based on the Attention Mechanism, Journal of Xidian University, 48, 5, pp. 110-116, (2021)
  • [2] HARL M, HERCHENBACH M, KRUSCHEL S, Et al., A Light in the Dark: Deep Learning Practices for Industrial Computer Vision, (2022)
  • [3] VIJ R, ARORA S., Computer Vision with Deep Learning Techniques for Neurodegenerative Diseases Analysis Using Neuroimaging: A Survey [C], International Conference on Innovative Computing and Communications, pp. 179-189, (2022)
  • [4] KONG Yueping, LIU Chu, ZHU Xudong, Faceanti-Spoofing Method Using the Optical Flow Features of Back Ground [J], Journal of Xidian University, 48, 5, pp. 86-91, (2021)
  • [5] XIAN Y, LAMPERT C H, SCHIELE B, Et al., Zero-Shot Learning-A Comprehensive Evaluation of the Good, the Bad and the Ugly, IEEE Transactions on Pattern Analysis and Machine Intelligence, 41, 9, pp. 2251-2265, (2018)
  • [6] VERMA V K, LIANG K, MEHTA N, Et al., Meta-Learned Attribute Self-Gating for Continual Generalized Zero-Shot Learning, (2021)
  • [7] NAM J, AHN D, KANG D, Et al., Zero-Shot Natural Language Video Localization [C], Proceedings of the IEEE International Conference on Computer Vision, pp. 1470-1479, (2021)
  • [8] QIN Y, ZHAO C, ZHU X, Et al., Learning Meta Model for Zero-and Few-Shot Face Anti-Spoof ing [C], Proceedings of the AAAI Conference on Artificial Intelligence, pp. 11916-11923, (2020)
  • [9] HUYNH D, ELHAMIFAR E., Fine-Grained Generalized Zero-Shot Learning Via Dense Attribute-Based Attention, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4483-4493, (2020)
  • [10] XU W, XIAN Y, WANG J, Et al., Attribute Prototype Network for Zero-Shot Learning [J/OL], (2020)