Graph-Based Visual-Semantic Entanglement Network for Zero-Shot Image Recognition

被引:8
|
作者
Hu, Yang [1 ,2 ]
Wen, Guihua [1 ]
Chapman, Adriane [2 ]
Yang, Pei [1 ]
Luo, Mingnan [1 ]
Xu, Yingxue [1 ]
Dai, Dan [1 ,3 ]
Hall, Wendy [2 ]
机构
[1] South China Univ Technol, Guangzhou 510006, Peoples R China
[2] Univ Southampton, Southampton SO17 1BJ, Hants, England
[3] Univ Lincoln, Lincoln LN6 7TS, England
基金
美国国家科学基金会;
关键词
Visualization; Semantics; Pipelines; Knowledge engineering; Load modeling; Task analysis; Couplings; Zero-shot learning; visual-semantic modeling; graph convolutional network; semantic knowledge graph; attribute word embedding;
D O I
10.1109/TMM.2021.3082292
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Zero-shot learning uses semantic attributes to connect the search space of unseen objects. In recent years, although the deep convolutional network brings powerful visual modeling capabilities to the ZSL task, its visual features have severe pattern inertia and lack of representation of semantic relationships, which leads to severe bias and ambiguity. In response to this, we propose the Graph-based Visual-Semantic Entanglement Network to conduct graph modeling of visual features, which is mapped to semantic attributes by using a knowledge graph, it contains several novel designs: 1. it establishes a multi-path entangled network with the convolutional neural network (CNN) and the graph convolutional network (GCN), which input the visual features from CNN to GCN to model the implicit semantic relations, then GCN feedback the graph modeled information to CNN features; 2. it uses attribute word vectors as the target for the graph semantic modeling of GCN, which forms a self-consistent regression for graph modeling and supervise GCN to learn more personalized attribute relations; 3. it fuses and supplements the hierarchical visual-semantic features refined by graph modeling into visual embedding. Our method outperforms state-of-the-art approaches on multiple representative ZSL datasets: AwA2, CUB, and SUN by promoting the semantic linkage modelling of visual features.
引用
收藏
页码:2473 / 2487
页数:15
相关论文
共 50 条
  • [1] Zero-shot learning with visual-semantic mutual reinforcement for image recognition
    Hefei University of Technology, Key Laboratory of Knowledge Engineering with Big Data, The Ministry of Education of China], Hefei, China
    不详
    不详
    [J]. J. Electron. Imaging, 5
  • [2] Spatiotemporal visual-semantic embedding network for zero-shot action recognition
    An, Rongqiao
    Miao, Zhenjiang
    Li, Qingyu
    Xu, Wanru
    Zhang, Qiang
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (02)
  • [3] Deep quantization network with visual-semantic alignment for zero-shot image retrieval
    Liu, Huixia
    Qin, Zhihong
    [J]. ELECTRONIC RESEARCH ARCHIVE, 2023, 31 (07): : 4232 - 4247
  • [4] Indirect visual-semantic alignment for generalized zero-shot recognition
    Chen, Yan-He
    Yeh, Mei-Chen
    [J]. MULTIMEDIA SYSTEMS, 2024, 30 (02)
  • [5] Visual-Semantic Aligned Bidirectional Network for Zero-Shot Learning
    Gao, Rui
    Hou, Xingsong
    Qin, Jie
    Shen, Yuming
    Long, Yang
    Liu, Li
    Zhang, Zhao
    Shao, Ling
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1649 - 1664
  • [6] Zero-Shot Recognition using Dual Visual-Semantic Mapping Paths
    Li, Yanan
    Wang, Donghui
    Hu, Huanhang
    Lin, Yuetan
    Zhuang, Yueting
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5207 - 5215
  • [7] Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval
    Ueki, Kazuya
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 628 - 634
  • [8] Zero-shot image classification via Visual-Semantic Feature Decoupling
    Sun, Xin
    Tian, Yu
    Li, Haojie
    [J]. MULTIMEDIA SYSTEMS, 2024, 30 (02)
  • [9] Visual-semantic consistency matching network for generalized zero-shot learning
    Zhang, Zhenqi
    Cao, Wenming
    [J]. NEUROCOMPUTING, 2023, 536 : 30 - 39
  • [10] Transductive Visual-Semantic Embedding for Zero-shot Learning
    Xu, Xing
    Shen, Fumin
    Yang, Yang
    Shao, Jie
    Huang, Zi
    [J]. PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 41 - 49