Zero-shot Scene Graph Generation with Relational Graph Neural Networks

Cited by: 0
Authors
Yu, Xiang [1]
Li, Jie [1]
Yuan, Shijing [1]
Wang, Chao [1]
Wu, Chentao [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Comp Sci & Engn, Shanghai, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
DOI
10.1109/ICPR56361.2022.9956712
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Existing scene graph generation (SGG) methods are far from practical, primarily because they perform poorly when predicting zero-shot (i.e., unseen) subject-predicate-object triples. We observe that these methods treat each image and its triples independently, and thus fail to exploit the complex, hidden information implicit in the triples of other images. To this end, we propose a novel encoder-decoder SGG framework that leverages the semantic correlations between the triples of different images to predict zero-shot triples. Specifically, the encoder aggregates the triples from every image in the training set into a large knowledge graph and uses a relational graph neural network to learn entity embeddings that capture the features of their neighborhoods. These neighborhood-aware embeddings are then fed into a vision-based decoder to predict the predicates in images. Extensive experiments on the popular Visual Genome benchmark demonstrate that the proposed method outperforms state-of-the-art methods on the common zero-shot metrics (i.e., zR@N and ng-zR@N) for all SGG tasks.
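The encoder idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy triples, the scalar per-relation weights (standing in for the per-relation weight matrices of a relational GNN), and the choice to aggregate only over incoming edges are all assumptions made for illustration. The sketch merges per-image triples into one knowledge graph, identifies which test triples are zero-shot, and runs one simplified relational message-passing step.

```python
from collections import defaultdict


def build_knowledge_graph(images):
    """Merge per-image (subject, predicate, object) triples into one edge set."""
    edges = set()
    for triples in images:
        edges.update(triples)
    return edges


def zero_shot_triples(train_edges, test_triples):
    """Return the test triples that never occur in the training graph."""
    return [t for t in test_triples if t not in train_edges]


def relational_layer(edges, features, rel_weight, self_weight):
    """One simplified relational message-passing step (assumed form).

    Each entity averages the features of its incoming neighbours per
    relation, scales them by a scalar per-relation weight, adds a
    self-loop term, and applies ReLU.
    """
    # incoming[obj][pred] -> list of subject entities pointing at obj
    incoming = defaultdict(lambda: defaultdict(list))
    for subj, pred, obj in sorted(edges):  # sorted for deterministic order
        incoming[obj][pred].append(subj)

    updated = {}
    for entity, feat in features.items():
        out = [self_weight * x for x in feat]
        for pred, neighbours in incoming[entity].items():
            w = rel_weight[pred] / len(neighbours)  # mean over neighbours
            for n in neighbours:
                out = [o + w * x for o, x in zip(out, features[n])]
        updated[entity] = [max(0.0, o) for o in out]  # ReLU
    return updated


# Toy data (hypothetical): two training images and two test triples.
train_images = [
    [("person", "riding", "horse"), ("horse", "on", "grass")],
    [("person", "riding", "bike"), ("dog", "on", "grass")],
]
test_triples = [("dog", "riding", "horse"), ("person", "riding", "horse")]

kg = build_knowledge_graph(train_images)
unseen = zero_shot_triples(kg, test_triples)

features = {
    "person": [1.0, 0.0],
    "horse": [0.0, 1.0],
    "bike": [0.0, 2.0],
    "dog": [2.0, 0.0],
    "grass": [0.0, 0.0],
}
rel_weight = {"riding": 0.5, "on": 1.0}
embeddings = relational_layer(kg, features, rel_weight, self_weight=1.0)
```

In this toy graph, "grass" receives messages from both "horse" and "dog" even though they come from different images, which is the kind of cross-image neighborhood information the abstract says the embeddings are meant to capture.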
Pages: 1894 - 1900
Number of pages: 7
Related papers
50 in total
  • [31] Transductive Zero-Shot Action Recognition via Visually Connected Graph Convolutional Networks
    Xu, Yangyang
    Han, Chu
    Qin, Jing
    Xu, Xuemiao
    Han, Guoqiang
    He, Shengfeng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (08) : 3761 - 3769
  • [32] Zero-Shot Multi-View Indoor Localization via Graph Location Networks
    Chiou, Meng-Jiun
    Liu, Zhenguang
    Yin, Yifang
    Liu, An-An
    Zimmermann, Roger
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020 : 3431 - 3440
  • [33] Inductive Zero-Shot Image Annotation via Embedding Graph
    Wang, Fangxin
    Liu, Jie
    Zhang, Shuwu
    Zhang, Guixuan
    Li, Yuejun
    Yuan, Fei
    IEEE ACCESS, 2019, 7 : 107816 - 107830
  • [34] Graph-Based Semantic Embedding Refinement for Zero-Shot Remote Sensing Image Scene Classification
    Shang, Junyuan
    Niu, Chang
    Zhou, Wenlve
    Zhou, Zhiheng
    Yang, Junmei
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (01) : 644 - 657
  • [36] Zero-shot surface defect recognition with class knowledge graph
    Li, Zhaofu
    Gao, Liang
    Gao, Yiping
    Li, Xinyu
    Li, Hui
    ADVANCED ENGINEERING INFORMATICS, 2022, 54
  • [37] Zero-shot Node Classification with Decomposed Graph Prototype Network
    Wang, Zheng
    Wang, Jialong
    Guo, Yuchen
    Gong, Zhiguo
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021 : 1769 - 1779
  • [38] Transductive semantic knowledge graph propagation for zero-shot learning
    Zhang, Hai-gang
    Que, Hao-yi
    Ren, Jin
    Wu, Zheng-guang
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (17) : 13108 - 13125
  • [39] A dynamic semantic knowledge graph for zero-shot object detection
    Lv, Wen
    Shi, Hongbo
    Tan, Shuai
    Song, Bing
    Tao, Yang
    VISUAL COMPUTER, 2023, 39 (10) : 4513 - 4527
  • [40] Zero-Shot Visual Question Answering Using Knowledge Graph
    Chen, Zhuo
    Chen, Jiaoyan
    Geng, Yuxia
    Pan, Jeff Z.
    Yuan, Zonggang
    Chen, Huajun
    SEMANTIC WEB - ISWC 2021, 2021, 12922 : 146 - 162