Zero-shot Scene Graph Generation with Relational Graph Neural Networks

被引：1

作者：

Yu, Xiang ^{[1
]}

Li, Jie ^{[1
]}

Yuan, Shijing ^{[1
]}

Wang, Chao ^{[1
]}

Wu, Chentao ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Sch Comp Sci & Engn, Shanghai, Peoples R China

来源：

2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2022年

基金：

国家重点研发计划;

关键词：

D O I：

10.1109/ICPR56361.2022.9956712

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Existing scene graph generation (SGG) methods are far from practical, primarily due to their poor performance on predicting zero-shot (i.e., unseen) subject-predicate-object triples. We observe that these SGG methods treat images along with the triples in them independently and thus fail to consider the complex and hidden information that is inherently implicit in the triples of other images. To this effect, our paper proposes a novel encoder-decoder SGG framework to leverage the semantic correlations between the triples of different images into the prediction of a zero-shot triple. Specifically, the encoder aggregates the triples in each image of training set into a large knowledge graph and learns the entity embeddings that capture the features of their neighborhoods with a relational graph neural network. The neighborhood-aware embeddings are then fed into the vision-based decoder to predict the predicates in images. Extensive experiments on the popular benchmark Visual Genome demonstrate that our proposed method outperforms the state-of-the-art methods in popular zero-shot metrics (i.e., zR@N, ng-zR@N) for all SGG tasks.

引用

页码：1894 / 1900

页数：7

共 50 条

[31] Transductive Zero-Shot Action Recognition via Visually Connected Graph Convolutional Networks
Xu, Yangyang
Han, Chu
Qin, Jing
Xu, Xuemiao
Han, Guoqiang
He, Shengfeng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (08) : 3761 - 3769
[32] Zero-Shot Multi-View Indoor Localization via Graph Location Networks
Chiou, Meng-Jiun
Liu, Zhenguang
Yin, Yifang
Liu, An-An
Zimmermann, Roger
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3431 - 3440
[33] Graph-Based Semantic Embedding Refinement for Zero-Shot Remote Sensing Image Scene Classification
Shang, Junyuan
Niu, Chang
Zhou, Wenlve
Zhou, Zhiheng
Yang, Junmei
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (01) : 644 - 657
[34] Inductive Zero-Shot Image Annotation via Embedding Graph
Wang, Fangxin
Liu, Jie
Zhang, Shuwu
Zhang, Guixuan
Li, Yuejun
Yuan, Fei
IEEE ACCESS, 2019, 7 : 107816 - 107830
[35] Zero-shot Node Classification with Decomposed Graph Prototype Network
Wang, Zheng
Wang, Jialong
Guo, Yuchen
Gong, Zhiguo
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 1769 - 1779
[36] A dynamic semantic knowledge graph for zero-shot object detection
Wen Lv
Hongbo Shi
Shuai Tan
Bing Song
Yang Tao
The Visual Computer, 2023, 39 : 4513 - 4527
[37] Zero-shot surface defect recognition with class knowledge graph
Li, Zhaofu
Gao, Liang
Gao, Yiping
Li, Xinyu
Li, Hui
ADVANCED ENGINEERING INFORMATICS, 2022, 54
[38] Transductive semantic knowledge graph propagation for zero-shot learning
Zhang, Hai-gang
Que, Hao-yi
Ren, Jin
Wu, Zheng-guang
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (17): : 13108 - 13125
[39] A dynamic semantic knowledge graph for zero-shot object detection
Lv, Wen
Shi, Hongbo
Tan, Shuai
Song, Bing
Tao, Yang
VISUAL COMPUTER, 2023, 39 (10): : 4513 - 4527
[40] Zero-Shot Visual Question Answering Using Knowledge Graph
Chen, Zhuo
Chen, Jiaoyan
Geng, Yuxia
Pan, Jeff Z.
Yuan, Zonggang
Chen, Huajun
SEMANTIC WEB - ISWC 2021, 2021, 12922 : 146 - 162

← 1 2 3 4 5 →