Zero-shot Scene Graph Generation with Relational Graph Neural Networks

被引:0
|
作者
Yu, Xiang [1 ]
Li, Jie [1 ]
Yuan, Shijing [1 ]
Wang, Chao [1 ]
Wu, Chentao [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Comp Sci & Engn, Shanghai, Peoples R China
基金
国家重点研发计划;
关键词
D O I
10.1109/ICPR56361.2022.9956712
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing scene graph generation (SGG) methods are far from practical, primarily due to their poor performance on predicting zero-shot (i.e., unseen) subject-predicate-object triples. We observe that these SGG methods treat images along with the triples in them independently and thus fail to consider the complex and hidden information that is inherently implicit in the triples of other images. To this effect, our paper proposes a novel encoder-decoder SGG framework to leverage the semantic correlations between the triples of different images into the prediction of a zero-shot triple. Specifically, the encoder aggregates the triples in each image of training set into a large knowledge graph and learns the entity embeddings that capture the features of their neighborhoods with a relational graph neural network. The neighborhood-aware embeddings are then fed into the vision-based decoder to predict the predicates in images. Extensive experiments on the popular benchmark Visual Genome demonstrate that our proposed method outperforms the state-of-the-art methods in popular zero-shot metrics (i.e., zR@N, ng-zR@N) for all SGG tasks.
引用
下载
收藏
页码:1894 / 1900
页数:7
相关论文
共 50 条
  • [1] Zero-shot Scene Graph Generation via Triplet Calibration and Reduction
    Li, Jiankai
    Wang, Yunhong
    Li, Weixin
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (01)
  • [2] Zero-Shot Predicate Prediction for Scene Graph Parsing
    Li, Yiming
    Yang, Xiaoshan
    Huang, Xuhui
    Ma, Zhe
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3140 - 3153
  • [3] Zero-shot learning of aerosol optical properties with graph neural networks
    Lamb, K. D.
    Gentine, P.
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [4] Zero-shot learning of aerosol optical properties with graph neural networks
    K. D. Lamb
    P. Gentine
    Scientific Reports, 13
  • [5] Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks
    Wang, Wenguan
    Lu, Xiankai
    Shen, Jianbing
    Crandall, David
    Shao, Ling
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9235 - 9244
  • [6] Dynamic Gated Graph Neural Networks for Scene Graph Generation
    Khademi, Mahmoud
    Schulte, Oliver
    COMPUTER VISION - ACCV 2018, PT VI, 2019, 11366 : 669 - 685
  • [7] TGG: Transferable Graph Generation for Zero-shot and Few-shot Learning
    Zhang, Chenrui
    Lyu, Xiaoqing
    Tang, Zhi
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1641 - 1649
  • [8] From Node to Graph: Joint Reasoning on Visual-Semantic Relational Graph for Zero-Shot Detection
    Nie, Hui
    Wang, Ruiping
    Chen, Xilin
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1648 - 1657
  • [9] Zero-Shot Ingredient Recognition by Multi-Relational Graph Convolutional Network
    Chen, Jingjing
    Pan, Liangming
    Wei, Zhipeng
    Wang, Xiang
    Ngo, Chong-Wah
    Chua, Tat-Seng
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10542 - 10550
  • [10] Zero-Shot Scene Graph Relation Prediction Through Commonsense Knowledge Integration
    Kan, Xuan
    Cui, Hejie
    Yang, Carl
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 466 - 482