Constrained Structure Learning for Scene Graph Generation

Cited by: 4
Authors
Liu, Daqi [1]
Bober, Miroslaw [1]
Kittler, Josef [1]
Affiliations
[1] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, England
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Scene graph generation; structured prediction; mean field variational Bayesian; message passing; constrained optimization
DOI
10.1109/TPAMI.2023.3282889
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As a structured prediction task, scene graph generation aims to build a visually grounded scene graph that explicitly models the objects in an input image and their relationships. Currently, the mean field variational Bayesian framework is the de facto methodology of existing methods, in which the unconstrained inference step is often implemented by a message passing neural network. However, such a formulation fails to explore other inference strategies and largely ignores more general constrained optimization models. In this paper, we present a constrained structure learning method, for which an explicit constrained variational inference objective is proposed. Instead of applying the ubiquitous message passing strategy, a generic constrained optimization method, entropic mirror descent, is used to solve the constrained variational inference step. We validate the proposed generic model on various popular scene graph generation benchmarks and show that it outperforms the state-of-the-art methods.
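For readers unfamiliar with entropic mirror descent, the following is a minimal sketch of the generic technique on a toy problem. It is not the authors' model or objective. It assumes a single categorical distribution q constrained to the probability simplex and a hypothetical free-energy objective F(q) = sum_i q_i * c_i + sum_i q_i * log q_i, whose minimizer is known in closed form as softmax(-c). The names `entropic_mirror_descent`, `free_energy_grad`, and `costs` are illustrative assumptions.

```python
import math


def entropic_mirror_descent(grad, q0, step=0.5, iters=100):
    """Minimize a function over the probability simplex.

    Each iteration applies the multiplicative update
    q_i <- q_i * exp(-step * g_i) / Z, which keeps q non-negative
    and normalized without an explicit projection.
    """
    q = list(q0)
    for _ in range(iters):
        g = grad(q)
        # Work in log space and subtract the maximum for numerical stability.
        logits = [math.log(qi) - step * gi for qi, gi in zip(q, g)]
        top = max(logits)
        weights = [math.exp(v - top) for v in logits]
        total = sum(weights)
        q = [w / total for w in weights]
    return q


def free_energy_grad(costs):
    """Gradient of the toy objective F(q) = <q, costs> + sum q log q (assumed)."""
    def grad(q):
        return [c + math.log(qi) + 1.0 for c, qi in zip(costs, q)]
    return grad


# Hypothetical per-label costs for one variable with three labels.
costs = [1.0, 2.0, 0.5]
q = entropic_mirror_descent(free_energy_grad(costs), [1.0 / 3] * 3)

# Closed-form minimizer of the toy objective, for comparison.
norm = sum(math.exp(-c) for c in costs)
expected = [math.exp(-c) / norm for c in costs]
```

On this toy objective the iterate `q` approaches `expected`, because the simplex constraint is enforced by the normalization inside each update rather than by a separate projection step.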
Pages: 11588-11599
Number of pages: 12