Constrained Structure Learning for Scene Graph Generation

被引:4
|
作者
Liu, Daqi [1 ]
Bober, Miroslaw [1 ]
Kittler, Josef [1 ]
机构
[1] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, England
基金
英国工程与自然科学研究理事会;
关键词
Scene graph generation; structured prediction; mean field variational Bayesian; message passing; constrained optimization;
D O I
10.1109/TPAMI.2023.3282889
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a structured prediction task, scene graph generation aims to build a visually-grounded scene graph to explicitly model objects and their relationships in an input image. Currently, the mean field variational Bayesian framework is the de facto methodology used by the existing methods, in which the unconstrained inference step is often implemented by a message passing neural network. However, such formulation fails to explore other inference strategies, and largely ignores the more general constrained optimization models. In this paper, we present a constrained structure learning method, for which an explicit constrained variational inference objective is proposed. Instead of applying the ubiquitous message-passing strategy, a generic constrained optimization method - entropic mirror descent - is utilized to solve the constrained variational inference step. We validate the proposed generic model on various popular scene graph generation benchmarks and show that it outperforms the state-of-the-art methods.
引用
收藏
页码:11588 / 11599
页数:12
相关论文
共 50 条
  • [1] Importance Weighted Structure Learning for Scene Graph Generation
    Liu, Daqi
    Bober, Miroslaw
    Kittler, Josef
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (02) : 1231 - 1242
  • [2] Heterogeneous Learning for Scene Graph Generation
    He, Yunqing
    Ren, Tongwei
    Tang, Jinhui
    Wu, Gangshan
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4704 - 4713
  • [3] Predicate Correlation Learning for Scene Graph Generation
    Tao, Leitian
    Mi, Li
    Li, Nannan
    Cheng, Xianhang
    Hu, Yaosi
    Chen, Zhenzhong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4173 - 4185
  • [4] Exploring and Exploiting the Hierarchical Structure of a Scene for Scene Graph Generation
    Kurosawa, Ikuto
    Kobayashi, Tetsunori
    Hayashi, Yoshihiko
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1422 - 1429
  • [5] Adaptive Feature Learning for Unbiased Scene Graph Generation
    Yang, Jiarui
    Wang, Chuan
    Yang, Liang
    Jiang, Yuchen
    Cao, Angelina
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2252 - 2265
  • [6] Energy-Based Learning for Scene Graph Generation
    Suhail, Mohammed
    Mittal, Abhay
    Siddiquie, Behjat
    Broaddus, Chris
    Eledath, Jayan
    Medioni, Gerard
    Sigal, Leonid
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13931 - 13940
  • [7] Debiased Scene Graph Generation for Dual Imbalance Learning
    Zhou, Hao
    Zhang, Jun
    Luo, Tingjin
    Yang, Yazhou
    Lei, Jun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4274 - 4288
  • [8] Part-Aware Interactive Learning for Scene Graph Generation
    Tian, Hongshuo
    Xu, Ning
    Liu, An-An
    Zhang, Yongdong
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3155 - 3163
  • [9] Dark Knowledge Balance Learning for Unbiased Scene Graph Generation
    Chen, Zhiqing
    Luo, Yawei
    Shao, Jian
    Yang, Yi
    Wang, Chunping
    Chen, Lei
    Xiao, Jun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4838 - 4847
  • [10] A causality guided loss for imbalanced learning in scene graph generation
    Peng, Ru
    Zhao, Chao
    Chen, Xingyu
    Wang, Ziru
    Liu, Yaxin
    Liu, Yulong
    Lan, Xuguang
    NEUROCOMPUTING, 2024, 599