Unbiased Scene Graph Generation in Videos

被引:9
|
作者
Nag, Sayak [1 ]
Min, Kyle [2 ]
Tripathi, Subama [2 ]
Roy-Chowdhury, Amit K. [1 ]
机构
[1] Univ Calif Riverside, Riverside, CA 92521 USA
[2] Intel Corp, Santa Clara, CA USA
关键词
D O I
10.1109/CVPR52729.2023.02184
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of dynamic scene graph generation (SGG) from videos is complicated and challenging due to the inherent dynamics of a scene, temporal fluctuation of model predictions, and the long-tailed distribution of the visual relationships in addition to the already existing challenges in image-based SGG. Existing methods for dynamic SGG have primarily focused on capturing spatio-temporal context using complex architectures without addressing the challenges mentioned above, especially the long-tailed distribution of relationships. This often leads to the generation of biased scene graphs. To address these challenges, we introduce a new framework called TEMPURA: TEmporal consistency and Memory Prototype guided UnceRtainty Attenuation for unbiased dynamic SGG. TEMPURA employs object-level temporal consistencies via transformer-based sequence modeling, learns to synthesize unbiased relationship representations using memory-guided training, and attenuates the predictive uncertainty of visual relations using a Gaussian Mixture Model (GMM). Extensive experiments demonstrate that our method achieves significant (up to 10% in some cases) performance gain over existing methods highlighting its superiority in generating more unbiased scene graphs. Code: https://github.com/sayaknag/unbiasedSGG.git
引用
收藏
页码:22803 / 22813
页数:11
相关论文
共 50 条
  • [21] Unbiased Scene Graph Generation via Two-Stage Causal Modeling
    Sun, Shuzhou
    Zhi, Shuaifeng
    Liao, Qing
    Heikkila, Janne
    Liu, Li
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12562 - 12580
  • [22] Dual-Branch Hybrid Learning Network for Unbiased Scene Graph Generation
    Zheng, Chaofan
    Gao, Lianli
    Lyu, Xinyu
    Zeng, Pengpeng
    El Saddik, Abdulmotaleb
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1743 - 1756
  • [23] Attention redirection transformer with semantic oriented learning for unbiased scene graph generation
    Zhang, Ruonan
    An, Gaoyun
    Cen, Yigang
    Ruan, Qiuqi
    PATTERN RECOGNITION, 2025, 158
  • [24] PPDL: Predicate Probability Distribution based Loss for Unbiased Scene Graph Generation
    Li, Wei
    Zhang, Haiwei
    Bai, Qijie
    Zhao, Guoqing
    Jiang, Ning
    Yuan, Xiaojie
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19425 - 19434
  • [25] Bridging Visual and Textual Semantics: Towards Consistency for Unbiased Scene Graph Generation
    Zhang, Ruonan
    An, Gaoyun
    Hao, Yiqing
    Wu, Dapeng Oliver
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (11) : 7102 - 7119
  • [26] PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph Generation
    Yan, Shaotian
    Shen, Chen
    Jin, Zhongming
    Huang, Jianqiang
    Jiang, Rongxin
    Chen, Yaowu
    Hua, Xian-Sheng
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 265 - 273
  • [27] Triple Correlations-Guided Label Supplementation for Unbiased Video Scene Graph Generation
    Wang, Wenqing
    Gao, Kaifeng
    Luo, Yawei
    Jiang, Tao
    Gao, Fei
    Shao, Jian
    Sun, Jianwen
    Xiao, Jun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5153 - 5163
  • [28] A New Training Data Organization Form and Training Mode for Unbiased Scene Graph Generation
    Xu, Hongbo
    Wang, Lichun
    Xu, Kai
    Fu, Fangyu
    Yin, Baocai
    Huang, Qingming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5295 - 5305
  • [29] HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation
    Zhou, Zijian
    Shi, Miaojing
    Caesar, Holger
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21580 - 21591
  • [30] Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation
    Dong, Xingning
    Gan, Tian
    Song, Xuemeng
    Wu, Jianlong
    Cheng, Yuan
    Nie, Liqiang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19405 - 19414