Triple Correlations-Guided Label Supplementation for Unbiased Video Scene Graph Generation

被引:1
|
作者
Wang, Wenqing [1 ]
Gao, Kaifeng [1 ]
Luo, Yawei [1 ]
Jiang, Tao [1 ]
Gao, Fei [2 ]
Shao, Jian [1 ]
Sun, Jianwen [3 ]
Xiao, Jun [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Zhejiang Univ Technol, Hangzhou, Peoples R China
[3] Cent China Normal Univ, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
video scene graph generation; spatio-temporal correlations; long-tail problem; missing label supplementation;
D O I
10.1145/3581783.3612024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video-based scene graph generation (VidSGG) is an approach that aims to represent video content in a dynamic graph by identifying visual entities and their relationships. Due to the inherently biased distribution and missing annotations in the training data, current VidSGG methods have been found to perform poorly on less-represented predicates. In this paper, we propose an explicit solution to address this under-explored issue by supplementing missing predicates that should appear in the ground-truth annotations. Dubbed Trico, our method seeks to supplement the missing predicates by exploring three complementary spatio-temporal correlations. Guided by these correlations, the missing labels can be effectively supplemented thus achieving an unbiased predicate predictions. We validate the effectiveness of Trico on the most widely used VidSGG datasets, i.e., VidVRD and VidOR. Extensive experiments demonstrate the state-of-the-art performance achieved by Trico, particularly on those tail predicates. The code is available in https://github.com/Wq23333/Trico.git.
引用
收藏
页码:5153 / 5163
页数:11
相关论文
共 50 条
  • [31] Bridging Visual and Textual Semantics: Towards Consistency for Unbiased Scene Graph Generation
    Zhang, Ruonan
    An, Gaoyun
    Hao, Yiqing
    Wu, Dapeng Oliver
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (11) : 7102 - 7119
  • [32] PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph Generation
    Yan, Shaotian
    Shen, Chen
    Jin, Zhongming
    Huang, Jianqiang
    Jiang, Rongxin
    Chen, Yaowu
    Hua, Xian-Sheng
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 265 - 273
  • [33] Learning to Generate an Unbiased Scene Graph by Using Attribute-Guided Predicate Features
    Wang, Lei
    Yuan, Zejian
    Chen, Badong
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2581 - 2589
  • [34] HIG: Hierarchical Interlacement Graph Approach to Scene Graph Generation in Video Understanding
    Trong-Thuan Nguyen
    Pha Nguyen
    Luu, Khoa
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 18384 - 18394
  • [35] Hypercomplex context guided interaction modeling for scene graph generation
    Wang, Zheng
    Xu, Xing
    Luo, Yadan
    Wang, Guoqing
    Yang, Yang
    PATTERN RECOGNITION, 2023, 141
  • [36] A causality guided loss for imbalanced learning in scene graph generation
    Peng, Ru
    Zhao, Chao
    Chen, Xingyu
    Wang, Ziru
    Liu, Yaxin
    Liu, Yulong
    Lan, Xuguang
    NEUROCOMPUTING, 2024, 599
  • [37] Target Adaptive Context Aggregation for Video Scene Graph Generation
    Teng, Yao
    Wang, Limin
    Li, Zhifeng
    Wu, Gangshan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13668 - 13677
  • [38] Video Scene Graph Generation with Spatial-Temporal Knowledge
    Pu, Tao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9340 - 9344
  • [39] NICEST: Noisy Label Correction and Training for Robust Scene Graph Generation
    Li, Lin
    Xiao, Jun
    Shi, Hanrong
    Zhang, Hanwang
    Yang, Yi
    Liu, Wei
    Chen, Long
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (10) : 6873 - 6888
  • [40] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation
    Li, Lin
    Chen, Long
    Huang, Yifeng
    Zhang, Zhimeng
    Zhang, Songyang
    Xiao, Jun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18847 - 18856