Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation

被引:58
|
作者
Dong, Xingning [1 ]
Gan, Tian [1 ]
Song, Xuemeng [1 ]
Wu, Jianlong [1 ]
Cheng, Yuan [2 ]
Nie, Liqiang [1 ]
机构
[1] Shandong Univ, Jinan, Peoples R China
[2] Ant Grp, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
COMPRESSION;
D O I
10.1109/CVPR52688.2022.01882
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene Graph Generation, which generally follows a regular encoder-decoder pipeline, aims to first encode the visual contents within the given image and then parse them into a compact summary graph. Existing SGG approaches generally not only neglect the insufficient modality fusion between vision and language, but also fail to provide informative predicates due to the biased relationship predictions, leading SGG far from practical. Towards this end, we first present a novel Stacked Hybrid-Attention network, which facilitates the intra-modal refinement as well as the intermodal interaction, to serve as the encoder. We then devise an innovative Group Collaborative Learning strategy to optimize the decoder. Particularly, based on the observation that the recognition capability of one classifier is limited towards an extremely unbalanced dataset, we first deploy a group of classifiers that are expert in distinguishing different subsets of classes, and then cooperatively optimize them from two aspects to promote the unbiased SGG. Experiments conducted on VG and GQA datasets demonstrate that, we not only establish a new state-of-the-art in the unbiased metric, but also nearly double the performance compared with two baselines. Our code is available at https://github.com/dongxingning/SHA-GCL-for-SGG.
引用
收藏
页码:19405 / 19414
页数:10
相关论文
共 50 条
  • [1] Attention redirection transformer with semantic oriented learning for unbiased scene graph generation
    Zhang, Ruonan
    An, Gaoyun
    Cen, Yigang
    Ruan, Qiuqi
    PATTERN RECOGNITION, 2025, 158
  • [2] Dual-Branch Hybrid Learning Network for Unbiased Scene Graph Generation
    Zheng, Chaofan
    Gao, Lianli
    Lyu, Xinyu
    Zeng, Pengpeng
    El Saddik, Abdulmotaleb
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1743 - 1756
  • [3] Hybrid-attention mechanism based heterogeneous graph representation learning
    Wang, Xiang
    Deng, Weikang
    Meng, Zhenyu
    Chen, Dewang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 250
  • [4] Adaptive Feature Learning for Unbiased Scene Graph Generation
    Yang, Jiarui
    Wang, Chuan
    Yang, Liang
    Jiang, Yuchen
    Cao, Angelina
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2252 - 2265
  • [5] Dark Knowledge Balance Learning for Unbiased Scene Graph Generation
    Chen, Zhiqing
    Luo, Yawei
    Shao, Jian
    Yang, Yi
    Wang, Chunping
    Chen, Lei
    Xiao, Jun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4838 - 4847
  • [6] Unbiased Scene Graph Generation in Videos
    Nag, Sayak
    Min, Kyle
    Tripathi, Subama
    Roy-Chowdhury, Amit K.
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22803 - 22813
  • [7] PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph Generation
    Yan, Shaotian
    Shen, Chen
    Jin, Zhongming
    Huang, Jianqiang
    Jiang, Rongxin
    Chen, Yaowu
    Hua, Xian-Sheng
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 265 - 273
  • [8] Unbiased Scene Graph Generation from Biased Training
    Tang, Kaihua
    Niu, Yulei
    Huang, Jianqiang
    Shi, Jiaxin
    Zhang, Hanwang
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3713 - 3722
  • [9] Unbiased Scene Graph Generation Using Predicate Similarities
    Matsui, Yusuke
    Ohashi, Misaki
    IEEE ACCESS, 2024, 12 : 95507 - 95516
  • [10] Compositional Feature Augmentation for Unbiased Scene Graph Generation
    Li, Lin
    Chen, Guikun
    Xiao, Jun
    Yang, Yi
    Wang, Chunping
    Chen, Long
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21628 - 21638