Compositional Feature Augmentation for Unbiased Scene Graph Generation

被引:11
|
作者
Li, Lin [1 ,2 ]
Chen, Guikun [1 ]
Xiao, Jun [1 ]
Yang, Yi [1 ]
Wang, Chunping [3 ]
Chen, Long [2 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] FinVolut, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV51070.2023.01982
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene Graph Generation (SGG) aims to detect all the visual relation triplets <sub, pred, obj> in a given image. With the emergence of various advanced techniques for better utilizing both the intrinsic and extrinsic information in each relation triplet, SGG has achieved great progress over the recent years. However, due to the ubiquitous long-tailed predicate distributions, today's SGG models are still easily biased to the head predicates. Currently, the most prevalent debiasing solutions for SGG are re-balancing methods, e.g., changing the distributions of original training samples. In this paper, we argue that all existing re-balancing strategies fail to increase the diversity of the relation triplet features of each predicate, which is critical for robust SGG. To this end, we propose a novel Compositional Feature Augmentation (CFA) strategy, which is the first unbiased SGG work to mitigate the bias issue from the perspective of increasing the diversity of triplet features. Specifically, we first decompose each relation triplet feature into two components: intrinsic feature and extrinsic feature, which correspond to the intrinsic characteristics and extrinsic contexts of a relation triplet, respectively. Then, we design two different feature augmentation modules to enrich the feature diversity of original relation triplets by replacing or mixing up either their intrinsic or extrinsic features from other samples. Due to its model-agnostic nature, CFA can be seamlessly incorporated into various SGG frameworks. Extensive ablations have shown that CFA achieves a new state-of-the-art performance on the trade-off between different metrics.
引用
收藏
页码:21628 / 21638
页数:11
相关论文
共 50 条
  • [1] Relation-Specific Feature Augmentation for unbiased scene graph generation
    Liu, Zhihong
    Wang, Jianji
    Chen, Hui
    Ma, Yongqiang
    Zheng, Nanning
    PATTERN RECOGNITION, 2025, 157
  • [2] Adaptive Feature Learning for Unbiased Scene Graph Generation
    Yang, Jiarui
    Wang, Chuan
    Yang, Liang
    Jiang, Yuchen
    Cao, Angelina
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2252 - 2265
  • [3] Fast Contextual Scene Graph Generation with Unbiased Context Augmentation
    Jin, Tianlei
    Guo, Fangtai
    Meng, Qiwei
    Zhu, Shiqiang
    Xi, Xiangming
    Wang, Wen
    Mu, Zonghao
    Song, Wei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6302 - 6311
  • [4] TEMPLATE-GUIDED DATA AUGMENTATION FOR UNBIASED SCENE GRAPH GENERATION
    Zang, Yujie
    Li, Yaochen
    Cao, Luguang
    Lu, Ruitao
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3510 - 3514
  • [5] Unbiased Scene Graph Generation in Videos
    Nag, Sayak
    Min, Kyle
    Tripathi, Subama
    Roy-Chowdhury, Amit K.
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22803 - 22813
  • [6] State-Aware Compositional Learning Toward Unbiased Training for Scene Graph Generation
    He, Tao
    Gao, Lianli
    Song, Jingkuan
    Li, Yuan-Fang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 43 - 56
  • [7] Unbiased Scene Graph Generation from Biased Training
    Tang, Kaihua
    Niu, Yulei
    Huang, Jianqiang
    Shi, Jiaxin
    Zhang, Hanwang
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3713 - 3722
  • [8] Unbiased Scene Graph Generation Using Predicate Similarities
    Matsui, Yusuke
    Ohashi, Misaki
    IEEE ACCESS, 2024, 12 : 95507 - 95516
  • [9] Vision Relation Transformer for Unbiased Scene Graph Generation
    Sudhakaran, Gopika
    Dhami, Devendra Singh
    Kersting, Kristian
    Roth, Stefan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21825 - 21836
  • [10] Geometric and Semantic Improvement for Unbiased Scene Graph Generation
    Zhang, Ruhui
    Xu, Pengcheng
    Kang, Kang
    Yang, You
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2023, 17 (10): : 2643 - 2657