Compositional Feature Augmentation for Unbiased Scene Graph Generation

被引:11
|
作者
Li, Lin [1 ,2 ]
Chen, Guikun [1 ]
Xiao, Jun [1 ]
Yang, Yi [1 ]
Wang, Chunping [3 ]
Chen, Long [2 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] FinVolut, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV51070.2023.01982
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene Graph Generation (SGG) aims to detect all the visual relation triplets <sub, pred, obj> in a given image. With the emergence of various advanced techniques for better utilizing both the intrinsic and extrinsic information in each relation triplet, SGG has achieved great progress over the recent years. However, due to the ubiquitous long-tailed predicate distributions, today's SGG models are still easily biased to the head predicates. Currently, the most prevalent debiasing solutions for SGG are re-balancing methods, e.g., changing the distributions of original training samples. In this paper, we argue that all existing re-balancing strategies fail to increase the diversity of the relation triplet features of each predicate, which is critical for robust SGG. To this end, we propose a novel Compositional Feature Augmentation (CFA) strategy, which is the first unbiased SGG work to mitigate the bias issue from the perspective of increasing the diversity of triplet features. Specifically, we first decompose each relation triplet feature into two components: intrinsic feature and extrinsic feature, which correspond to the intrinsic characteristics and extrinsic contexts of a relation triplet, respectively. Then, we design two different feature augmentation modules to enrich the feature diversity of original relation triplets by replacing or mixing up either their intrinsic or extrinsic features from other samples. Due to its model-agnostic nature, CFA can be seamlessly incorporated into various SGG frameworks. Extensive ablations have shown that CFA achieves a new state-of-the-art performance on the trade-off between different metrics.
引用
收藏
页码:21628 / 21638
页数:11
相关论文
共 50 条
  • [31] HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation
    Zhou, Zijian
    Shi, Miaojing
    Caesar, Holger
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21580 - 21591
  • [32] Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation
    Dong, Xingning
    Gan, Tian
    Song, Xuemeng
    Wu, Jianlong
    Cheng, Yuan
    Nie, Liqiang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19405 - 19414
  • [33] Skew Class-Balanced Re-Weighting for Unbiased Scene Graph Generation
    Kang, Haeyong
    Yoo, Chang D.
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (01): : 287 - 303
  • [34] Compositional Transformers for Scene Generation
    Hudson, Drew A.
    Zitnick, C. Lawrence
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [35] DBiased-P: Dual-Biased Predicate Predictor for Unbiased Scene Graph Generation
    Han, Xianjing
    Song, Xuemeng
    Dong, Xingning
    Wei, Yinwei
    Liu, Meng
    Nie, Liqiang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5319 - 5329
  • [36] A Novel Two-Stage Training Method for Unbiased Scene Graph Generation via Distribution
    Jia, Dongdong
    Zhou, Meili
    Wei, Wei
    Wang, Dong
    Bai, Zongwen
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2023, 17 (12): : 3383 - 3397
  • [37] Semantic Diversity-Aware Prototype-Based Learning for Unbiased Scene Graph Generation
    Jeon, Jaehyeong
    Kim, Kibum
    Yoon, Kanghoon
    Park, Chanyoung
    COMPUTER VISION - ECCV 2024, PT XXXVII, 2025, 15095 : 379 - 395
  • [38] Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation
    Jeon, Jaehyeong
    Kim, Kibum
    Yoon, Kanghoon
    Park, Chanyoung
    arXiv,
  • [39] Unbiased Heterogeneous Scene Graph Generation with Relation-Aware Message Passing Neural Network
    Yoon, Kanghoon
    Kim, Kibum
    Moon, Jinyoung
    Park, Chanyoung
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3285 - 3294
  • [40] Generative Compositional Augmentations for Scene Graph Prediction
    Knyazev, Boris
    de Vries, Harm
    Cangea, Catalina
    Taylor, Graham W.
    Courville, Aaron
    Belilovsky, Eugene
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15807 - 15817