Compositional Feature Augmentation for Unbiased Scene Graph Generation

被引：11

作者：

Li, Lin ^{[1
,2
]}

Chen, Guikun ^{[1
]}

Xiao, Jun ^{[1
]}

Yang, Yi ^{[1
]}

Wang, Chunping ^{[3
]}

Chen, Long ^{[2
]}

机构：

[1] Zhejiang Univ, Hangzhou, Peoples R China

[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China

[3] FinVolut, Shanghai, Peoples R China

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/ICCV51070.2023.01982

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Scene Graph Generation (SGG) aims to detect all the visual relation triplets <sub, pred, obj> in a given image. With the emergence of various advanced techniques for better utilizing both the intrinsic and extrinsic information in each relation triplet, SGG has achieved great progress over the recent years. However, due to the ubiquitous long-tailed predicate distributions, today's SGG models are still easily biased to the head predicates. Currently, the most prevalent debiasing solutions for SGG are re-balancing methods, e.g., changing the distributions of original training samples. In this paper, we argue that all existing re-balancing strategies fail to increase the diversity of the relation triplet features of each predicate, which is critical for robust SGG. To this end, we propose a novel Compositional Feature Augmentation (CFA) strategy, which is the first unbiased SGG work to mitigate the bias issue from the perspective of increasing the diversity of triplet features. Specifically, we first decompose each relation triplet feature into two components: intrinsic feature and extrinsic feature, which correspond to the intrinsic characteristics and extrinsic contexts of a relation triplet, respectively. Then, we design two different feature augmentation modules to enrich the feature diversity of original relation triplets by replacing or mixing up either their intrinsic or extrinsic features from other samples. Due to its model-agnostic nature, CFA can be seamlessly incorporated into various SGG frameworks. Extensive ablations have shown that CFA achieves a new state-of-the-art performance on the trade-off between different metrics.

引用

页码：21628 / 21638

页数：11

共 50 条

[1] Relation-Specific Feature Augmentation for unbiased scene graph generation
Liu, Zhihong
Wang, Jianji
Chen, Hui
Ma, Yongqiang
Zheng, Nanning
PATTERN RECOGNITION, 2025, 157
[2] Adaptive Feature Learning for Unbiased Scene Graph Generation
Yang, Jiarui
Wang, Chuan
Yang, Liang
Jiang, Yuchen
Cao, Angelina
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2252 - 2265
[3] Fast Contextual Scene Graph Generation with Unbiased Context Augmentation
Jin, Tianlei
Guo, Fangtai
Meng, Qiwei
Zhu, Shiqiang
Xi, Xiangming
Wang, Wen
Mu, Zonghao
Song, Wei
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6302 - 6311
[4] TEMPLATE-GUIDED DATA AUGMENTATION FOR UNBIASED SCENE GRAPH GENERATION
Zang, Yujie
Li, Yaochen
Cao, Luguang
Lu, Ruitao
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3510 - 3514
[5] Unbiased Scene Graph Generation in Videos
Nag, Sayak
Min, Kyle
Tripathi, Subama
Roy-Chowdhury, Amit K.
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22803 - 22813
[6] State-Aware Compositional Learning Toward Unbiased Training for Scene Graph Generation
He, Tao
Gao, Lianli
Song, Jingkuan
Li, Yuan-Fang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 43 - 56
[7] Unbiased Scene Graph Generation from Biased Training
Tang, Kaihua
Niu, Yulei
Huang, Jianqiang
Shi, Jiaxin
Zhang, Hanwang
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3713 - 3722
[8] Unbiased Scene Graph Generation Using Predicate Similarities
Matsui, Yusuke
Ohashi, Misaki
IEEE ACCESS, 2024, 12 : 95507 - 95516
[9] Vision Relation Transformer for Unbiased Scene Graph Generation
Sudhakaran, Gopika
Dhami, Devendra Singh
Kersting, Kristian
Roth, Stefan
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21825 - 21836
[10] Geometric and Semantic Improvement for Unbiased Scene Graph Generation
Zhang, Ruhui
Xu, Pengcheng
Kang, Kang
Yang, You
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2023, 17 (10): : 2643 - 2657

← 1 2 3 4 5 →