A Novel Two-Stage Training Method for Unbiased Scene Graph Generation via Distribution

被引:0
|
作者
Jia, Dongdong [1 ]
Zhou, Meili [1 ]
Wei, Wei [2 ,3 ]
Wang, Dong [1 ]
Bai, Zongwen [1 ]
机构
[1] Yanan Univ, Sch Phys & Elect Informat, Yanan 716000, Shaanxi, Peoples R China
[2] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
[3] Shaanxi Key Lab Network Comp & Secur Technol, Xian, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Scene Graph Generation; Transformer-based Architecture; Distribution Alignment; Model-independent; Visual Genome Dataset;
D O I
10.3837/tiis.2023.12.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene graphs serve as semantic abstractions of images and play a crucial role in enhancing visual comprehension and reasoning. However, the performance of Scene Graph Generation is often compromised when working with biased data in real-world situations. While many existing systems focus on a single stage of learning for both feature extraction and classification, some employ Class-Balancing strategies, such as Re-weighting, Data Resampling, and Transfer Learning from head to tail. In this paper, we propose a novel approach that decouples the feature extraction and classification phases of the scene graph generation process. For feature extraction, we leverage a transformer-based architecture and design an adaptive calibration function specifically for predicate classification. This function enables us to dynamically adjust the classification scores for each predicate category. Additionally, we introduce a Distribution Alignment technique that effectively balances the class distribution after the feature extraction phase reaches a stable state, thereby facilitating the retraining of the classification head. Importantly, our Distribution Alignment strategy is model-independent and does not require additional supervision, making it applicable to a wide range of SGG models. Using the scene graph diagnostic toolkit on Visual Genome and several popular models, we achieved significant improvements over the previous state-of-the-art methods with our model. Compared to the TDE model, our model improved mR@100 by 70.5% for PredCls, by 84.0% for SGCls, and by 97.6% for SGDet tasks.
引用
收藏
页码:3383 / 3397
页数:15
相关论文
共 50 条
  • [1] Unbiased Scene Graph Generation via Two-Stage Causal Modeling
    Sun, Shuzhou
    Zhi, Shuaifeng
    Liao, Qing
    Heikkila, Janne
    Liu, Li
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12562 - 12580
  • [2] Unbiased Scene Graph Generation from Biased Training
    Tang, Kaihua
    Niu, Yulei
    Huang, Jianqiang
    Shi, Jiaxin
    Zhang, Hanwang
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3713 - 3722
  • [3] An Effective Dynamic Reweighting Method for Unbiased Scene Graph Generation
    Hu, Lingfeng
    Liu, Si
    Wang, Hanzi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 345 - 356
  • [4] Unbiased Scene Graph Generation in Videos
    Nag, Sayak
    Min, Kyle
    Tripathi, Subama
    Roy-Chowdhury, Amit K.
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22803 - 22813
  • [5] A New Training Data Organization Form and Training Mode for Unbiased Scene Graph Generation
    Xu, Hongbo
    Wang, Lichun
    Xu, Kai
    Fu, Fangyu
    Yin, Baocai
    Huang, Qingming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5295 - 5305
  • [6] Resistance Training Using Prior Bias: Toward Unbiased Scene Graph Generation
    Chen, Chao
    Zhan, Yibing
    Yu, Baosheng
    Liu, Liu
    Luo, Yong
    Du, Bo
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 212 - 220
  • [7] Unbiased scene graph generation using the self-distillation method
    Sun, Bo
    Hao, Zhuo
    Yu, Lejun
    He, Jun
    VISUAL COMPUTER, 2024, 40 (04): : 2381 - 2390
  • [8] Unbiased scene graph generation using the self-distillation method
    Bo Sun
    Zhuo Hao
    Lejun Yu
    Jun He
    The Visual Computer, 2024, 40 : 2381 - 2390
  • [9] Two-Stage Training of Graph Neural Networks for Graph Classification
    Do, Manh Tuan
    Park, Noseong
    Shin, Kijung
    NEURAL PROCESSING LETTERS, 2023, 55 (03) : 2799 - 2823
  • [10] Two-Stage Training of Graph Neural Networks for Graph Classification
    Manh Tuan Do
    Noseong Park
    Kijung Shin
    Neural Processing Letters, 2023, 55 : 2799 - 2823