A Novel Two-Stage Training Method for Unbiased Scene Graph Generation via Distribution

Cited by: 0
Authors
Jia, Dongdong [1]
Zhou, Meili [1]
Wei, Wei [2,3]
Wang, Dong [1]
Bai, Zongwen [1]
Affiliations
[1] Yanan Univ, Sch Phys & Elect Informat, Yanan 716000, Shaanxi, Peoples R China
[2] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
[3] Shaanxi Key Lab Network Comp & Secur Technol, Xian, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
Scene Graph Generation; Transformer-based Architecture; Distribution Alignment; Model-independent; Visual Genome Dataset;
DOI
10.3837/tiis.2023.12.009
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Scene graphs serve as semantic abstractions of images and play a crucial role in enhancing visual comprehension and reasoning. However, the performance of Scene Graph Generation (SGG) is often compromised by the biased data encountered in real-world situations. Many existing systems learn feature extraction and classification in a single stage, and some add class-balancing strategies such as re-weighting, data resampling, and transfer learning from head to tail classes. In this paper, we propose a novel approach that decouples the feature extraction and classification phases of the scene graph generation process. For feature extraction, we leverage a transformer-based architecture. For predicate classification, we design an adaptive calibration function that dynamically adjusts the classification scores for each predicate category. Additionally, we introduce a Distribution Alignment technique that balances the class distribution once the feature extraction phase has reached a stable state, thereby facilitating the retraining of the classification head. Importantly, our Distribution Alignment strategy is model-independent and does not require additional supervision, making it applicable to a wide range of SGG models. Using the scene graph diagnostic toolkit on Visual Genome with several popular models, our method achieved significant improvements over previous state-of-the-art methods. Compared to the TDE model, our model improved mR@100 by 70.5% for PredCls, by 84.0% for SGCls, and by 97.6% for SGDet tasks.
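The abstract does not state the form of the adaptive calibration function or of the alignment objective, so the following is only a minimal sketch of the general two-stage idea. It assumes a per-class affine calibration (a scale and an offset per predicate) applied to logits from a frozen feature extractor, and an inverse-frequency class weight in the loss. All function names, the exponent `rho`, and the example predicate counts are hypothetical illustrations, not the authors' implementation.

```python
import math


def class_weights(counts, rho=1.0):
    """Inverse-frequency weights, normalized so their mean is 1.

    `counts` holds the number of training samples per predicate class.
    `rho` is a hypothetical exponent: 0 gives uniform weights, larger
    values favour rare (tail) predicates more strongly.
    """
    inv = [(1.0 / c) ** rho for c in counts]
    mean = sum(inv) / len(inv)
    return [w / mean for w in inv]


def calibrate(logits, scale, offset):
    """Apply an assumed per-class affine calibration to stage-one logits."""
    return [a * z + b for z, a, b in zip(logits, scale, offset)]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def weighted_nll(logits, label, scale, offset, weights):
    """Class-weighted negative log-likelihood of the calibrated scores.

    In a second training stage, only `scale` and `offset` would be
    updated to reduce this loss; the feature extractor that produced
    `logits` stays frozen.
    """
    probs = softmax(calibrate(logits, scale, offset))
    return -weights[label] * math.log(probs[label])


if __name__ == "__main__":
    # Hypothetical counts for a head, a body, and a tail predicate.
    counts = [1000, 100, 10]
    weights = class_weights(counts)
    logits = [2.0, 1.0, 0.5]  # hypothetical stage-one logits
    identity = ([1.0, 1.0, 1.0], [0.0, 0.0, 0.0])
    boosted = ([1.0, 1.0, 1.0], [0.0, 0.0, 1.0])  # raise the tail class
    print(softmax(calibrate(logits, *identity)))
    print(softmax(calibrate(logits, *boosted)))
    print(weighted_nll(logits, 2, *boosted, weights))
```

Because the calibration acts only on the output scores, a sketch of this kind can sit on top of any classifier that produces per-predicate logits, which is consistent with the abstract's description of the strategy as model-independent.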
Pages: 3383-3397
Page count: 15