A Novel Two-Stage Training Method for Unbiased Scene Graph Generation via Distribution

被引:0
|
作者
Jia, Dongdong [1 ]
Zhou, Meili [1 ]
Wei, Wei [2 ,3 ]
Wang, Dong [1 ]
Bai, Zongwen [1 ]
机构
[1] Yanan Univ, Sch Phys & Elect Informat, Yanan 716000, Shaanxi, Peoples R China
[2] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
[3] Shaanxi Key Lab Network Comp & Secur Technol, Xian, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Scene Graph Generation; Transformer-based Architecture; Distribution Alignment; Model-independent; Visual Genome Dataset;
D O I
10.3837/tiis.2023.12.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene graphs serve as semantic abstractions of images and play a crucial role in enhancing visual comprehension and reasoning. However, the performance of Scene Graph Generation is often compromised when working with biased data in real-world situations. While many existing systems focus on a single stage of learning for both feature extraction and classification, some employ Class-Balancing strategies, such as Re-weighting, Data Resampling, and Transfer Learning from head to tail. In this paper, we propose a novel approach that decouples the feature extraction and classification phases of the scene graph generation process. For feature extraction, we leverage a transformer-based architecture and design an adaptive calibration function specifically for predicate classification. This function enables us to dynamically adjust the classification scores for each predicate category. Additionally, we introduce a Distribution Alignment technique that effectively balances the class distribution after the feature extraction phase reaches a stable state, thereby facilitating the retraining of the classification head. Importantly, our Distribution Alignment strategy is model-independent and does not require additional supervision, making it applicable to a wide range of SGG models. Using the scene graph diagnostic toolkit on Visual Genome and several popular models, we achieved significant improvements over the previous state-of-the-art methods with our model. Compared to the TDE model, our model improved mR@100 by 70.5% for PredCls, by 84.0% for SGCls, and by 97.6% for SGDet tasks.
引用
收藏
页码:3383 / 3397
页数:15
相关论文
共 50 条
  • [41] Dual-Branch Hybrid Learning Network for Unbiased Scene Graph Generation
    Zheng, Chaofan
    Gao, Lianli
    Lyu, Xinyu
    Zeng, Pengpeng
    El Saddik, Abdulmotaleb
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1743 - 1756
  • [42] Attention redirection transformer with semantic oriented learning for unbiased scene graph generation
    Zhang, Ruonan
    An, Gaoyun
    Cen, Yigang
    Ruan, Qiuqi
    PATTERN RECOGNITION, 2025, 158
  • [43] Bridging Visual and Textual Semantics: Towards Consistency for Unbiased Scene Graph Generation
    Zhang, Ruonan
    An, Gaoyun
    Hao, Yiqing
    Wu, Dapeng Oliver
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (11) : 7102 - 7119
  • [44] PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph Generation
    Yan, Shaotian
    Shen, Chen
    Jin, Zhongming
    Huang, Jianqiang
    Jiang, Rongxin
    Chen, Yaowu
    Hua, Xian-Sheng
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 265 - 273
  • [45] A two-stage decomposition method on fresh product distribution problem
    Hu, Hongtao
    Zhang, Ye
    Zhen, Lu
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2017, 55 (16) : 4729 - 4752
  • [46] Text to Image Synthesis Using Two-Stage Generation and Two-Stage Discrimination
    Zhang, Zhiqiang
    Zhang, Yunye
    Yu, Wenxin
    He, Gang
    Jiang, Ning
    He, Gang
    Fan, Yibo
    Yang, Zhuo
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT II, 2019, 11776 : 110 - 114
  • [47] Two-stage graph matching point cloud registration method based on graph attention network
    Guo, Jiacheng
    Liu, Xuejun
    Zhang, Shuo
    Yan, Yong
    Sha, Yun
    Jiang, Yinan
    JOURNAL OF APPLIED REMOTE SENSING, 2024, 18 (03)
  • [48] Superpixels with Content-Awareness via a Two-Stage Generation Framework
    Li, Cheng
    Liao, Nannan
    Huang, Zhe
    Bian, He
    Zhang, Zhe
    Ren, Long
    SYMMETRY-BASEL, 2024, 16 (08):
  • [49] Parameter estimation via artificial data generation with the "two-stage" approach
    Garatti, Simone
    Bittanti, Sergio
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 5605 - 5610
  • [50] Unbiased estimation of selected treatment means in two-stage trials
    Bowden, Jack
    Glimm, Ekkehard
    BIOMETRICAL JOURNAL, 2008, 50 (04) : 515 - 527