Unbiased Scene Graph Generation Using Predicate Similarities

被引:0
|
作者
Matsui, Yusuke [1 ]
Ohashi, Misaki [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Informat & Commun Engn, Bunkyo Ku, Tokyo 1138656, Japan
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Task analysis; Knowledge transfer; Feature extraction; Visualization; Training; Computer vision; Transfer learning; Bioinformatics; Genomics; Classification algorithms; Scene classification; Scene graph; unbiased generation; predicate similarities; transfer learning; long-tailed distribution; SMOTE;
D O I
10.1109/ACCESS.2024.3424230
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene Graphs are widely applied in computer vision as a graphical representation of relationships between objects shown in images. However, these applications have not yet reached a practical stage of development owing to biased training caused by long-tailed predicate distributions. In recent years, many studies have tackled this problem. In contrast, relatively few works have considered predicate similarities as a unique dataset feature which also leads to the biased prediction. Due to the feature, infrequent predicates (e.g., "parked on", "covered in") are easily misclassified as closely-related frequent predicates (e.g., "on", "in"). Utilizing predicate similarities, we propose a new classification scheme that branches the process to several fine-grained classifiers for similar predicate groups. The classifiers aim to capture the differences among similar predicates in detail. We also introduce the idea of transfer learning to enhance the features for the predicates which lack sufficient training samples to learn the descriptive representations. Our target here is to improve the average precision scores even for the instances with the tail predicators. The results of extensive experiments on the Visual Genome dataset show that the combination of our method and an existing debiasing approach greatly improves performance on tail predicates in challenging SGCls/SGDet tasks. Nonetheless, the overall performance of the proposed approach does not reach that of the current state of the art, so further analysis remains necessary as future work.
引用
收藏
页码:95507 / 95516
页数:10
相关论文
共 50 条
  • [21] CogTree: Cognition Tree Loss for Unbiased Scene Graph Generation
    Yu, Jing
    Chai, Yuan
    Wang, Yujing
    Hu, Yue
    Wu, Qi
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1274 - 1280
  • [22] Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation
    Li, Rongjie
    Zhang, Songyang
    Wan, Bo
    He, Xuming
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11104 - 11114
  • [23] PANET: A CONTEXT BASED PREDICATE ASSOCIATION NETWORK FOR SCENE GRAPH GENERATION
    Chen, Yunian
    Wang, Yanjie
    Zhang, Yang
    Guo, Yanwen
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 508 - 513
  • [24] Addressing Predicate Overlap in Scene Graph Generation with Semantic Granularity Controller
    Chen, Guikun
    Li, Lin
    Luo, Yawei
    Xiao, Jun
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 78 - 83
  • [25] TEMPLATE-GUIDED DATA AUGMENTATION FOR UNBIASED SCENE GRAPH GENERATION
    Zang, Yujie
    Li, Yaochen
    Cao, Luguang
    Lu, Ruitao
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3510 - 3514
  • [26] Knowledge-Enhanced Context Representation for Unbiased Scene Graph Generation
    Wang, Yuanlong
    Liu, Zhenqi
    Zhang, Hu
    Li, Ru
    WEB AND BIG DATA, APWEB-WAIM 2024, PT I, 2024, 14961 : 248 - 263
  • [27] Relation-Specific Feature Augmentation for unbiased scene graph generation
    Liu, Zhihong
    Wang, Jianji
    Chen, Hui
    Ma, Yongqiang
    Zheng, Nanning
    PATTERN RECOGNITION, 2025, 157
  • [28] Evidential Representation Proposal for Predicate Classification Output Logits in Scene Graph Generation
    Kunitomo-Jacquin, Lucie
    Fukuda, Ken
    ARTIFICIAL INTELLIGENCE IN HCI, PT I, AI-HCI 2024, 2024, 14734 : 391 - 402
  • [29] Improving Predicate Representation in Scene Graph Generation by Self-Supervised Learning
    Hasegawa, So
    Hiromoto, Masayuki
    Nakagawa, Akira
    Umeda, Yuhei
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2739 - 2748
  • [30] Unbiased Scene Graph Generation via Two-Stage Causal Modeling
    Sun, Shuzhou
    Zhi, Shuaifeng
    Liao, Qing
    Heikkila, Janne
    Liu, Li
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12562 - 12580