Improving Predicate Representation in Scene Graph Generation by Self-Supervised Learning

被引:0
|
作者
Hasegawa, So [1 ]
Hiromoto, Masayuki [1 ]
Nakagawa, Akira [1 ]
Umeda, Yuhei [1 ]
机构
[1] Fujitsu Ltd, Tokyo, Japan
关键词
D O I
10.1109/WACV56688.2023.00276
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene graph generation (SGG) aims to understand sophisticated visual information by detecting triplets of subject, object, and their relationship (predicate). Since the predicate labels are heavily imbalanced, existing supervised methods struggle to improve accuracy for the rare predicates due to insufficient labeled data. In this paper, we propose SePiR, a novel self-supervised learning method for SGG to improve the representation of rare predicates. We first train a relational encoder by contrastive learning without using predicate labels, and then fine-tune a predicate classifier with labeled data. To apply contrastive learning to SGG, we newly propose data augmentation in which subject-object pairs are augmented by replacing their visual features with those from other images having the same object labels. By such augmentation, we can increase the variation of the visual features while keeping the relationship between the objects. Comprehensive experimental results on the Visual Genome dataset show that the SGG performance of SePiR is comparable to the state-of-the-art, and especially with the limited labeled dataset, our method significantly outperforms the existing supervised methods. Moreover, SePiR's improved representation enables the model architecture simpler, resulting in 3.6x and 6.3x reduction of the parameters and inference time from the existing method, independently.
引用
收藏
页码:2739 / 2748
页数:10
相关论文
共 50 条
  • [31] Self-supervised Hierarchical Graph Neural Network for Graph Representation
    Bandyopadhyay, Sambaran
    Aggarwal, Manasvi
    Murty, M. Narasimha
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 603 - 608
  • [32] Dual-channel graph contrastive learning for self-supervised graph-level representation learning
    Luo, Zhenfei
    Dong, Yixiang
    Zheng, Qinghua
    Liu, Huan
    Luo, Minnan
    PATTERN RECOGNITION, 2023, 139
  • [33] Self-supervised Graph Learning with Segmented Graph Channels
    Gao, Hang
    Li, Jiangmeng
    Zheng, Changwen
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT II, 2023, 13714 : 293 - 308
  • [34] Cross-View Masked Model for Self-Supervised Graph Representation Learning
    Duan H.
    Yu B.
    Xie C.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (11): : 1 - 13
  • [35] Chemistry-Wise Augmentations for Molecule Graph Self-supervised Representation Learning
    Ondar, Evgeniia
    Makarov, Ilya
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2023, PT II, 2023, 14135 : 327 - 336
  • [36] GAN-based self-supervised message passing graph representation learning
    Yang, Yining
    Xu, Ke
    Tang, Ying
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
  • [37] SimGRL: a simple self-supervised graph representation learning framework via triplets
    Da Huang
    Fangyuan Lei
    Xi Zeng
    Complex & Intelligent Systems, 2023, 9 : 5049 - 5062
  • [38] Scalable self-supervised graph representation learning via enhancing and contrasting subgraphs
    Yizhu Jiao
    Yun Xiong
    Jiawei Zhang
    Yao Zhang
    Tianqi Zhang
    Yangyong Zhu
    Knowledge and Information Systems, 2022, 64 : 235 - 260
  • [39] Siamese Network Based Multiscale Self-Supervised Heterogeneous Graph Representation Learning
    Chen, Zijun
    Luo, Lihui
    Li, Xunkai
    Jiang, Bin
    Guo, Qiang
    Wang, Chunpeng
    IEEE ACCESS, 2022, 10 : 98490 - 98500
  • [40] Self-Supervised Graph Representation Learning Method Based on Data and Feature Augmentation
    Xu, Yunfeng
    Fan, Hexun
    Computer Engineering and Applications, 2024, 60 (17) : 148 - 157