Improving Predicate Representation in Scene Graph Generation by Self-Supervised Learning

被引：0

作者：

Hasegawa, So ^{[1
]}

Hiromoto, Masayuki ^{[1
]}

Nakagawa, Akira ^{[1
]}

Umeda, Yuhei ^{[1
]}

机构：

[1] Fujitsu Ltd, Tokyo, Japan

来源：

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2023年

关键词：

D O I：

10.1109/WACV56688.2023.00276

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Scene graph generation (SGG) aims to understand sophisticated visual information by detecting triplets of subject, object, and their relationship (predicate). Since the predicate labels are heavily imbalanced, existing supervised methods struggle to improve accuracy for the rare predicates due to insufficient labeled data. In this paper, we propose SePiR, a novel self-supervised learning method for SGG to improve the representation of rare predicates. We first train a relational encoder by contrastive learning without using predicate labels, and then fine-tune a predicate classifier with labeled data. To apply contrastive learning to SGG, we newly propose data augmentation in which subject-object pairs are augmented by replacing their visual features with those from other images having the same object labels. By such augmentation, we can increase the variation of the visual features while keeping the relationship between the objects. Comprehensive experimental results on the Visual Genome dataset show that the SGG performance of SePiR is comparable to the state-of-the-art, and especially with the limited labeled dataset, our method significantly outperforms the existing supervised methods. Moreover, SePiR's improved representation enables the model architecture simpler, resulting in 3.6x and 6.3x reduction of the parameters and inference time from the existing method, independently.

引用

页码：2739 / 2748

页数：10

共 50 条

[21] GMAEEG: A Self-Supervised Graph Masked Autoencoder for EEG Representation Learning
Fu, Zanhao
Zhu, Huaiyu
Zhao, Yisheng
Huan, Ruohong
Zhang, Yi
Chen, Shuohui
Pan, Yun
IEEE Journal of Biomedical and Health Informatics, 2024, 28 (11): : 6486 - 6497
[22] Self-supervised Graph Learning for Recommendation
Wu, Jiancan
Wang, Xiang
Feng, Fuli
He, Xiangnan
Chen, Liang
Lian, Jianxun
Xie, Xing
SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 726 - 735
[23] SCENE REPRESENTATION LEARNING FROM VIDEOS USING SELF-SUPERVISED AND WEAKLY-SUPERVISED TECHNIQUES
Peri, Raghuveer
Parthasarathy, Srinivas
Sundaram, Shiva
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1671 - 1675
[24] Graph Adversarial Self-Supervised Learning
Yang, Longqi
Zhang, Liangliang
Yang, Wenjing
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[25] Graph Self-Supervised Learning: A Survey
Liu, Yixin
Jin, Ming
Pan, Shirui
Zhou, Chuan
Zheng, Yu
Xia, Feng
Yu, Philip S.
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 5879 - 5900
[26] Whitening for Self-Supervised Representation Learning
Ermolov, Aleksandr
Siarohin, Aliaksandr
Sangineto, Enver
Sebe, Nicu
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[27] Self-Supervised Representation Learning for CAD
Jones, Benjamin T.
Hu, Michael
Kodnongbua, Milin
Kim, Vladimir G.
Schulz, Adriana
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21327 - 21336
[28] Improving Self-supervised Molecular Representation Learning using Persistent Homology
Luo, Yuankai
Shi, Lei
Thost, Veronika
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[29] Look Twice as Much as You Say: Scene Graph Contrastive Learning for Self-Supervised Image Caption Generation
Zhang, Chunhui
Huang, Chao
Li, Youhuan
Zhang, Xiangliang
Ye, Yanfang
Zhang, Chuxu
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 2519 - 2528
[30] Self-supervised contrastive graph representation with node and graph augmentation?
Duan, Haoran
Xie, Cheng
Li, Bin
Tang, Peng
NEURAL NETWORKS, 2023, 167 : 223 - 232

← 1 2 3 4 5 →