Scene Graph Prediction with Limited Labels

被引：3

作者：

Chen, Vincent S. ^{[1
]}

Varma, Paroma ^{[1
]}

Krishna, Ranjay ^{[1
]}

Bernstein, Michael ^{[1
]}

Re, Christopher ^{[1
]}

Fei-Fei, Li ^{[1
]}

机构：

[1] Stanford Univ, Stanford, CA 94305 USA

来源：

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW) | 2019年

关键词：

D O I：

10.1109/ICCVW.2019.00220

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Visual knowledge bases such as Visual Genome power numerous applications in computer vision, including visual question answering and captioning, but suffer from sparse, incomplete relationships. All scene graph models to date are limited to training on a small set of visual relationships that have thousands of training labels each. Hiring human annotators is expensive, and using textual knowledge base completion methods are incompatible with visual data. In this paper, we introduce a semi-supervised method that assigns probabilistic relationship labels to a large number of unlabeled images using few labeled examples. We analyze visual relationships to suggest two types of image-agnostic features that are used to generate noisy heuristics, whose outputs are aggregated using a factor graph-based generative model. With as few as 10 labeled examples per relationship, the generative model creates enough training data to train any existing state-of-the-art scene graph model. We demonstrate that our method outperforms all baseline approaches on scene graph prediction by 5.16 recall@ 100 for PREDCLS. In our limited label setting, we define a complexity metric for relationships that serves as an indicator (R-2 = 0.778) for conditions under which our method succeeds over transfer learning, the de facto approach for training with limited labels.

引用

页码：1772 / 1782

页数：11

共 50 条

[41] Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction With Extremely Limited Labels
Chen, Changrui
Han, Jungong
Debattista, Kurt
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5595 - 5611
[42] EasyLabels: weak labels for scene segmentation in laparoscopic videos
Fuentes-Hurtado, Felix
Kadkhodamohammadi, Abdolrahim
Flouty, Evangello
Barbarisi, Santiago
Luengo, Imanol
Stoyanov, Danail
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2019, 14 (07) : 1247 - 1257
[43] MSMatch: Semisupervised Multispectral Scene Classification With Few Labels
Gomez, Pablo
Meoni, Gabriele
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 11643 - 11654
[44] EasyLabels: weak labels for scene segmentation in laparoscopic videos
Félix Fuentes-Hurtado
Abdolrahim Kadkhodamohammadi
Evangello Flouty
Santiago Barbarisi
Imanol Luengo
Danail Stoyanov
International Journal of Computer Assisted Radiology and Surgery, 2019, 14 : 1247 - 1257
[45] 3D Spatial Multimodal Knowledge Accumulation for Scene Graph Prediction in Point Cloud
Feng, Mingtao
Hou, Haoran
Zhang, Liang
Wu, Zijie
Guo, Yulan
Mian, Ajmal
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9182 - 9191
[46] Exploring and Exploiting the Hierarchical Structure of a Scene for Scene Graph Generation
Kurosawa, Ikuto
Kobayashi, Tetsunori
Hayashi, Yoshihiko
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1422 - 1429
[47] Fine-Grained Scene Graph Generation via Sample-Level Bias Prediction
Li, Yansheng
Wang, Tingzhu
Wu, Kang
Wang, Linlin
Guo, Xin
Wang, Wenbin
COMPUTER VISION - ECCV 2024, PT XXVI, 2025, 15084 : 18 - 35
[48] Disentangling Cognitive Diagnosis with Limited Exercise Labels
Chen, Xiangzhi
Wu, Le
Liu, Fei
Chen, Lei
Zhang, Kun
Hong, Richang
Wang, Meng
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[49] Learning Classifiers for Target Domain with Limited or No Labels
Zhu, Pengkai
Wang, Hanxiao
Saligrama, Venkatesh
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[50] Large Scale Sentiment Learning with Limited Labels
Iosifidis, Vasileios
Ntoutsi, Eirini
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1823 - 1832

← 1 2 3 4 5 →