Scene Graph Prediction with Limited Labels

被引:3
|
作者
Chen, Vincent S. [1 ]
Varma, Paroma [1 ]
Krishna, Ranjay [1 ]
Bernstein, Michael [1 ]
Re, Christopher [1 ]
Fei-Fei, Li [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
关键词
D O I
10.1109/ICCVW.2019.00220
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual knowledge bases such as Visual Genome power numerous applications in computer vision, including visual question answering and captioning, but suffer from sparse, incomplete relationships. All scene graph models to date are limited to training on a small set of visual relationships that have thousands of training labels each. Hiring human annotators is expensive, and using textual knowledge base completion methods are incompatible with visual data. In this paper, we introduce a semi-supervised method that assigns probabilistic relationship labels to a large number of unlabeled images using few labeled examples. We analyze visual relationships to suggest two types of image-agnostic features that are used to generate noisy heuristics, whose outputs are aggregated using a factor graph-based generative model. With as few as 10 labeled examples per relationship, the generative model creates enough training data to train any existing state-of-the-art scene graph model. We demonstrate that our method outperforms all baseline approaches on scene graph prediction by 5.16 recall@ 100 for PREDCLS. In our limited label setting, we define a complexity metric for relationships that serves as an indicator (R-2 = 0.778) for conditions under which our method succeeds over transfer learning, the de facto approach for training with limited labels.
引用
收藏
页码:1772 / 1782
页数:11
相关论文
共 50 条
  • [41] Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction With Extremely Limited Labels
    Chen, Changrui
    Han, Jungong
    Debattista, Kurt
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5595 - 5611
  • [42] EasyLabels: weak labels for scene segmentation in laparoscopic videos
    Fuentes-Hurtado, Felix
    Kadkhodamohammadi, Abdolrahim
    Flouty, Evangello
    Barbarisi, Santiago
    Luengo, Imanol
    Stoyanov, Danail
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2019, 14 (07) : 1247 - 1257
  • [43] MSMatch: Semisupervised Multispectral Scene Classification With Few Labels
    Gomez, Pablo
    Meoni, Gabriele
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 11643 - 11654
  • [44] EasyLabels: weak labels for scene segmentation in laparoscopic videos
    Félix Fuentes-Hurtado
    Abdolrahim Kadkhodamohammadi
    Evangello Flouty
    Santiago Barbarisi
    Imanol Luengo
    Danail Stoyanov
    International Journal of Computer Assisted Radiology and Surgery, 2019, 14 : 1247 - 1257
  • [45] 3D Spatial Multimodal Knowledge Accumulation for Scene Graph Prediction in Point Cloud
    Feng, Mingtao
    Hou, Haoran
    Zhang, Liang
    Wu, Zijie
    Guo, Yulan
    Mian, Ajmal
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9182 - 9191
  • [46] Exploring and Exploiting the Hierarchical Structure of a Scene for Scene Graph Generation
    Kurosawa, Ikuto
    Kobayashi, Tetsunori
    Hayashi, Yoshihiko
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1422 - 1429
  • [47] Fine-Grained Scene Graph Generation via Sample-Level Bias Prediction
    Li, Yansheng
    Wang, Tingzhu
    Wu, Kang
    Wang, Linlin
    Guo, Xin
    Wang, Wenbin
    COMPUTER VISION - ECCV 2024, PT XXVI, 2025, 15084 : 18 - 35
  • [48] Disentangling Cognitive Diagnosis with Limited Exercise Labels
    Chen, Xiangzhi
    Wu, Le
    Liu, Fei
    Chen, Lei
    Zhang, Kun
    Hong, Richang
    Wang, Meng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [49] Learning Classifiers for Target Domain with Limited or No Labels
    Zhu, Pengkai
    Wang, Hanxiao
    Saligrama, Venkatesh
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [50] Large Scale Sentiment Learning with Limited Labels
    Iosifidis, Vasileios
    Ntoutsi, Eirini
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1823 - 1832