Scene Graph Prediction with Limited Labels

被引:3
|
作者
Chen, Vincent S. [1 ]
Varma, Paroma [1 ]
Krishna, Ranjay [1 ]
Bernstein, Michael [1 ]
Re, Christopher [1 ]
Fei-Fei, Li [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
关键词
D O I
10.1109/ICCVW.2019.00220
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual knowledge bases such as Visual Genome power numerous applications in computer vision, including visual question answering and captioning, but suffer from sparse, incomplete relationships. All scene graph models to date are limited to training on a small set of visual relationships that have thousands of training labels each. Hiring human annotators is expensive, and using textual knowledge base completion methods are incompatible with visual data. In this paper, we introduce a semi-supervised method that assigns probabilistic relationship labels to a large number of unlabeled images using few labeled examples. We analyze visual relationships to suggest two types of image-agnostic features that are used to generate noisy heuristics, whose outputs are aggregated using a factor graph-based generative model. With as few as 10 labeled examples per relationship, the generative model creates enough training data to train any existing state-of-the-art scene graph model. We demonstrate that our method outperforms all baseline approaches on scene graph prediction by 5.16 recall@ 100 for PREDCLS. In our limited label setting, we define a complexity metric for relationships that serves as an indicator (R-2 = 0.778) for conditions under which our method succeeds over transfer learning, the de facto approach for training with limited labels.
引用
收藏
页码:1772 / 1782
页数:11
相关论文
共 50 条
  • [1] Scene Graph Prediction with Limited Labels
    Chen, Vincent S.
    Varma, Paroma
    Krishna, Ranjay
    Bernstein, Michael
    Re, Christopher
    Li Fei-Fei
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2580 - 2590
  • [2] Not All Relations are Equal: Mining Informative Labels for Scene Graph Generation
    Goel, Arushi
    Fernando, Basura
    Keller, Frank
    Bilen, Hakan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15575 - 15585
  • [3] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation
    Li, Lin
    Chen, Long
    Huang, Yifeng
    Zhang, Zhimeng
    Zhang, Songyang
    Xiao, Jun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18847 - 18856
  • [4] Generative Compositional Augmentations for Scene Graph Prediction
    Knyazev, Boris
    de Vries, Harm
    Cangea, Catalina
    Taylor, Graham W.
    Courville, Aaron
    Belilovsky, Eugene
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15807 - 15817
  • [5] A Balanced Relation Prediction Framework for Scene Graph Generation
    Xu, Kai
    Wang, Lichun
    Li, Shuang
    Zhang, Huiyong
    Yin, Baocai
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 216 - 228
  • [6] Dual Scene Graph Convolutional Network for Motivation Prediction
    Wanyan, Yuyang
    Yang, Xiaoshan
    Ma, Xuan
    Xu, Changsheng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (03)
  • [7] Scene Graph Lossless Compression with Adaptive Prediction for Objects and Relations
    Lin, Weiyao
    Zhang, Yufeng
    Dai, Wenrui
    Liu, Huabin
    See, John
    Xiong, Hongkai
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [8] Zero-Shot Predicate Prediction for Scene Graph Parsing
    Li, Yiming
    Yang, Xiaoshan
    Huang, Xuhui
    Ma, Zhe
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3140 - 3153
  • [9] Contrastive Graph Poisson Networks: Semi-Supervised Learning with Extremely Limited Labels
    Wan, Sheng
    Zhan, Yibing
    Liu, Liu
    Yu, Baosheng
    Pan, Shirui
    Gong, Chen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] 3D scene graph prediction from point clouds
    Wu F.
    Yan F.
    Shi W.
    Zhou Z.
    Virtual Reality and Intelligent Hardware, 2022, 4 (01): : 76 - 88