Self-Supervised Object Detection and Retrieval Using Unlabeled Videos

Cited by: 3
Authors
Amrani, Elad [1, 2]
Ben-Ari, Rami [1]
Shapira, Inbar [1]
Hakim, Tal [1]
Bronstein, Alex [2]
Affiliations
[1] IBM Res AI, Ruschlikon, Switzerland
[2] Technion, Haifa, Israel
DOI
10.1109/CVPRW50498.2020.00485
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Learning an object detection or retrieval system requires a large data set with manual annotations. Such data are expensive and time-consuming to create and therefore difficult to obtain on a large scale. In this work, we propose using the natural correlation in narrations and the visual presence of objects in video to learn an object detector and retriever without any manual labeling involved. We pose the problem as weakly supervised learning with noisy labels, and propose a novel object detection and retrieval paradigm under these constraints. We handle the background rejection by using contrastive samples and confront the high level of label noise with a new clustering score. Our evaluation is based on a set of ten objects with manual ground truth annotation in almost 5000 frames extracted from instructional videos from the web. We demonstrate superior results compared to state-of-the-art weakly-supervised approaches and report a strongly-labeled upper bound as well. While the focus of the paper is object detection and retrieval, the proposed methodology can be applied to a broader range of noisy weakly-supervised problems.
Pages: 4100-4108
Page count: 9
Related papers
50 in total
  • [1] Self-Supervised Object Detection from Egocentric Videos
    Akiva, Peri
    Huang, Jing
    Liang, Kevin J.
    Kovvuri, Rama
    Chen, Xingyu
    Feiszli, Matt
    Dana, Kristin
    Hassner, Tal
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 5202-5214
  • [2] Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D Videos
    Lu, Shiyang
    Deng, Yunfu
    Boularias, Abdeslam
    Bekris, Kostas
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023: 7017-7023
  • [3] A Self-supervised Learning System for Object Detection in Videos Using Random Walks on Graphs
    Tan, Juntao
    Song, Changkyu
    Boularias, Abdeslam
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021: 14061-14068
  • [4] The Retrieval of the Beautiful: Self-Supervised Salient Object Detection for Beauty Product Retrieval
    Wang, Jiawei
    Zhu, Shuai
    Xu, Jiao
    Cao, Da
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019: 2548-2552
  • [5] Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
    Chen, Brian
    Rouditchenko, Andrew
    Duarte, Kevin
    Kuehne, Hilde
    Thomas, Samuel
    Boggust, Angie
    Panda, Rameswar
    Kingsbury, Brian
    Feris, Rogerio
    Harwath, David
    Glass, James
    Picheny, Michael
    Chang, Shih-Fu
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 7992-8001
  • [6] Self-supervised Object-Centric Learning for Videos
    Aydemir, Gorkay
    Xie, Weidi
    Guney, Fatma
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [7] Identity from here, Pose from there: Self-supervised Disentanglement and Generation of Objects using Unlabeled Videos
    Xiao, Fanyi
    Liu, Haotian
    Lee, Yong Jae
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 7012-7021
  • [8] Object Detection with Self-Supervised Scene Adaptation
    Zhang, Zekun
    Hoai, Minh
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 21589-21599
  • [9] STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos
    Shah, Anshul
    Lundell, Benjamin
    Sawhney, Harpreet
    Chellappa, Rama
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023: 10341-10353
  • [10] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection
    Cao, Shengcao
    Joshi, Dhiraj
    Gui, Liang-Yan
    Wang, Yu-Xiong
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023