Spatially Invariant Unsupervised Object Detection with Convolutional Neural Networks

被引:0
|
作者
Crawford, Eric [1 ]
Pineau, Joelle [2 ]
机构
[1] McGill Univ, Mila, Montreal, PQ, Canada
[2] Facebook AI Res, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are many reasons to expect an ability to reason in terms of objects to be a crucial skill for any generally intelligent agent. Indeed, recent machine learning literature is replete with examples of the benefits of object-like representations: generalization, transfer to new tasks, and interpretability, among others. However, in order to reason in terms of objects, agents need a way of discovering and detecting objects in the visual world - a task which we call unsupervised object detection. This task has received significantly less attention in the literature than its supervised counterpart, especially in the case of large images containing many objects. In the current work, we develop a neural network architecture that effectively addresses this large-image, many-object setting. In particular, we combine ideas from Attend, Infer, Repeat (AIR), which performs unsupervised object detection but does not scale well, with recent developments in supervised object detection. We replace AIR's core recurrent network with a convolutional (and thus spatially invariant) network, and make use of an object-specification scheme that describes the location of objects with respect to local grid cells rather than the image as a whole. Through a series of experiments, we demonstrate a number of features of our architecture: that, unlike AIR, it is able to discover and detect objects in large, many-object scenes; that it has a significant ability to generalize to images that are larger and contain more objects than images encountered during training; and that it is able to discover and detect objects with enough accuracy to facilitate non-trivial downstream processing.
引用
收藏
页码:3412 / 3420
页数:9
相关论文
共 50 条
  • [1] Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection
    Cheng, Gong
    Han, Junwei
    Zhou, Peicheng
    Xu, Dong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) : 265 - 278
  • [2] Parallel Convolutional Neural Networks for Object Detection
    Olugboja, Adedeji
    Wang, Zenghui
    Sun, Yanxia
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2021, 12 (04) : 279 - 286
  • [3] Object Detection Using Convolutional Neural Networks
    Galvez, Reagan L.
    Bandala, Argel A.
    Dadios, Elmer P.
    Vicerra, Ryan Rhay P.
    Maningo, Jose Martin Z.
    [J]. PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 2023 - 2027
  • [4] Cascaded Convolutional Neural Networks for Object Detection
    Guo, Yajing
    Guo, Xiaoqiang
    Jiang, Zhuqing
    Zhou, Yun
    [J]. 2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [5] Unsupervised Hyperspectral Anomaly Detection with Convolutional Neural Networks
    Yilmaz, Fatma Nur
    Arisoy, Sertac
    Kayabol, Koray
    [J]. 29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [6] Unsupervised feature selection for multi-class object detection using convolutional neural networks
    Matsugu, M
    Cardon, P
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 1, 2004, 3173 : 864 - 869
  • [7] Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Tracking
    Ning, Guanghan
    Zhang, Zhi
    Huang, Chen
    Ren, Xiaobo
    Wang, Haohong
    Cai, Canhui
    He, Zhihai
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017, : 2311 - 2314
  • [8] Convolutional Neural Networks for Unsupervised Anomaly Detection in Text Data
    Gorokhov, Oleg
    Petrovskiy, Mikhail
    Mashechkin, Igor
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2017, 2017, 10585 : 500 - 507
  • [9] Crater Detection Using Unsupervised Algorithms and Convolutional Neural Networks
    Emami, Ebrahim
    Ahmad, Touqeer
    Bebis, George
    Nefian, Ara
    Fong, Terry
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (08): : 5373 - 5383
  • [10] Object Detection Using Deep Convolutional Neural Networks
    Qian, Huimin
    Xu, Jiawei
    Zhou, Jun
    [J]. 2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1151 - 1156