Spatially Invariant Unsupervised Object Detection with Convolutional Neural Networks

被引:0
|
作者
Crawford, Eric [1 ]
Pineau, Joelle [2 ]
机构
[1] McGill Univ, Mila, Montreal, PQ, Canada
[2] Facebook AI Res, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are many reasons to expect an ability to reason in terms of objects to be a crucial skill for any generally intelligent agent. Indeed, recent machine learning literature is replete with examples of the benefits of object-like representations: generalization, transfer to new tasks, and interpretability, among others. However, in order to reason in terms of objects, agents need a way of discovering and detecting objects in the visual world - a task which we call unsupervised object detection. This task has received significantly less attention in the literature than its supervised counterpart, especially in the case of large images containing many objects. In the current work, we develop a neural network architecture that effectively addresses this large-image, many-object setting. In particular, we combine ideas from Attend, Infer, Repeat (AIR), which performs unsupervised object detection but does not scale well, with recent developments in supervised object detection. We replace AIR's core recurrent network with a convolutional (and thus spatially invariant) network, and make use of an object-specification scheme that describes the location of objects with respect to local grid cells rather than the image as a whole. Through a series of experiments, we demonstrate a number of features of our architecture: that, unlike AIR, it is able to discover and detect objects in large, many-object scenes; that it has a significant ability to generalize to images that are larger and contain more objects than images encountered during training; and that it is able to discover and detect objects with enough accuracy to facilitate non-trivial downstream processing.
引用
收藏
页码:3412 / 3420
页数:9
相关论文
共 50 条
  • [21] Differential Geometry boosts Convolutional Neural Networks for Object Detection
    Wang, Chu
    Siddiqi, Kaleem
    [J]. PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 1006 - 1013
  • [22] PROVABLE TRANSLATIONAL ROBUSTNESS FOR OBJECT DETECTION WITH CONVOLUTIONAL NEURAL NETWORKS
    Vierling, Axel
    James, Charu
    Berns, Karsten
    Katsaouni, Nikoletta
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 694 - 698
  • [23] Object Detection Using Convolutional Neural Networks: A Comprehensive Review
    Issaoui, Hanen
    ElAdel, Asma
    Zaied, Mourad
    [J]. 2024 IEEE 27TH INTERNATIONAL SYMPOSIUM ON REAL-TIME DISTRIBUTED COMPUTING, ISORC 2024, 2024,
  • [24] Object Detection from Video Tubelets with Convolutional Neural Networks
    Kang, Kai
    Ouyang, Wanli
    Li, Hongsheng
    Wang, Xiaogang
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 817 - 825
  • [25] Moving object detection and tracking Using Convolutional Neural Networks
    Mane, Shraddha
    Mangale, Supriya
    [J]. PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1809 - 1813
  • [26] Simultaneous Object Detection and Localization using Convolutional Neural Networks
    Zahra Ouadiay, Fatima
    Bouftaih, Hamza
    Bouyakhf, El Houssine
    Majid Himmi, M.
    [J]. 2018 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV2018), 2018,
  • [27] Convolutional Neural Networks for Segmentation and Object Detection of Human Semen
    Nissen, Malte S.
    Krause, Oswin
    Almstrup, Kristian
    Kjaerulff, Soren
    Nielsen, Torben T.
    Nielsen, Mads
    [J]. IMAGE ANALYSIS, SCIA 2017, PT I, 2017, 10269 : 397 - 406
  • [28] Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images
    Cheng, Gong
    Zhou, Peicheng
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12): : 7405 - 7415
  • [29] Object Detection In Infrared Images Using Convolutional Neural Networks
    Rao, P. Srinivasa
    Rani, Sushma N.
    Badal, Tapas
    Guptha, Suneeth Kumar
    [J]. JOURNAL OF INFORMATION ASSURANCE AND SECURITY, 2020, 15 (03): : 136 - 143
  • [30] UNSUPERVISED BODY PART REGRESSION VIA SPATIALLY SELF-ORDERING CONVOLUTIONAL NEURAL NETWORKS
    Yan, Ke
    Lu, Le
    Summers, Ronald M.
    [J]. 2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 1022 - 1025