Attentive Contexts for Object Detection

被引:179
|
作者
Li, Jianan [1 ]
Wei, Yunchao [2 ]
Liang, Xiaodan [3 ]
Dong, Jian [4 ]
Xu, Tingfa [1 ]
Feng, Jiashi [4 ]
Yan, Shuicheng [4 ]
机构
[1] Beijing Inst Technol, Sch Opt Engn, Beijing 100081, Peoples R China
[2] Beijing Jiaotong Univ, Beijing 100044, Peoples R China
[3] Sun Yat Sen Univ, Guangzhou 510006, Guangdong, Peoples R China
[4] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 119077, Singapore
关键词
Context; neural networks; object detection;
D O I
10.1109/TMM.2016.2642789
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modern deep neural network-based object detection methods typically classify candidate proposals using their interior features. However, global and local surrounding contexts that are believed to be valuable for object detection are not fully exploited by existing methods yet. In this work, we take a step towards understanding what is a robust practice to extract and utilize contextual information to facilitate object detection in practice. Specifically, we consider the following two questions: "how to identify useful global contextual information for detecting a certain object?" and "how to exploit local context surrounding a proposal for better inferring its contents?" We provide preliminary answers to these questions through developing a novel attention to context convolution neural network (AC-CNN)-based object detection model. AC-CNN effectively incorporates global and local contextual information into the region-based CNN (e.g., fast R-CNN and faster R-CNN) detection framework and provides better object detection performance. It consists of one attention-based global contextualized (AGC) subnetwork and one multi-scale local contextualized (MLC) subnetwork. To capture global context, the AGC subnetwork recurrently generates an attention map for an input image to highlight useful global contextual locations, through multiple stacked long short-term memory layers. For capturing surrounding local context, the MLC subnetwork exploits both the inside and outside contextual information of each specific proposal at multiple scales. The global and local context are then fused together for making the final decision for detection. Extensive experiments on PASCAL VOC 2007 and VOC 2012 well demonstrate the superiority of the proposed AC-CNN over well-established baselines.
引用
收藏
页码:944 / 954
页数:11
相关论文
共 50 条
  • [41] Improving Object Detection Quality by Incorporating Global Contexts via Self-Attention
    Lee, Donghyeon
    Kim, Joonyoung
    Jung, Kyomin
    [J]. ELECTRONICS, 2021, 10 (01) : 1 - 15
  • [42] Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection
    Liu, Xianpeng
    Xue, Nan
    Wu, Tianfu
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1810 - 1818
  • [43] Pre-Attentive and Attentive Detection of Humans in Wide-Field Scenes
    J. H. Elder
    S. J. D. Prince
    Y. Hou
    M. Sizintsev
    E. Olevskiy
    [J]. International Journal of Computer Vision, 2007, 72 : 47 - 66
  • [44] Pre-attentive and attentive detection of humans in wide-field scenes
    Elder, J. H.
    Prince, S. J. D.
    Hou, Y.
    Sizintsev, M.
    Olevskiy, E.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 72 (01) : 47 - 66
  • [45] ATTENTIVE NOVELTY DETECTION IN HUMANS IS GOVERNED BY PRE-ATTENTIVE SENSORY MEMORY
    TIITINEN, H
    MAY, P
    REINIKAINEN, K
    NAATANEN, R
    [J]. NATURE, 1994, 372 (6501) : 90 - 92
  • [46] Pre-attentive change detection
    Schröger, E
    [J]. EUROPEAN JOURNAL OF NEUROSCIENCE, 1998, 10 : 271 - 271
  • [47] ATTENTIVE AND PREATTENTIVE PROCESSES IN MOTION DETECTION
    DICK, M
    ULLMAN, S
    SAGI, D
    [J]. PERCEPTION, 1987, 16 (02) : 259 - 259
  • [48] Learning transform-aware attentive network for object tracking
    Lu, Xiankai
    Ni, Bingbing
    Ma, Chao
    Yang, Xiaokang
    [J]. NEUROCOMPUTING, 2019, 349 : 133 - 144
  • [49] Geo-Contextual Priors for Attentive Urban Object Recognition
    Amlacher, Katrin
    Fritz, Gerald
    Luley, Patrick
    Almer, Alexander
    Paletta, Lucas
    [J]. ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 3015 - 3020
  • [50] Segmentation coding for object-based attentive selection systems
    Wilson, CS
    Morris, TG
    DeWeerth, SP
    [J]. ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998, : B227 - B230