Attentive Contexts for Object Detection

Cited by: 179

Authors:

Li, Jianan [1]

Wei, Yunchao [2]

Liang, Xiaodan [3]

Dong, Jian [4]

Xu, Tingfa [1]

Feng, Jiashi [4]

Yan, Shuicheng [4]

Affiliations:

[1] Beijing Inst Technol, Sch Opt Engn, Beijing 100081, Peoples R China

[2] Beijing Jiaotong Univ, Beijing 100044, Peoples R China

[3] Sun Yat Sen Univ, Guangzhou 510006, Guangdong, Peoples R China

[4] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 119077, Singapore

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2017 / Volume 19 / Issue 05

Keywords:

Context; neural networks; object detection

DOI:

10.1109/TMM.2016.2642789

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Modern deep neural network-based object detection methods typically classify candidate proposals using their interior features. However, the global and local surrounding contexts that are believed to be valuable for object detection are not yet fully exploited by existing methods. In this work, we take a step towards understanding what constitutes a robust practice for extracting and utilizing contextual information to facilitate object detection. Specifically, we consider the following two questions: "how to identify useful global contextual information for detecting a certain object?" and "how to exploit local context surrounding a proposal for better inferring its contents?" We provide preliminary answers to these questions by developing a novel attention to context convolution neural network (AC-CNN)-based object detection model. AC-CNN incorporates global and local contextual information into the region-based CNN (e.g., Fast R-CNN and Faster R-CNN) detection framework and improves object detection performance. It consists of one attention-based global contextualized (AGC) subnetwork and one multi-scale local contextualized (MLC) subnetwork. To capture global context, the AGC subnetwork recurrently generates an attention map for an input image through multiple stacked long short-term memory layers, highlighting useful global contextual locations. To capture surrounding local context, the MLC subnetwork exploits both the inside and outside contextual information of each specific proposal at multiple scales. The global and local contexts are then fused to make the final detection decision. Extensive experiments on PASCAL VOC 2007 and VOC 2012 demonstrate that the proposed AC-CNN outperforms well-established baselines.
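The two context mechanisms named in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: it assumes that the attention map acts as softmax weights over feature-grid locations, and that multi-scale local context is obtained by rescaling a proposal box around its center. The function names `attention_pool`, `scale_box`, and `multi_scale_boxes`, and the scale factors used in the example, are hypothetical and do not come from the paper.

```python
import math


def attention_pool(features, scores):
    """Softmax-normalize per-location scores, then return the
    attention-weighted sum of the feature vectors and the weights.
    (Assumed form of attention; the abstract does not specify it.)"""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(features[0])
    pooled = [sum(w * f[d] for w, f in zip(weights, features))
              for d in range(dim)]
    return pooled, weights


def scale_box(box, factor, image_w, image_h):
    """Rescale an (x1, y1, x2, y2) box around its center by `factor`
    and clip the result to the image bounds."""
    x1, y1, x2, y2 = box
    cx, cy = (x1 + x2) / 2.0, (y1 + y2) / 2.0
    half_w = (x2 - x1) * factor / 2.0
    half_h = (y2 - y1) * factor / 2.0
    return (max(0.0, cx - half_w), max(0.0, cy - half_h),
            min(float(image_w), cx + half_w),
            min(float(image_h), cy + half_h))


def multi_scale_boxes(box, factors, image_w, image_h):
    """Return one rescaled box per factor: factors below 1 look inside
    the proposal, factors above 1 take in its surroundings."""
    return [scale_box(box, f, image_w, image_h) for f in factors]


# Illustrative values only; the factors are assumptions, not the paper's.
context_boxes = multi_scale_boxes((40, 40, 60, 60), [0.5, 1.0, 2.0], 100, 100)
global_feature, attention = attention_pool([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])
```

In a detector of this kind, features pooled from each rescaled box and the attention-pooled global feature would be combined before classification; how AC-CNN performs that fusion is not described in the abstract, so it is left out here.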

Pages: 944-954

Page count: 11

