Attentive Contexts for Object Detection

被引:179
|
作者
Li, Jianan [1 ]
Wei, Yunchao [2 ]
Liang, Xiaodan [3 ]
Dong, Jian [4 ]
Xu, Tingfa [1 ]
Feng, Jiashi [4 ]
Yan, Shuicheng [4 ]
机构
[1] Beijing Inst Technol, Sch Opt Engn, Beijing 100081, Peoples R China
[2] Beijing Jiaotong Univ, Beijing 100044, Peoples R China
[3] Sun Yat Sen Univ, Guangzhou 510006, Guangdong, Peoples R China
[4] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 119077, Singapore
关键词
Context; neural networks; object detection;
D O I
10.1109/TMM.2016.2642789
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modern deep neural network-based object detection methods typically classify candidate proposals using their interior features. However, global and local surrounding contexts that are believed to be valuable for object detection are not fully exploited by existing methods yet. In this work, we take a step towards understanding what is a robust practice to extract and utilize contextual information to facilitate object detection in practice. Specifically, we consider the following two questions: "how to identify useful global contextual information for detecting a certain object?" and "how to exploit local context surrounding a proposal for better inferring its contents?" We provide preliminary answers to these questions through developing a novel attention to context convolution neural network (AC-CNN)-based object detection model. AC-CNN effectively incorporates global and local contextual information into the region-based CNN (e.g., fast R-CNN and faster R-CNN) detection framework and provides better object detection performance. It consists of one attention-based global contextualized (AGC) subnetwork and one multi-scale local contextualized (MLC) subnetwork. To capture global context, the AGC subnetwork recurrently generates an attention map for an input image to highlight useful global contextual locations, through multiple stacked long short-term memory layers. For capturing surrounding local context, the MLC subnetwork exploits both the inside and outside contextual information of each specific proposal at multiple scales. The global and local context are then fused together for making the final decision for detection. Extensive experiments on PASCAL VOC 2007 and VOC 2012 well demonstrate the superiority of the proposed AC-CNN over well-established baselines.
引用
收藏
页码:944 / 954
页数:11
相关论文
共 50 条
  • [1] ATTENTIVE LAYER SEPARATION FOR OBJECT CLASSIFICATION AND OBJECT LOCALIZATION IN OBJECT DETECTION
    Kim, Jung Uk
    Ro, Yong Man
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3995 - 3999
  • [2] Local-Global Attentive Adaptation for Object Detection
    Zhang, Dan
    Li, Jingjing
    Li, Xingpeng
    Du, Zhekai
    Xiong, Lin
    Ye, Mao
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 100
  • [3] Cascade Attentive Dropout for Weakly Supervised Object Detection
    Wenlong Gao
    Ying Chen
    Yong Peng
    [J]. Neural Processing Letters, 2023, 55 : 6907 - 6923
  • [4] Dense Attentive Feature Enhancement for Salient Object Detection
    Li, Zun
    Lang, Congyan
    Liang, Liqian
    Zhao, Jian
    Feng, Songhe
    Hou, Qibin
    Feng, Jiashi
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8128 - 8141
  • [5] Cascade Attentive Dropout for Weakly Supervised Object Detection
    Gao, Wenlong
    Chen, Ying
    Peng, Yong
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (06) : 6907 - 6923
  • [6] Detective: An Attentive Recurrent Model for Sparse Object Detection
    Kechaou, Amine
    Martinez, Manuel
    Haurilet, Monica
    Stiefelhagen, Rainer
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5340 - 5347
  • [7] Residual attentive feature learning network for salient object detection
    Zhang, Qing
    Shi, Yanjiao
    Zhang, Xueqin
    Zhang, Liqian
    [J]. NEUROCOMPUTING, 2022, 501 : 741 - 752
  • [8] Rethinking Attentive Object Detection via Neural Attention Learning
    Ge, Chongjian
    Song, Yibing
    Ma, Chao
    Qi, Yuankai
    Luo, Ping
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1726 - 1739
  • [9] Attentive object detection using an information theoretic saliency measure
    Fritz, G
    Seifert, C
    Paletta, L
    Bischof, H
    [J]. ATTENTION AND PERFORMANCE IN COMPUTATIONAL VISION, 2005, 3368 : 29 - 41
  • [10] Attentive Feedback Network for Boundary-Aware Salient Object Detection
    Feng, Mengyang
    Lu, Huchuan
    Ding, Errui
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1623 - 1632