Second-order Attention Guided Convolutional Activations for Visual Recognition

Cited by: 0
Authors
Chen, Shannan [1 ]
Wang, Qian [1 ]
Sun, Qiule [2 ]
Liu, Bin [3 ]
Zhang, Jianxin [1 ,4 ]
Zhang, Qiang [1 ,5 ]
Affiliations
[1] Dalian Univ, Key Lab Adv Design & Intelligent Comp, Minist Educ, Dalian, Peoples R China
[2] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian, Peoples R China
[3] Dalian Univ Technol, Int Sch Informat Sci & Engn DUT RUISE, Dalian, Peoples R China
[4] Dalian Univ Technol, Sch Comp Sci & Engn, Dalian, Peoples R China
[5] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China;
Keywords
second-order statistics; channel attention; deep convolutional networks; visual recognition;
DOI
10.1109/ICPR48806.2021.9412350
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recently, modeling deep convolutional activations with global second-order pooling has brought significant advances on visual recognition tasks. However, most existing deep second-order statistical models mainly compute second-order statistics of the activations of the last convolutional layer as image representations, and they seldom introduce second-order statistics into earlier layers to better fit the network topology, which limits their representational ability to a certain extent. Motivated by the flexibility of attention blocks, which are commonly plugged into intermediate layers of deep convolutional networks (ConvNets), this work attempts to combine deep second-order statistics with attention mechanisms in ConvNets and proposes a novel Second-order Attention Guided Network (SoAG-Net) for visual recognition. More specifically, SoAG-Net comprises several SoAG modules seamlessly inserted into intermediate layers of the network. Each SoAG module collects second-order statistics of convolutional activations by polynomial kernel approximation to predict channel-wise attention maps, which guide the learning of convolutional activations through tensor scaling along the channel dimension. SoAG improves the nonlinearity of ConvNets and enables them to fit more complicated distributions of convolutional activations. Experimental results on three commonly used datasets demonstrate that SoAG-Net outperforms its counterparts and achieves performance competitive with state-of-the-art models under the same backbone.
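The channel-attention idea summarized in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' SoAG implementation: it uses a plain channel covariance matrix in place of the polynomial kernel approximation, and the row-wise mean pooling, the weight matrix `w`, the bias `b`, and the function name `second_order_channel_attention` are all hypothetical choices made only for illustration.

```python
import numpy as np

def second_order_channel_attention(x, w, b):
    """Scale each channel of x by a gate predicted from channel covariance.

    x: (C, H, W) activations of one image.
    w: (C, C) hypothetical learned weights; b: (C,) hypothetical bias.
    """
    c, h, wd = x.shape
    feats = x.reshape(c, h * wd)                        # one row per channel
    feats = feats - feats.mean(axis=1, keepdims=True)   # center each channel
    cov = feats @ feats.T / (h * wd)                    # (C, C) second-order statistics
    desc = cov.mean(axis=1)                             # (C,) per-channel summary (assumption)
    gate = 1.0 / (1.0 + np.exp(-(w @ desc + b)))        # sigmoid gate in (0, 1)
    return x * gate[:, None, None], gate                # tensor scaling along channels

# Random stand-ins for activations and parameters.
rng = np.random.default_rng(0)
x = rng.standard_normal((8, 4, 4))
w = rng.standard_normal((8, 8)) * 0.1
b = np.zeros(8)
y, gate = second_order_channel_attention(x, w, b)
```

The output `y` has the same shape as the input, so a block of this kind could in principle sit between intermediate layers without changing the surrounding architecture, which matches the plug-in role the abstract describes.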
Pages: 3071 - 3076
Page count: 6
Related papers
50 in total
  • [1] ATTENTION-GUIDED SECOND-ORDER POOLING CONVOLUTIONAL NETWORKS
    Chen, Shannan
    Sun, Qiule
    Li, Cunhua
    Zhang, Jianxin
    Zhang, Qiang
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2230 - 2234
  • [2] Second-order convolutional networks for iris recognition
    Jia, Lingyao
    Shi, Xueyu
    Sun, Qiule
    Tang, Xingqiang
    Li, Peihua
    APPLIED INTELLIGENCE, 2022, 52 (10) : 11273 - 11287
  • [3] Second-order convolutional networks for iris recognition
    Lingyao Jia
    Xueyu Shi
    Qiule Sun
    Xingqiang Tang
    Peihua Li
    Applied Intelligence, 2022, 52 : 11273 - 11287
  • [4] SORT: Second-Order Response Transform for Visual Recognition
    Wang, Yan
    Xie, Lingxi
    Liu, Chenxi
    Qiao, Siyuan
    Zhang, Ya
    Zhang, Wenjun
    Tian, Qi
    Yuille, Alan
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1368 - 1377
  • [5] Delving into Fully Convolutional Networks Activations for Visual Recognition
    Zhang, Longfei
    Guo, Yanming
    PROCEEDINGS OF 2018 THE 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP 2018), 2018, : 99 - 104
  • [6] Dropping Activations in Convolutional Neural Networks with Visual Attention Maps
    Montoya Obeso, Abraham
    Benois-Pineau, Jenny
    Garcia Vazquez, Mireya Sarai
    Acosta, Alejandro A. Ramirez
    2019 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2019,
  • [7] Siamese network visual tracking algorithm based on second-order attention
    Hou Z.
    Chen M.
    Ma J.
    Guo F.
    Yu W.
    Ma S.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2024, 50 (03): : 739 - 747
  • [8] The role of attention in second-order motion versus letter recognition tasks
    Ho, EC
    Koch, C
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1996, 37 (03) : 2431 - 2431
  • [9] Second-order visual processing
    School of Psychology, University Park, Nottingham, United Kingdom
    Optics and Photonics News, 2001, 12 (01): : 18 - 20
  • [10] Is Second-order Information Helpful for Large-scale Visual Recognition?
    Li, Peihua
    Xie, Jiangtao
    Wang, Qilong
    Zuo, Wangmeng
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2089 - 2097