Second-order Attention Guided Convolutional Activations for Visual Recognition

Cited by: 0
Authors
Chen, Shannan [1]
Wang, Qian [1]
Sun, Qiule [2]
Liu, Bin [3]
Zhang, Jianxin [1, 4]
Zhang, Qiang [1, 5]
Affiliations
[1] Dalian Univ, Key Lab Adv Design & Intelligent Comp, Minist Educ, Dalian, Peoples R China
[2] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian, Peoples R China
[3] Dalian Univ Technol, Int Sch Informat Sci & Engn DUT RUISE, Dalian, Peoples R China
[4] Dalian Univ Technol, Sch Comp Sci & Engn, Dalian, Peoples R China
[5] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
second-order statistics; channel attention; deep convolutional networks; visual recognition
DOI
10.1109/ICPR48806.2021.9412350
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently, modeling deep convolutional activations with global second-order pooling has brought significant advances on visual recognition tasks. However, most existing deep second-order statistical models compute second-order statistics only from the activations of the last convolutional layer as image representations, and they seldom introduce second-order statistics into earlier layers to better fit the network topology, which limits their representational ability to some extent. Motivated by the flexibility of attention blocks, which are commonly plugged into intermediate layers of deep convolutional networks (ConvNets), this work attempts to combine deep second-order statistics with attention mechanisms in ConvNets and proposes a novel Second-order Attention Guided Network (SoAG-Net) for visual recognition. More specifically, SoAG-Net involves several SoAG modules seamlessly inserted into intermediate layers of the network. Each SoAG module collects second-order statistics of convolutional activations by polynomial kernel approximation to predict channel-wise attention maps, which then guide the learning of convolutional activations through tensor scaling along the channel dimension. SoAG improves the nonlinearity of ConvNets and enables them to fit more complicated distributions of convolutional activations. Experimental results on three commonly used datasets show that SoAG-Net outperforms its counterparts and achieves performance competitive with state-of-the-art models under the same backbone.
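The mechanism outlined in the abstract (second-order statistics of activations, then channel-wise attention, then scaling along the channel dimension) can be illustrated with a small sketch. This is not the authors' SoAG module. It computes an explicit channel covariance instead of the polynomial kernel approximation named in the abstract, and it replaces the learned attention predictor with a fixed row-mean followed by a sigmoid. The function names `channel_covariance`, `second_order_attention`, and `rescale`, and the toy input, are hypothetical and chosen only for illustration.

```python
import math


def channel_covariance(x):
    """Return the C x C covariance matrix of a feature map.

    x is a list of C channels, each a flat list of N spatial activations.
    """
    n = len(x[0])
    means = [sum(ch) / n for ch in x]
    centered = [[v - m for v in ch] for ch, m in zip(x, means)]
    return [
        [sum(a * b for a, b in zip(ci, cj)) / n for cj in centered]
        for ci in centered
    ]


def second_order_attention(x):
    """Map second-order statistics to one weight per channel.

    Assumption: each covariance row is summarized by its mean and squashed
    with a sigmoid. A trained module would learn this mapping instead.
    """
    cov = channel_covariance(x)
    c = len(cov)
    return [1.0 / (1.0 + math.exp(-sum(row) / c)) for row in cov]


def rescale(x, weights):
    """Scale every activation of a channel by that channel's weight."""
    return [[w * v for v in ch] for ch, w in zip(x, weights)]


# Toy feature map: 3 channels, a 2x2 spatial grid flattened to 4 positions.
features = [
    [1.0, 2.0, 3.0, 4.0],
    [2.0, 4.0, 6.0, 8.0],
    [1.0, 1.0, 1.0, 1.0],
]
weights = second_order_attention(features)
guided = rescale(features, weights)
```

In this toy input the third channel is constant, so its covariance row is zero and its weight is exactly 0.5, while the two correlated channels receive larger weights. The output keeps the input's shape, which is what allows a block of this kind to be placed between intermediate layers.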
Pages: 3071-3076
Page count: 6