Spatial Group-Wise Enhance: Enhancing Semantic Feature Learning in CNN

Cited by: 7
Authors
Li, Yuxuan [1]
Li, Xiang [1]
Yang, Jian [1]
Affiliation
[1] Nankai Univ, 38 Tongyan Rd, Tianjin 300350, Peoples R China
Source
Keywords
Computer vision; Backbone; Attention mechanism
DOI
10.1007/978-3-031-26348-4_19
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The success of attention modules in CNNs has attracted increasing and widespread attention over the past years. However, most existing attention modules fail to consider two important factors: (1) in images, different semantic entities are located in different areas, so they should be associated with different spatial attention masks; (2) most existing frameworks exploit either local or global information alone to guide the generation of attention masks, ignoring the joint information of local-global similarities, which can be more effective. To explore these two ingredients, we propose the Spatial Group-wise Enhance (SGE) module. SGE explicitly distributes different but accurate spatial attention masks for various semantics, guided by the local-global similarities inside each individual semantic feature group. Furthermore, SGE is lightweight, with almost no extra parameters or computation. Despite being trained with only category-level supervision, SGE is effective in highlighting multiple active areas with various high-level semantics (such as a dog's eyes, nose, etc.). When integrated with popular CNN backbones, SGE can significantly boost their performance on image recognition tasks. Specifically, with ResNet101 backbones, SGE improves the baseline by 0.7% Top-1 accuracy on ImageNet classification and by 1.6~1.8% AP on COCO detection tasks. The code and pretrained models are available at https://github.com/implus/PytorchInsight.
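The abstract's "local-global similarities inside each individual semantic feature group" can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation (see the repository linked above). The following are assumptions not stated in the abstract: the global descriptor is taken as the spatial average of a group's feature vectors, similarity is a dot product, similarities are normalized over space, and a sigmoid gate scales each position. The names `sge_group`, `sge`, `gamma`, `beta`, and `eps` are hypothetical.

```python
import math


def sge_group(features, gamma=1.0, beta=0.0, eps=1e-5):
    """Enhance ONE feature group (illustrative sketch, not the official code).

    features: list of spatial positions, each a list of that group's
    channel values. gamma/beta are an assumed per-group scale and shift.
    Returns (enhanced_features, gates).
    """
    n = len(features)
    dim = len(features[0])
    # Global descriptor: spatial average of the group's feature vectors.
    g = [sum(f[c] for f in features) / n for c in range(dim)]
    # Local-global similarity: dot product of each position with g.
    sims = [sum(gc * fc for gc, fc in zip(g, f)) for f in features]
    # Normalize similarities over space (zero mean, unit variance).
    mean = sum(sims) / n
    std = math.sqrt(sum((s - mean) ** 2 for s in sims) / n) + eps
    # Sigmoid gate per position: one spatial attention mask for this group.
    gates = [1.0 / (1.0 + math.exp(-(gamma * (s - mean) / std + beta)))
             for s in sims]
    enhanced = [[v * a for v in f] for f, a in zip(features, gates)]
    return enhanced, gates


def sge(feature_map, groups, gamma=1.0, beta=0.0):
    """Split channels into `groups` and enhance each group independently.

    feature_map: list of spatial positions, each a list of C channel values.
    """
    channels = len(feature_map[0])
    if channels % groups != 0:
        raise ValueError("channel count must be divisible by groups")
    size = channels // groups
    out = [[] for _ in feature_map]
    for k in range(groups):
        sub = [pos[k * size:(k + 1) * size] for pos in feature_map]
        enhanced, _ = sge_group(sub, gamma, beta)
        for row, e in zip(out, enhanced):
            row.extend(e)
    return out


if __name__ == "__main__":
    # Four positions, one group of two channels; position 2 differs from the rest.
    _, gates = sge_group([[1.0, 0.0], [1.0, 0.0], [0.0, 0.1], [0.9, 0.1]])
    print(gates)
```

In this sketch, positions whose features agree with the group's global descriptor receive a gate above 0.5 and dissimilar positions a gate below 0.5, so each group gets its own spatial mask while adding only two scalars per group.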
Pages: 316-332
Number of pages: 17