Spatial Group-Wise Enhance: Enhancing Semantic Feature Learning in CNN

被引:7
|
作者
Li, Yuxuan [1 ]
Li, Xiang [1 ]
Yang, Jian [1 ]
机构
[1] Nankai Univ, 38 Tongyan Rd, Tianjin 300350, Peoples R China
来源
关键词
Computer vision; Backbone; Attention mechanism;
D O I
10.1007/978-3-031-26348-4_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The success of attention modules in CNN has attracted increasing and widespread attention over the past years. However, most existing attention modules fail to consider two important factors: (1) For images, different semantic entities are located in different areas, thus they should be associated with different spatial attention masks; (2) most existing framework exploits individual local or global information to guide the generation of attention masks, which ignores the joint information of local-global similarities that can be more effective. To explore these two ingredients, we propose the Spatial Group-wise Enhance (SGE) module. SGE explicitly distributes different but accurate spatial attention masks for various semantics, through the guidance of local-global similarities inside each individual semantic feature group. Furthermore, SGE is lightweight with almost no extra parameters and calculations. Despite being trained with only category supervisions, SGE is effective in highlighting multiple active areas with various high-level semantics (such as the dog's eyes, nose, etc.). When integrated with popular CNN backbones, SGE can significantly boost their performance on image recognition tasks. Specifically, based on ResNet101 backbones, SGE improves the baseline by 0.7% Top-1 accuracy on ImageNet classification and 1.6 similar to 1.8% AP on COCO detection tasks. The code and pretrained models are available at https://github.com/implus/PytorchInsight.
引用
下载
收藏
页码:316 / 332
页数:17
相关论文
共 50 条
  • [1] GROUP-WISE FEATURE SELECTION FOR SUPERVISED LEARNING
    Xiao, Qi
    Li, Hebi
    Tian, Jin
    Wang, Zhengdao
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3149 - 3153
  • [2] Group-Wise Learning for Weakly Supervised Semantic Segmentation
    Zhou, Tianfei
    Li, Liulei
    Li, Xueyi
    Feng, Chun-Mei
    Li, Jianwu
    Shao, Ling
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 799 - 811
  • [3] Lightweight Facial Expression Recognition with Spatial Group-Wise Enhance
    Liu, Jin
    Luo, Xiaoshu
    Xu, Zhaoxing
    Computer Engineering and Applications, 2023, 59 (22) : 233 - 241
  • [4] Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation
    Li, Xueyi
    Zhou, Tianfei
    Li, Jianwu
    Zhou, Yi
    Zhang, Zhaoxiang
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1984 - 1992
  • [5] Group-Wise Feature Fusion R-CNN for Dual Polarization SAR Ship Detection
    Xu, Xiaowo
    Zhang, Xiaoling
    Zeng, Tianjiao
    Shi, Jun
    Shao, Zikang
    Zhang, Tianwen
    2023 IEEE RADAR CONFERENCE, RADARCONF23, 2023,
  • [6] DENSE DOCKED SHIP DETECTION VIA SPATIAL GROUP-WISE ENHANCE ATTENTION IN SAR IMAGES
    Wang, Xiaoya
    Cui, Zongyong
    Cao, Zongjie
    Dang, Sihang
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1244 - 1247
  • [7] Group-Wise Dynamic Dropout Based on Latent Semantic Variations
    Ke, Zhiwei
    Wen, Zhiwei
    Xie, Weicheng
    Wang, Yi
    Shen, Linlin
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11229 - 11236
  • [8] Group-wise Inhibition based Feature Regularization for Robust Classification
    Liu, Haozhe
    Wu, Haoqian
    Xie, Weicheng
    Liu, Feng
    Shen, Linlin
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 468 - 476
  • [9] Group-wise Contrastive Learning for Neural Dialogue Generation
    Cai, Hengyi
    Chen, Hongshen
    Song, Yonghao
    Ding, Zhuoye
    Bao, Yongjun
    Yan, Weipeng
    Zhao, Xiaofang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 793 - 802
  • [10] Traceable Group-Wise Self-Optimizing Feature Transformation Learning: A Dual Optimization Perspective
    Xiao, Meng
    Wang, Dongjie
    Wu, Min
    Liu, Kunpeng
    Xiong, Hui
    Zhou, Yuanchun
    Fu, Yanjie
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (04)