Adaptive Local Cross-Channel Vector Pooling Attention Module for Semantic Segmentation of Remote Sensing Imagery

被引:8
|
作者
Wang, Xiaofeng [1 ]
Kang, Menglei [1 ]
Chen, Yan [2 ]
Jiang, Wenxiang [2 ]
Wang, Mengyuan [2 ]
Weise, Thomas [2 ]
Tan, Ming [2 ]
Xu, Lixiang [1 ]
Li, Xinlu [1 ]
Zou, Le [1 ]
Zhang, Chen [1 ]
机构
[1] Hefei Univ, Sch Artificial Intelligence & Big Data, Dept Big Data & Informat Engn, Hefei 230601, Peoples R China
[2] Hefei Univ, Inst Appl Optimizat, Sch Artificial Intelligence & Big Data, Hefei 230601, Peoples R China
基金
中国国家自然科学基金;
关键词
adaptive local cross-channel interaction; vector average pooling; attention mechanism; remote sensing imagery; semantic segmentation; deep learning; NETWORK; CLASSIFICATION; FUSION;
D O I
10.3390/rs15081980
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Adding an attention module to the deep convolution semantic segmentation network has significantly enhanced the network performance. However, the existing channel attention module focusing on the channel dimension neglects the spatial relationship, causing location noise to transmit to the decoder. In addition, the spatial attention module exemplified by self-attention has a high training cost and challenges in execution efficiency, making it unsuitable to handle large-scale remote sensing data. We propose an efficient vector pooling attention (VPA) module for building the channel and spatial location relationship. The module can locate spatial information better by performing a unique vector average pooling in the vertical and horizontal dimensions of the feature maps. Furthermore, it can also learn the weights directly by using the adaptive local cross-channel interaction. Multiple weight learning ablation studies and comparison experiments with the classical attention modules were conducted by connecting the VPA module to a modified DeepLabV3 network using ResNet50 as the encoder. The results show that the mIoU of our network with the addition of an adaptive local cross-channel interaction VPA module increases by 3% compared to the standard network on the MO-CSSSD. The VPA-based semantic segmentation network can significantly improve precision efficiency compared with other conventional attention networks. Furthermore, the results on the WHU Building dataset present an improvement in IoU and F1-score by 1.69% and 0.97%, respectively. Our network raises the mIoU by 1.24% on the ISPRS Vaihingen dataset. The VPA module can also significantly improve the network's performance on small target segmentation.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] AANet: Adaptive Attention Networks for Semantic Segmentation of High-Resolution Remote Sensing Imagery
    Chen, Yan
    Zhang, Qianchuan
    Wang, Xiaofeng
    Dong, Quan
    Kang, Menglei
    Jiang, Wenxiang
    Wang, Mengyuan
    Xu, Lixiang
    Zhang, Chen
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 14640 - 14655
  • [2] SPANet: Successive Pooling Attention Network for Semantic Segmentation of Remote Sensing Images
    Sun, Le
    Cheng, Shiwei
    Zheng, Yuhui
    Wu, Zebin
    Zhang, Jianwei
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 4045 - 4057
  • [3] Learning Cross-Channel Representations for Semantic Segmentation
    Ma, Lingfeng
    Xie, Hongtao
    Liu, Chuanbin
    Zhang, Yongdong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2774 - 2787
  • [4] Channel selection and local attention transformer model for semantic segmentation on UAV remote sensing scene
    Liu, Da
    Long, Hao
    Liu, Zhenbao
    IET IMAGE PROCESSING, 2025, 19 (01)
  • [5] DEANet: Dual Encoder with Attention Network for Semantic Segmentation of Remote Sensing Imagery
    Wei, Haoran
    Xu, Xiangyang
    Ou, Ni
    Zhang, Xinru
    Dai, Yaping
    REMOTE SENSING, 2021, 13 (19)
  • [6] LANet: Local Attention Embedding to Improve the Semantic Segmentation of Remote Sensing Images
    Ding, Lei
    Tang, Hao
    Bruzzone, Lorenzo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (01): : 426 - 435
  • [7] DIFFUSION MODELS FOR REMOTE SENSING IMAGERY SEMANTIC SEGMENTATION
    Ayala, C.
    Sesma, R.
    Aranda, C.
    Galar, M.
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5654 - 5657
  • [8] Semantic Segmentation of Marine Remote Sensing Based on a Cross Direction Attention Mechanism
    Gao, Hao
    Cao, Lin
    Yu, Dingfeng
    Xiong, Xuejun
    Cao, Maoyong
    IEEE ACCESS, 2020, 8 : 142483 - 142494
  • [9] Boundary Loss for Remote Sensing Imagery Semantic Segmentation
    Bokhovkin, Alexey
    Burnaev, Evgeny
    ADVANCES IN NEURAL NETWORKS - ISNN 2019, PT II, 2019, 11555 : 388 - 401
  • [10] An Efficient Semantic Segmentation Method for Remote-Sensing Imagery Using Improved Coordinate Attention
    Huo, Yan
    Gang, Shuang
    Dong, Liang
    Guan, Chao
    APPLIED SCIENCES-BASEL, 2024, 14 (10):