Semantic segmentation of remote sensing images based on dilated convolution and spatial-channel attention mechanism

被引:5
|
作者
Jin, Huazhong [1 ]
Bao, Zhixi [1 ]
Chang, Xueli [1 ]
Zhang, Tingtao [2 ]
Chen, Can [1 ]
机构
[1] Hubei Univ Technol, Coll Comp, Dept Comp Sci & Technol, Wuhan, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing, Peoples R China
关键词
semantic segmentation; dilated convolution; spatial-channel attention; convolutional conditional random field;
D O I
10.1117/1.JRS.17.016518
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The rich context information and multiscale ground information in remote sensing images are crucial to improving the semantic segmentation accuracy. Therefore, we propose a remote sensing image semantic segmentation method that integrates multilevel spatial channel attention and multi-scale dilated convolution, effectively addressing the issue of poor segmentation performance of small target objects in remote sensing images. This method builds a multilevel characteristic fusion structure, combining deep-level semantic characteristics with the details of the shallow levels to generate multiscale feature diagrams. Then, we introduce the dilated convolution of the series combination in each layer of the atrous spatial pyramid pooling structure to reduce the loss of small target information. Finally, using convolutional conditional random field to describe the context information on the space and edges to improve the model's ability to extract details. We prove the effectiveness of the model on the three public datasets. From the quantitative point of view, we mainly evaluate the four indicators of the model's F1 score, overall accuracy (OA), Intersection over Union (IoU), and Mean Intersection over Union (MIoU). On GID dataset, F1 score, OA, and MIoU reach 87.27, 87.80, and 77.70, respectively, superior to most mainstream semantic segmentation networks.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Swin-CDSA: The Semantic Segmentation of Remote Sensing Images Based on Cascaded Depthwise Convolution and Spatial Attention Mechanism
    Kang, Yuhan
    Ji, Jian
    Xu, Hekai
    Yang, Yong
    Chen, Peng
    Zhao, Hui
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [2] Semantic segmentation of remote sensing images based on dual-channel attention mechanism
    Jiang, Jionghui
    Feng, Xi'an
    Huang, Hui
    IET IMAGE PROCESSING, 2024, 18 (09) : 2346 - 2356
  • [3] Semantic Segmentation With Attention Mechanism for Remote Sensing Images
    Zhao, Qi
    Liu, Jiahui
    Li, Yuewen
    Zhang, Hong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [4] Multibranch Spatial-Channel Attention for Semantic Labeling of Very High-Resolution Remote Sensing Images
    Han, Bingnan
    Yin, Jihao
    Luo, Xiaoyan
    Jia, Xiuping
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (12) : 2167 - 2171
  • [5] SCAttNet: Semantic Segmentation Network With Spatial and Channel Attention Mechanism for High-Resolution Remote Sensing Images
    Li, Haifeng
    Qiu, Kaijian
    Chen, Li
    Mei, Xiaoming
    Hong, Liang
    Tao, Chao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (05) : 905 - 909
  • [6] Remote sensing semantic segmentation with convolution neural network using attention mechanism
    Ni Xianyang
    Cheng Yinbao
    Wang Zhongyu
    PROCEEDINGS OF 2019 14TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS (ICEMI), 2019, : 608 - 613
  • [7] DILATED RESIDUAL NETWORK BASED ON DUAL EXPECTATION MAXIMIZATION ATTENTION FOR SEMANTIC SEGMENTATION OF REMOTE SENSING IMAGES
    Liu, Jiachao
    Xiong, Xinyue
    Li, Jiaojiao
    Wu, Chaoxiong
    Song, Rui
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1825 - 1828
  • [8] Generating the captions for remote sensing images: A spatial-channel attention based memory-guided transformer approach
    Gajbhiye, Gaurav O.
    Nandedkar, Abhijeet V.
    Engineering Applications of Artificial Intelligence, 2022, 114
  • [9] Generating the captions for remote sensing images: A spatial-channel attention based memory-guided transformer approach
    Gajbhiye, Gaurav O.
    Nandedkar, Abhijeet, V
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
  • [10] Gaussian Dynamic Convolution for Semantic Segmentation in Remote Sensing Images
    Feng, Mingzhe
    Sun, Xin
    Dong, Junyu
    Zhao, Haoran
    REMOTE SENSING, 2022, 14 (22)