Remote Sensing Image Road Segmentation Method Integrating CNN-Transformer and UNet

被引:1
|
作者
Wang, Rui [1 ]
Cai, Mingxiang [1 ]
Xia, Zixuan [2 ]
Zhou, Zhicui [3 ]
机构
[1] China Transport Telecommun & Informat Ctr, Beijing 100011, Peoples R China
[2] Heilongjiang Univ Technol, Harbin 150022, Heilongjiang, Peoples R China
[3] No 1 Middle Sch Weifang, Jixi 150022, Heilongjiang, Peoples R China
关键词
Road segmentation; deep learning; CNN-transformer; attention; UNet; EXTRACTION;
D O I
10.1109/ACCESS.2023.3344797
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-time and accurate road information is crucial for updating electronic navigation maps. To address the problem of low precision and poor robustness in current semantic segmentation methods for road extraction from remote sensing imagery, we proposed a UNet road semantic segmentation model based on attention mechanism improvement. First, we introduce a CNN-Transformer hybrid structure to the encoder to enhance the feature extraction capabilities of global and local details. Second, the traditional upsampling module in the decoder is replaced with a dual upsampling module to improve feature extraction capabilities and segmentation accuracy. Furthermore, the hard-swish activation function is used instead of ReLU activation function to smooth the curve, which helps to improve the generalization and non-linear feature extraction abilities and avoid gradient vanishing. Finally, a comprehensive loss function combining cross entropy and dice is used to strengthen the segmentation result constraints and further improve segmentation accuracy. Experimental validation is performed on the Ottawa Road Dataset and the Massachusetts Road Dataset. Experimental results show that compared with U-Net, PSPNet, DeepLab V3 and TransUNet networks, this algorithm is the best in terms of MIoU, MPA and F1 score. Among them, on the Ottawa road data set, the MPA of this algorithm reached 95.48%. On the Massachusetts road data set, MPA is 92.56%. This method shows good performance in road extraction.
引用
收藏
页码:144446 / 144455
页数:10
相关论文
共 50 条
  • [21] HCTNet: A hybrid CNN-transformer network for breast ultrasound image segmentation
    He, Qiqi
    Yang, Qiuju
    Xie, Minghao
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 155
  • [22] Multiscale Fusion CNN-Transformer Network for High-Resolution Remote Sensing Image Change Detection
    Jiang, Ming
    Chen, Yimin
    Dong, Zhe
    Liu, Xiaoping
    Zhang, Xinchang
    Zhang, Honghui
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 5280 - 5293
  • [23] DBCT-Net:A dual branch hybrid CNN-transformer network for remote sensing image fusion
    Wang, Quanli
    Jin, Xin
    Jiang, Qian
    Wu, Liwen
    Zhang, Yunchun
    Zhou, Wei
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 233
  • [24] Remote Sensing Image Classification Method Based on Fusion of CNN and Transformer
    Jin Chuan
    Tong Changqing
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (20)
  • [25] MFTransNet: A Multi-Modal Fusion with CNN-Transformer Network for Semantic Segmentation of HSR Remote Sensing Images
    He, Shumeng
    Yang, Houqun
    Zhang, Xiaoying
    Li, Xuanyu
    MATHEMATICS, 2023, 11 (03)
  • [26] Hyperspectral Image Compression Sensing Network With CNN-Transformer Mixture Architectures
    Zhang, Lei
    Zhang, Longsheng
    Song, Chengpeng
    Zhang, Peng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [27] CNN-transformer dual branch collaborative model for semantic segmentation of high-resolution remote sensing images
    Zhu, Xiaotong
    Peng, Taile
    Guo, Jia
    Wang, Hao
    Cao, Taotao
    Photogrammetric Record, 2025, 40 (189):
  • [28] An Efficient Hybrid CNN-Transformer Approach for Remote Sensing Super-Resolution
    Zhang, Wenjian
    Tan, Zheng
    Lv, Qunbo
    Li, Jiaao
    Zhu, Baoyu
    Liu, Yangyang
    REMOTE SENSING, 2024, 16 (05)
  • [29] Alternate encoder and dual decoder CNN-Transformer networks for medical image segmentation
    Lin Zhang
    Xinyu Guo
    Hongkun Sun
    Weigang Wang
    Liwei Yao
    Scientific Reports, 15 (1)
  • [30] HTC-Net: A hybrid CNN-transformer framework for medical image segmentation
    Tang, Hui
    Chen, Yuanbin
    Wang, Tao
    Zhou, Yuanbo
    Zhao, Longxuan
    Gao, Qinquan
    Du, Min
    Tan, Tao
    Zhang, Xinlin
    Tong, Tong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88