Unsupervised Multi-Scale Hybrid Feature Extraction Network for Semantic Segmentation of High-Resolution Remote Sensing Images

被引:2
|
作者
Song, Wanying [1 ]
Nie, Fangxin [1 ]
Wang, Chi [1 ]
Jiang, Yinyin [1 ]
Wu, Yan [2 ]
机构
[1] Xian Univ Sci & Technol, Sch Commun & Informat Engn, Xian Key Lab Network Convergence Commun, Xian 710054, Peoples R China
[2] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
基金
中国博士后科学基金;
关键词
high-resolution remote sensing; unsupervised; semantic segmentation; global context information; fine-grained features; feature fusion;
D O I
10.3390/rs16203774
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Generating pixel-level annotations for semantic segmentation tasks of high-resolution remote sensing images is both time-consuming and labor-intensive, which has led to increased interest in unsupervised methods. Therefore, in this paper, we propose an unsupervised multi-scale hybrid feature extraction network based on the CNN-Transformer architecture, referred to as MSHFE-Net. The MSHFE-Net consists of three main modules: a Multi-Scale Pixel-Guided CNN Encoder, a Multi-Scale Aggregation Transformer Encoder, and a Parallel Attention Fusion Module. The Multi-Scale Pixel-Guided CNN Encoder is designed for multi-scale, fine-grained feature extraction in unsupervised tasks, efficiently recovering local spatial information in images. Meanwhile, the Multi-Scale Aggregation Transformer Encoder introduces a multi-scale aggregation module, which further enhances the unsupervised acquisition of multi-scale contextual information, obtaining global features with stronger feature representation. The Parallel Attention Fusion Module employs an attention mechanism to fuse global and local features in both channel and spatial dimensions in parallel, enriching the semantic relations extracted during unsupervised training and improving the performance of unsupervised semantic segmentation. K-means clustering is then performed on the fused features to achieve high-precision unsupervised semantic segmentation. Experiments with MSHFE-Net on the Potsdam and Vaihingen datasets demonstrate its effectiveness in significantly improving the accuracy of unsupervised semantic segmentation.
引用
收藏
页数:20
相关论文
共 50 条
  • [11] MFINet: Multi-Scale Feature Interaction Network for Change Detection of High-Resolution Remote Sensing Images
    Ren, Wuxu
    Wang, Zhongchen
    Xia, Min
    Lin, Haifeng
    REMOTE SENSING, 2024, 16 (07)
  • [12] MFRNet: A Multipath Feature Refinement Network for Semantic Segmentation in High-Resolution Remote Sensing Images
    Xiao, Tao
    Liu, Yikun
    Huang, Yuwen
    Yang, Gongping
    REMOTE SENSING LETTERS, 2022, 13 (12) : 1271 - 1283
  • [13] Multi-scale attention fusion network for semantic segmentation of remote sensing images
    Wen, Zhiqiang
    Huang, Hongxu
    Liu, Shuai
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (24) : 7909 - 7926
  • [14] MCNet: A Multi-scale and Cascade Network for Semantic Segmentation of Remote Sensing Images
    Zhou, Yin
    Li, Tianyi
    Li, Xianju
    Feng, Ruyi
    WEB AND BIG DATA, PT II, APWEB-WAIM 2023, 2024, 14332 : 162 - 176
  • [15] Local-enhanced multi-scale aggregation swin transformer for semantic segmentation of high-resolution remote sensing images
    Ren, Dong
    Li, Falin
    Sun, Hang
    Liu, Li
    Ren, Shun
    Yu, Mei
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (01) : 101 - 120
  • [16] MFALNet: A Multiscale Feature Aggregation Lightweight Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Lv, Liang
    Guo, Yiyou
    Bao, Tengfei
    Fu, Chenqin
    Huo, Hong
    Fang, Tao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (12) : 2172 - 2176
  • [17] Double-Branch Multi-Scale Contextual Network: A Model for Multi-Scale Street Tree Segmentation in High-Resolution Remote Sensing Images
    Zhang, Hongyang
    Liu, Shuo
    SENSORS, 2024, 24 (04)
  • [18] UNeXt: An Efficient Network for the Semantic Segmentation of High-Resolution Remote Sensing Images
    Chang, Zhanyuan
    Xu, Mingyu
    Wei, Yuwen
    Lian, Jie
    Zhang, Chongming
    Li, Chuanjiang
    SENSORS, 2024, 24 (20)
  • [19] A Deformable Attention Network for High-Resolution Remote Sensing Images Semantic Segmentation
    Zuo, Renxiang
    Zhang, Guangyun
    Zhang, Rongting
    Jia, Xiuping
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [20] Edge Guidance Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Ni, Yue
    Liu, Jiahang
    Cui, Jian
    Yang, Yuze
    Wang, Xiaozhen
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 9809 - 9822