A Hybrid Algorithm with Swin Transformer and Convolution for Cloud Detection

Cited by: 8
Authors
Gong, Chengjuan [1, 2]
Long, Tengfei [1 ]
Yin, Ranyu [1 ]
Jiao, Weili [1 ]
Wang, Guizhou [1 ]
Affiliations
[1] Chinese Academy of Sciences, Aerospace Information Research Institute (AIR), Beijing 100094, People's Republic of China
[2] University of Chinese Academy of Sciences, Beijing 100049, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Swin transformer; cloud detection; image segmentation; attention; convolution; LANDSAT; SHADOW;
DOI
10.3390/rs15215264
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Cloud detection is critical in remote sensing image processing, and convolutional neural networks (CNNs) have significantly advanced this field. However, traditional CNNs primarily extract local features, which limits their performance on cloud detection because clouds vary widely in size, shape, and boundary. To address this limitation, we propose a hybrid Swin transformer-CNN cloud detection (STCCD) network that combines the strengths of both architectures. The STCCD network employs a novel dual-stream encoder that integrates Swin transformer and CNN blocks. Swin transformers capture global context features more effectively than traditional CNNs, while CNNs excel at extracting local features. The two streams are fused via a fusion coupling module (FCM) to produce a richer representation of the input image. To further enhance the network's ability to extract cloud features, we incorporate a feature fusion module based on the attention mechanism (FFMAM) and an aggregation multiscale feature module (AMSFM). The FFMAM selectively merges global and local features based on their importance, while the AMSFM aggregates feature maps from different spatial scales to obtain a more comprehensive representation of the cloud mask. We evaluated the STCCD network on three challenging cloud detection datasets (GF1-WHU, SPARCS, and AIR-CD), as well as on the L8-Biome dataset to assess its generalization capability. The results show that the STCCD network outperformed other state-of-the-art methods on all datasets. Notably, the STCCD model, trained on only four bands (visible and near-infrared) of the GF1-WHU dataset, outperformed on the L8-Biome dataset the official Landsat-8 Fmask algorithm, which uses additional bands (shortwave infrared, cirrus, and thermal).
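The abstract says that global and local features are merged "based on their importance". The sketch below illustrates that general idea only; it is not the paper's FFMAM, whose layer design is not given in this record. It assumes two feature maps of identical shape (channels, height, width), derives a per-channel importance score by global average pooling, and uses a softmax across the two streams as the fusion weights. The function name `channel_attention_fusion` and the absence of learned parameters are assumptions made for illustration; a trained network would typically pass the pooled descriptors through learned layers first.

```python
import numpy as np


def channel_attention_fusion(global_feat, local_feat):
    """Fuse two (C, H, W) feature maps with per-channel softmax weights.

    Illustrative only: importance is taken to be the channel-wise mean
    activation, with no learned parameters.
    """
    if global_feat.shape != local_feat.shape:
        raise ValueError("feature maps must share the same shape")

    # Global average pooling: one descriptor per channel for each stream.
    g_desc = global_feat.mean(axis=(1, 2))
    l_desc = local_feat.mean(axis=(1, 2))

    # Softmax across the two streams, so weights sum to 1 for every channel.
    logits = np.stack([g_desc, l_desc])                  # shape (2, C)
    logits = logits - logits.max(axis=0, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    weights = weights / weights.sum(axis=0, keepdims=True)

    # Broadcast the per-channel weights over the spatial dimensions.
    w_global = weights[0][:, None, None]
    w_local = weights[1][:, None, None]
    fused = w_global * global_feat + w_local * local_feat
    return fused, weights


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    g = rng.normal(size=(4, 8, 8))  # stand-in for transformer-stream features
    l = rng.normal(size=(4, 8, 8))  # stand-in for CNN-stream features
    fused, weights = channel_attention_fusion(g, l)
    print(fused.shape, weights.shape)
```

Because the weights form a convex combination per channel, the fused map always lies between the two inputs element-wise, and identical inputs are returned unchanged.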
Pages: 26
Related papers
50 items in total
  • [31] Hybrid Transformer and Convolution for Image Compressed Sensing
    Nan, Ruili
    Sun, Guiling
    Zheng, Bowen
    Zhang, Pengchen
    [J]. ELECTRONICS, 2024, 13 (17)
  • [32] Hybrid Transformer and Convolution for Medical Image Segmentation
    Wang, Fan
    Wang, Bo
    [J]. 2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 156 - 159
  • [33] A hybrid convolution transformer for hyperspectral image classification
    Arshad, Tahir
    Zhang, Junping
    Ullah, Inam
    [J]. EUROPEAN JOURNAL OF REMOTE SENSING, 2024,
  • [34] Random Swin Transformer
    Choi, Keong-Hun
    Ha, Jong-Eun
    [J]. 2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 1611 - 1614
  • [35] Deep learning-based bubble detection with swin transformer
    Uesawa, Shinichiro
    Yoshida, Hiroyuki
    [J]. JOURNAL OF NUCLEAR SCIENCE AND TECHNOLOGY, 2024, 61 (11) : 1438 - 1452
  • [36] Swin transformer based vehicle detection in undisciplined traffic environment
    Deshmukh, Prashant
    Satyanarayana, G. S. R.
    Majhi, Sudhan
    Sahoo, Upendra Kumar
    Das, Santos Kumar
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [37] SwinSOD: Salient object detection using swin-transformer
    Wu, Shuang
    Zhang, Guangjian
    Liu, Xuefeng
    [J]. IMAGE AND VISION COMPUTING, 2024, 146
  • [38] A Swin Transformer-Based Approach for Motorcycle Helmet Detection
    Bouhayane, Ayyoub
    Charouh, Zakaria
    Ghogho, Mounir
    Guennoun, Zouhair
    [J]. IEEE ACCESS, 2023, 11 : 74410 - 74419
  • [39] Cas-VSwin transformer: A variant swin transformer for surface-defect detection
    Gao, Linfeng
    Zhang, Jianxun
    Yang, Changhui
    Zhou, Yuechuan
    [J]. COMPUTERS IN INDUSTRY, 2022, 140
  • [40] One-Stage Detection Model Based on Swin Transformer
    Kim, Tae Yang
    Niaz, Asim
    Choi, Jung Sik
    Choi, Kwang Nam
    [J]. IEEE ACCESS, 2024, 12 : 60960 - 60972