A Hybrid Algorithm with Swin Transformer and Convolution for Cloud Detection

Cited by: 8
Authors
Gong, Chengjuan [1, 2]
Long, Tengfei [1 ]
Yin, Ranyu [1 ]
Jiao, Weili [1 ]
Wang, Guizhou [1 ]
Affiliations
[1] Chinese Acad Sci, Aerosp Informat Res Inst AIR, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Swin transformer; cloud detection; image segmentation; attention; convolution; LANDSAT; SHADOW;
DOI
10.3390/rs15215264
Chinese Library Classification number
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
Cloud detection is critical in remote sensing image processing, and convolutional neural networks (CNNs) have significantly advanced this field. However, traditional CNNs primarily focus on extracting local features, which is a limitation for cloud detection because clouds vary widely in size, shape, and boundary. To address this limitation, we propose a hybrid Swin transformer-CNN cloud detection (STCCD) network that combines the strengths of both architectures. The STCCD network employs a novel dual-stream encoder that integrates Swin transformer and CNN blocks. Swin transformers can capture global context features more effectively than traditional CNNs, while CNNs excel at extracting local features. The two streams are fused via a fusion coupling module (FCM) to produce a richer representation of the input image. To further enhance the network's ability to extract cloud features, we incorporate a feature fusion module based on the attention mechanism (FFMAM) and an aggregation multiscale feature module (AMSFM). The FFMAM selectively merges global and local features based on their importance, while the AMSFM aggregates feature maps from different spatial scales to obtain a more comprehensive representation of the cloud mask. We evaluated the STCCD network on three challenging cloud detection datasets (GF1-WHU, SPARCS, and AIR-CD), as well as the L8-Biome dataset to assess its generalization capability. The results show that the STCCD network outperformed other state-of-the-art methods on all datasets. Notably, the STCCD model, trained on only four bands (visible and near-infrared) of the GF1-WHU dataset, outperformed the official Landsat-8 Fmask algorithm on the L8-Biome dataset, even though Fmask uses additional bands (shortwave infrared, cirrus, and thermal).
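The idea of merging global and local features "based on their importance" can be illustrated with a small, dependency-free sketch. This is not the FFMAM described in the abstract, whose internals are not given here. It is a generic per-channel softmax gate over two feature streams. The function names `global_avg_pool` and `fuse_with_attention` are hypothetical, and the pooled channel means are used directly as attention logits, whereas a real network would presumably learn that mapping.

```python
import math


def global_avg_pool(fmap):
    """Return the mean of each channel of a [C][H][W] nested list."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in fmap]


def fuse_with_attention(global_feat, local_feat):
    """Fuse two [C][H][W] feature maps with a per-channel softmax gate.

    Assumption: the pooled channel means act as attention logits.
    A trained model would replace this with learned layers.
    """
    g_desc = global_avg_pool(global_feat)
    l_desc = global_avg_pool(local_feat)
    fused, weights = [], []
    for c, (g, l) in enumerate(zip(g_desc, l_desc)):
        m = max(g, l)  # subtract the max for numerical stability
        eg, el = math.exp(g - m), math.exp(l - m)
        wg = eg / (eg + el)  # weight given to the global stream
        wl = 1.0 - wg        # weight given to the local stream
        weights.append((wg, wl))
        fused.append([
            [wg * gv + wl * lv for gv, lv in zip(g_row, l_row)]
            for g_row, l_row in zip(global_feat[c], local_feat[c])
        ])
    return fused, weights


# Toy input: one channel, 2x2 maps standing in for the two encoder streams.
swin_like = [[[2.0, 2.0], [2.0, 2.0]]]
cnn_like = [[[0.0, 0.0], [0.0, 0.0]]]
fused_map, gate = fuse_with_attention(swin_like, cnn_like)
```

In this toy case the stream with the larger pooled response receives the larger weight, so the fused map lies closer to it. The two weights per channel always sum to one.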
Pages: 26