TRCDNet: A Transformer Network for Video Cloud Detection

被引:2
|
作者
Luo, Chen [1 ,2 ]
Feng, Shanshan [1 ,2 ]
Quan, Yingling [3 ]
Ye, Yunming [1 ,2 ]
Li, Xutao [1 ,2 ]
Xu, Yong [3 ]
Zhang, Baoquan [3 ]
Chen, Zhihao [3 ]
机构
[1] Harbin Inst Technol, Dept Comp Sci, Shenzhen 518055, Peoples R China
[2] Harbin Inst Technol, Shenzhen Key Lab Internet Informat Collaborat, Shenzhen, Peoples R China
[3] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518071, Peoples R China
关键词
Cloud detection on geostationary satellite images; Fengyun-4A satellites; video cloud detection; SHADOW DETECTION; ALGORITHMS; FEATURES; IMAGERY; FUSION;
D O I
10.1109/TGRS.2023.3288543
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In remote-sensing image (RSI) preprocessing steps, detecting and removing cloudy areas is a critical task. Recently, cloud detection methods based on deep neural networks achieve outstanding performance over traditional methods. Current approaches mostly focus on cloud detection on a single image captured by polar-orbiting satellites. However, there is another type of meteorological satellite-geostationary satellite, which can capture temporal consecutive frames of a particular location. Therefore, the cloud detection task targeting a geostationary satellite can be treated as a video cloud detection task. And in addition to extracting features on a single image, extracting and making full use of the relations between sequential frames is also important. To tackle this problem, we design a deep-learning video cloud detection model: transformer network for video cloud detection (TRCDNet). The proposed network is based on the encoder-decoder structure. In the encoder, the module ContextGhostLayer is proposed to encode more semantic information to tackle challenging problems like thin clouds in RSIs. Besides, we design a transformer-based video sequence transformer (VSTR) block. Based on the attention mechanism, VSTR can fully extract the across-frame relations. In the proposed decoder, the cloud masks are recovered gradually to the same scale as the input image. To evaluate the methods, we create a Video Cloud Detection dataset based on the captured videos from Fengyun 4 (FY-4) satellite: Fengyun4aCloud. Extensive experiments of current cloud detection methods, semantic segmentation methods, and video semantic segmentation (VSS) methods indicate that the designed TRCDNet achieves state-of-art performance in video cloud detection.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Video Transformer Network
    Neimark, Daniel
    Bar, Omri
    Zohar, Maya
    Asselmann, Dotan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3156 - 3165
  • [2] Cascaded Network Based on EfficientNet and Transformer for Deepfake Video Detection
    Liwei Deng
    Jiandong Wang
    Zhen Liu
    Neural Processing Letters, 2023, 55 : 7057 - 7076
  • [3] Cascaded Network Based on EfficientNet and Transformer for Deepfake Video Detection
    Deng, Liwei
    Wang, Jiandong
    Liu, Zhen
    NEURAL PROCESSING LETTERS, 2023, 55 (06) : 7057 - 7076
  • [4] CloudViT: A Lightweight Vision Transformer Network for Remote Sensing Cloud Detection
    Zhang, Bin
    Zhang, Yongjun
    Li, Yansheng
    Wan, Yi
    Yao, Yongxiang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [5] A Transformer-based network intrusion detection approach for cloud security
    Long, Zhenyue
    Yan, Huiru
    Shen, Guiquan
    Zhang, Xiaolu
    He, Haoyang
    Cheng, Long
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2024, 13 (01):
  • [6] CloudViT: A Lightweight Vision Transformer Network for Remote Sensing Cloud Detection
    Zhang, Bin
    Zhang, Yongjun
    Li, Yansheng
    Wan, Yi
    Yao, Yongxiang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [7] SCTRANS: A TRANSFORMER NETWORK BASED ON THE SPATIAL AND CHANNEL ATTENTION FOR CLOUD DETECTION
    Jiao, Wenke
    Zhang, Yongjun
    Zhang, Bin
    Wan, Yi
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 615 - 618
  • [8] A Transformer-based network intrusion detection approach for cloud security
    Zhenyue Long
    Huiru Yan
    Guiquan Shen
    Xiaolu Zhang
    Haoyang He
    Long Cheng
    Journal of Cloud Computing, 13
  • [9] AMANet: An Adaptive Memory Attention Network for video cloud detection
    Luo, Chen
    Feng, Shanshan
    Quan, YingLing
    Ye, Yunming
    Xu, Yong
    Li, Xutao
    Zhang, Baoquan
    PATTERN RECOGNITION, 2024, 155
  • [10] Video Action Transformer Network
    Girdhar, Rohit
    Carreira, Joao
    Doersch, Carl
    Zisserman, Andrew
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 244 - 253