Transformer with large convolution kernel decoder network for salient object detection in optical remote sensing images

被引:2
|
作者
Dong, Pengwei [1 ]
Wang, Bo [1 ]
Cong, Runmin [2 ]
Sun, Hai-Han [3 ]
Li, Chongyi [4 ]
机构
[1] Ningxia Univ, Sch Elect & Elect Engn, Yinchuan, Peoples R China
[2] Shandong Univ, Sch Control Sci & Engn, Shandong, Peoples R China
[3] Univ Wisconsin Madison, Dept Elect & Comp Engn, Madison, WI USA
[4] Nankai Univ, Sch Comp Sci, Tianjin, Peoples R China
关键词
Salient object detection; Optical remote sensing image; Transformer; Large convolutional kernel; ATTENTION; MODEL;
D O I
10.1016/j.cviu.2023.103917
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite salient object detection in optical remote sensing images (ORSI-SOD) has made great strides in recent years, it is still a very challenging topic due to various scales and shapes of objects, cluttered backgrounds, and diverse imaging orientations. Most previous deep learning-based methods fails to effectively capture local and global features, resulting in ambiguous localization and semantic information and inaccurate detail and boundary prediction for ORSI-SOD. In this paper, we propose a novel Transformer with large convolutional kernel decoding network, named TLCKD-Net, which effectively models the long-range dependence that is indispensable for feature extraction of ORSI-SOD. First, we utilize Transformer backbone network to perceive global and local details of salient objects. Second, a large convolutional kernel decoding module based on self-attention mechanism is designed for different sizes of salient objects to extract feature information at different scales. Then, a large convolutional refinement and a Salient Feature Enhancement Module are used to recover and refine the saliency features to obtain high quality saliency maps. Extensive experiments on two public ORSI-SOD datasets show that our proposed method outperforms 16 state-of-the-art methods both qualitatively and quantitatively. In addition, a series of ablation studies demonstrate the effectiveness of different modules for ORSI-SOD. Our source code is publicly available at https://github.com/Dpw506/TLCKD-Net.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Salient Object Detection in Optical Remote Sensing Images Driven by Transformer
    Li, Gongyang
    Bai, Zhen
    Liu, Zhi
    Zhang, Xinpeng
    Ling, Haibin
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5257 - 5269
  • [2] Transformer guidance dual-stream network for salient object detection in optical remote sensing images
    Zhang, Yi
    Guo, Jichang
    Yue, Huihui
    Yin, Xiangjun
    Zheng, Sida
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (24): : 17733 - 17747
  • [3] Transformer guidance dual-stream network for salient object detection in optical remote sensing images
    Yi Zhang
    Jichang Guo
    Huihui Yue
    Xiangjun Yin
    Sida Zheng
    [J]. Neural Computing and Applications, 2023, 35 : 17733 - 17747
  • [4] Adaptive Spatial Tokenization Transformer for Salient Object Detection in Optical Remote Sensing Images
    Gao, Lina
    Liu, Bing
    Fu, Ping
    Xu, Mingzhu
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [5] Bidirectional mutual guidance transformer for salient object detection in optical remote sensing images
    Huang, Kan
    Tian, Chunwei
    Li, Ge
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (13) : 4016 - 4033
  • [6] Attention-based pyramid decoder network for salient object detection in remote sensing images
    Liu, Yu
    Lin, Jie
    Yue, Gongtao
    Shao, Zhaosheng
    Zhang, Shanwen
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (04)
  • [7] Adjacent Complementary Network for Salient Object Detection in Optical Remote Sensing Images
    Song, Dawei
    Dong, Yongsheng
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [8] Attention Guided Network for Salient Object Detection in Optical Remote Sensing Images
    Lin, Yuhan
    Sun, Han
    Liu, Ningzhong
    Bian, Yetong
    Cen, Jun
    Zhou, Huiyu
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 25 - 36
  • [9] Adaptive Dual-Stream Sparse Transformer Network for Salient Object Detection in Optical Remote Sensing Images
    Zhao, Jie
    Jia, Yun
    Ma, Lin
    Yu, Lidan
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 5173 - 5192
  • [10] ASNet: Adaptive Semantic Network Based on Transformer-CNN for Salient Object Detection in Optical Remote Sensing Images
    Yan, Ruixiang
    Yan, Longquan
    Geng, Guohua
    Cao, Yufei
    Zhou, Pengbo
    Meng, Yongle
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 16