Transformer with large convolution kernel decoder network for salient object detection in optical remote sensing images

Cited by: 2
Authors
Dong, Pengwei [1]
Wang, Bo [1]
Cong, Runmin [2]
Sun, Hai-Han [3]
Li, Chongyi [4]
Affiliations
[1] Ningxia Univ, Sch Elect & Elect Engn, Yinchuan, Peoples R China
[2] Shandong Univ, Sch Control Sci & Engn, Shandong, Peoples R China
[3] Univ Wisconsin Madison, Dept Elect & Comp Engn, Madison, WI USA
[4] Nankai Univ, Sch Comp Sci, Tianjin, Peoples R China
Keywords
Salient object detection; Optical remote sensing image; Transformer; Large convolutional kernel; ATTENTION; MODEL;
DOI
10.1016/j.cviu.2023.103917
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Although salient object detection in optical remote sensing images (ORSI-SOD) has made great strides in recent years, it remains a very challenging topic because of the varied scales and shapes of objects, cluttered backgrounds, and diverse imaging orientations. Most previous deep learning-based methods fail to capture local and global features effectively, resulting in ambiguous localization and semantic information and in inaccurate detail and boundary prediction for ORSI-SOD. In this paper, we propose a novel Transformer with large convolutional kernel decoding network, named TLCKD-Net, which effectively models the long-range dependence that is indispensable for feature extraction in ORSI-SOD. First, we use a Transformer backbone network to perceive global and local details of salient objects. Second, a large convolutional kernel decoding module based on a self-attention mechanism is designed to extract feature information at different scales for salient objects of different sizes. Then, a large convolutional refinement module and a Salient Feature Enhancement Module are used to recover and refine the saliency features to obtain high-quality saliency maps. Extensive experiments on two public ORSI-SOD datasets show that our proposed method outperforms 16 state-of-the-art methods both qualitatively and quantitatively. In addition, a series of ablation studies demonstrates the effectiveness of the different modules for ORSI-SOD. Our source code is publicly available at https://github.com/Dpw506/TLCKD-Net.
Pages: 12
Related papers
50 in total
  • [31] Nested Network With Two-Stream Pyramid for Salient Object Detection in Optical Remote Sensing Images
    Li, Chongyi
    Cong, Runmin
    Hou, Junhui
    Zhang, Sanyi
    Qian, Yue
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (11) : 9156 - 9166
  • [32] X-shape Feature Expansion Network for Salient Object Detection in Optical Remote Sensing Images
    Huang, Lisu
    Sun, Minghui
    Liang, Yanhua
    Qin, Guihe
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 246 - 258
  • [33] A parallel down-up fusion network for salient object detection in optical remote sensing images
    Li, Chongyi
    Cong, Runmin
    Guo, Chunle
    Li, Hua
    Zhang, Chunjie
    Zheng, Feng
    Zhao, Yao
    [J]. NEUROCOMPUTING, 2020, 415 : 411 - 420
  • [34] Iterative Saliency Aggregation and Assignment Network for Efficient Salient Object Detection in Optical Remote Sensing Images
    Yao, Zhaojian
    Gao, Wei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [35] Semantic-Guided Attention Refinement Network for Salient Object Detection in Optical Remote Sensing Images
    Huang, Zhou
    Chen, Huaixin
    Liu, Biyuan
    Wang, Zhixi
    [J]. REMOTE SENSING, 2021, 13 (11)
  • [36] Multilevel Interactive Reverse-Guided Network for Salient Object Detection in Optical Remote Sensing Images
    Zhao, Jie
    Jia, Yun
    Ma, Lin
    Yu, Lidan
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 12983 - 12999
  • [37] Edge-Guided Recurrent Positioning Network for Salient Object Detection in Optical Remote Sensing Images
    Zhou, Xiaofei
    Shen, Kunye
    Weng, Li
    Cong, Runmin
    Zheng, Bolun
    Zhang, Jiyong
    Yan, Chenggang
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (01) : 539 - 552
  • [38] Remote Sensing Object Detection Based on Convolution and Swin Transformer
    Jiang, Xuzhao
    Wu, Yonghong
    [J]. IEEE ACCESS, 2023, 11 : 38643 - 38656
  • [39] Large kernel convolution application for land cover change detection of remote sensing images
    Huang, Junqing
    Yuan, Xiaochen
    Lam, Chan-Tong
    Ke, Wei
    Huang, Guoheng
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 132
  • [40] RRNet: Relational Reasoning Network With Parallel Multiscale Attention for Salient Object Detection in Optical Remote Sensing Images
    Cong, Runmin
    Zhang, Yumo
    Fang, Leyuan
    Li, Jun
    Zhao, Yao
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60