Efficient Context-Guided Stacked Refinement Network for RGB-T Salient Object Detection

被引:71
|
作者
Huo, Fushuo [1 ]
Zhu, Xuegui [1 ]
Zhang, Lei [2 ]
Liu, Qifeng [1 ]
Shu, Yu [1 ]
机构
[1] Chongqing Univ, State Key Lab Power Transmiss Equipment & Syst Se, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Sch Microelect & Commun Engn, Chongqing 400044, Peoples R China
关键词
Feature extraction; Task analysis; Fuses; Object detection; Image segmentation; Semantics; Lighting; Salient object detection; RGB-T; multi-modality; information fusion; FUSION;
D O I
10.1109/TCSVT.2021.3102268
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
RGB-T salient object detection (SOD) aims at utilizing the complementary cues of RGB and Thermal (T) modalities to detect and segment the common objects. However, on one hand, existing methods simply fuse the features of two modalities without fully considering the characters of RGB and T. On the other hand, the high computational cost of existing methods prevents them from real-world applications (e.g., automatic driving, abnormal detection, person re-ID). To this end, we proposed an efficient encoder-decoder network named Context-guided Stacked Refinement Network (CSRNet). Specifically, we utilize a lightweight backbone and design efficient decoder parts, which greatly reduce the computational cost. To fuse RGB and T modalities, we proposed an efficient Context-guided Cross Modality Fusion (CCMF) module to filter the noise and explore the complementation of two modalities. Besides, Stacked Refinement Network (SRN) progressively refines the features from top to down via the interaction of semantic and spatial information. Extensive experiments show that our method performs favorably against state-of-the-art algorithms on RGB-T SOD task while with small model size (4.6M), few FLOPs (4.2G), and real-time speed (38 fps). Our codes is available at: https://github.com/huofushuo/CSRNet.
引用
下载
收藏
页码:3111 / 3124
页数:14
相关论文
共 50 条
  • [21] Saliency Prototype for RGB-D and RGB-T Salient Object Detection
    Zhang, Zihao
    Wang, Jie
    Han, Yahong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3696 - 3705
  • [22] Multi-enhanced Adaptive Attention Network for RGB-T Salient Object Detection
    Hao, Hao-Zhou
    Cheng, Yao
    Ji, Yi
    Li, Ying
    Liu, Chun-Ping
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [23] ECFFNet: Effective and Consistent Feature Fusion Network for RGB-T Salient Object Detection
    Zhou, Wujie
    Guo, Qinling
    Lei, Jingsheng
    Yu, Lu
    Hwang, Jenq-Neng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 1224 - 1235
  • [24] Asymmetric cross-modal activation network for RGB-T salient object detection
    Xu, Chang
    Li, Qingwu
    Zhou, Qingkai
    Jiang, Xiongbiao
    Yu, Dabing
    Zhou, Yaqin
    KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [25] TSFNet: Two-Stage Fusion Network for RGB-T Salient Object Detection
    Guo, Qinling
    Zhou, Wujie
    Lei, Jingsheng
    Yu, Lu
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1655 - 1659
  • [26] EAF-Net: an enhancement and aggregation–feedback network for RGB-T salient object detection
    Haiyang He
    Jing Wang
    Xiaolin Li
    Minglin Hong
    Shiguo Huang
    Tao Zhou
    Machine Vision and Applications, 2022, 33
  • [27] Feature differences reduction and specific features preserving network for RGB-T salient object detection
    Xu, Qiqi
    Di, Zhenguang
    Dong, Haoyu
    Yang, Gang
    Image and Vision Computing, 2024, 152
  • [28] Thermal images-aware guided early fusion network for cross-illumination RGB-T salient object detection
    Wang, Han
    Song, Kechen
    Huang, Liming
    Wen, Hongwei
    Yan, Yunhui
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 118
  • [29] Does Thermal Really Always Matter for RGB-T Salient Object Detection?
    Cong, Runmin
    Zhang, Kepu
    Zhang, Chen
    Zheng, Feng
    Zhao, Yao
    Huang, Qingming
    Kwong, Sam
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6971 - 6982
  • [30] Modality-Induced Transfer-Fusion Network for RGB-D and RGB-T Salient Object Detection
    Chen, Gang
    Shao, Feng
    Chai, Xiongli
    Chen, Hangwei
    Jiang, Qiuping
    Meng, Xiangchao
    Ho, Yo-Sung
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1787 - 1801