Three-stream interaction decoder network for RGB-thermal salient object detection

被引:7
|
作者
Huo, Fushuo [1 ]
Zhu, Xuegui [1 ]
Li, Bingheng [2 ]
机构
[1] Chongqing Univ, State Key Lab Power Transmiss Equipment & Syst Sec, Chongqing 400044, Peoples R China
[2] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
关键词
Salient object detection; RGB-thermal; Multimodal fusion; Contextual information; Three-stream decoder;
D O I
10.1016/j.knosys.2022.110007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient Object Detection (SOD) has witnessed remarkable improvement during the past decade. However, RGB-based SOD methods may fail for real-world applications in some extreme environments like low-light conditions and cluttered backgrounds. Thermal (T) images can capture the heat radiation from the surface of the objects and overcome such extreme situations. Therefore, some researchers introduce the T modality to the SOD task. Existing RGB-T SOD methods fail to explicitly explore multiscale complementary saliency cues from dual modalities and lack the full explorations of individual RGB and T modalities. To deal with such problems, we propose the Three-stream Interaction Decoder Network (TIDNet) for the RGB-T SOD task. Specifically, the feature maps from the encoder branches are fed to the three-stream interaction decoder for in-depth saliency exploration, catching the single modality and multi-modality saliency cues. For single modality decoder streams, Contextual-enhanced Channel Reduction units (CCR) firstly reduce the channel dimension of feature maps from RGB and T modalities, reducing the computational burden and discriminatively enriching the multi-scale information. For the multi-modality decoder stream, Multi-scale Cross Modality Fusion (MCMF) unit is proposed to explore the complementary multi-scale information from RGB and T modalities. Then Internal and Multiple Decoder Interaction (IMDI) units further dig the specified and complementary saliency cues from the three-stream decoder. Three-stream deep supervision has been deployed on each feature level to facilitate the training strategy. Comprehensive experiments show our method outperforms fifteen state-of-the-art methods in terms of seven metrics. The codes and models are available at https://github.com/huofushuo/TIDNet. (c) 2022 Published by Elsevier B.V.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Cross-modal hierarchical interaction network for RGB-D salient object detection
    Bi, Hongbo
    Wu, Ranwan
    Liu, Ziqi
    Zhu, Huihui
    Zhang, Cong
    Xiang, Tian -Zhu
    [J]. PATTERN RECOGNITION, 2023, 136
  • [22] Cross-modality Discrepant Interaction Network for RGB-D Salient Object Detection
    Zhang, Chen
    Cong, Runmin
    Lin, Qinwei
    Ma, Lin
    Li, Feng
    Zhao, Yao
    Kwong, Sam
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2094 - 2102
  • [23] TAFNet: A Three-Stream Adaptive Fusion Network for RGB-T Crowd Counting
    Tang, Haihan
    Wang, Yi
    Chau, Lap-Pui
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 3299 - 3303
  • [24] UTDNet: A unified triplet decoder network for multimodal salient object detection
    Huo, Fushuo
    Liu, Ziming
    Guo, Jingcai
    Xu, Wenchao
    Guo, Song
    [J]. Neural Networks, 2024, 170 : 521 - 534
  • [25] UTDNet: A unified triplet decoder network for multimodal salient object detection
    Huo, Fushuo
    Liu, Ziming
    Guo, Jingcai
    Xu, Wenchao
    Guo, Song
    [J]. NEURAL NETWORKS, 2024, 170 : 521 - 534
  • [26] AirSOD: A Lightweight Network for RGB-D Salient Object Detection
    Zeng, Zhihong
    Liu, Haijun
    Chen, Fenglei
    Tan, Xiaoheng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1656 - 1669
  • [27] HIGH SPEED AND ROBUST RGB-THERMAL TRACKING VIA DUAL ATTENTIVE STREAM SIAMESE NETWORK
    Guo, Chaoyang
    Xiao, Liang
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 803 - 806
  • [28] Circular Complement Network for RGB-D Salient Object Detection
    Bai, Zhen
    Liu, Zhi
    Li, Gongyang
    Ye, Linwei
    Wang, Yang
    [J]. NEUROCOMPUTING, 2021, 451 : 95 - 106
  • [29] Bilateral Attention Network for RGB-D Salient Object Detection
    Zhang, Zhao
    Lin, Zheng
    Xu, Jun
    Jin, Wen-Da
    Lu, Shao-Ping
    Fan, Deng-Ping
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1949 - 1961
  • [30] Bilateral Attention Network for RGB-D Salient Object Detection
    Zhang, Zhao
    Lin, Zheng
    Xu, Jun
    Jin, Wen-Da
    Lu, Shao-Ping
    Fan, Deng-Ping
    [J]. IEEE Transactions on Image Processing, 2021, 30 : 1949 - 1961