MSEDNet: Multi-scale fusion and edge-supervised network for RGB-T salient object detection

被引:11
|
作者
Peng, Daogang [1 ]
Zhou, Weiyi [1 ]
Pan, Junzhen [1 ]
Wang, Danhao [1 ]
机构
[1] Shanghai Univ Elect Power, Coll Automat Engn, 2588 Changyang Rd, Shanghai 200090, Peoples R China
关键词
RGB-T; Salient object detection; Multi-scale fusion; Edge fusion loss; SEGMENTATION;
D O I
10.1016/j.neunet.2023.12.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RGB-T Salient object detection (SOD) is to accurately segment salient regions in both visible light images and thermal infrared images. However, most of existing methods for SOD neglects the critical complementarity between multiple modalities images, which is beneficial to further improve the detection accuracy. Therefore, this work introduces the MSEDNet RGB-T SOD method. We utilize an encoder to extract multi-level modalities features from both visible light images and thermal infrared images, which are subsequently categorized into high, medium, and low level. Additionally, we propose three separate feature fusion modules to comprehensively extract complementary information between different modalities during the fusion process. These modules are applied to specific feature levels: the Edge Dilation Sharpening module for low-level features, the Spatial and Channel-Aware module for mid-level features, and the Cross-Residual Fusion module for high-level features. Finally, we introduce an edge fusion loss function for supervised learning, which effectively extracts edge information from different modalities and suppresses background noise. Comparative demonstrate the superiority of the proposed MSEDNet over other state-of-the-art methods. The code and results can be found at the following link: https://github.com/Zhou-wy/MSEDNet.
引用
收藏
页码:410 / 422
页数:13
相关论文
共 50 条
  • [21] Unidirectional RGB-T salient object detection with intertwined driving of encoding and fusion
    Wang, Jie
    Song, Kechen
    Bao, Yanqi
    Yan, Yunhui
    Han, Yahong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
  • [22] Modality-Induced Transfer-Fusion Network for RGB-D and RGB-T Salient Object Detection
    Chen, Gang
    Shao, Feng
    Chai, Xiongli
    Chen, Hangwei
    Jiang, Qiuping
    Meng, Xiangchao
    Ho, Yo-Sung
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1787 - 1801
  • [23] Pyramid contract-based network for RGB-T salient object detection
    Ranwan Wu
    Hongbo Bi
    Cong Zhang
    Jiayuan Zhang
    Yuyu Tong
    Wei Jin
    Zhigang Liu
    Multimedia Tools and Applications, 2024, 83 : 20805 - 20825
  • [24] Wavelet-Driven Multi-Band Feature Fusion for RGB-T Salient Object Detection
    Zhao, Jianxun
    Wen, Xin
    He, Yu
    Yang, Xiaowei
    Song, Kechen
    Sensors, 2024, 24 (24)
  • [25] Cross-Modality Double Bidirectional Interaction and Fusion Network for RGB-T Salient Object Detection
    Xie, Zhengxuan
    Shao, Feng
    Chen, Gang
    Chen, Hangwei
    Jiang, Qiuping
    Meng, Xiangchao
    Ho, Yo-Sung
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 4149 - 4163
  • [26] Pyramid contract-based network for RGB-T salient object detection
    Wu, Ranwan
    Bi, Hongbo
    Zhang, Cong
    Zhang, Jiayuan
    Tong, Yuyu
    Jin, Wei
    Liu, Zhigang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 20805 - 20825
  • [27] Interactive context-aware network for RGB-T salient object detection
    Wang, Yuxuan
    Dong, Feng
    Zhu, Jinchao
    Chen, Jianren
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (28) : 72153 - 72174
  • [28] WaveNet: Wavelet Network With Knowledge Distillation for RGB-T Salient Object Detection
    Zhou, Wujie
    Sun, Fan
    Jiang, Qiuping
    Cong, Runmin
    Hwang, Jenq-Neng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3027 - 3039
  • [29] Weakly-supervised salient object detection with the multi-scale progressive network
    Liu X.
    Guo J.
    Zheng S.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2023, 50 (01): : 48 - 57
  • [30] UMINet: a unified multi-modality interaction network for RGB-D and RGB-T salient object detection
    Gao, Lina
    Fu, Ping
    Xu, Mingzhu
    Wang, Tiantian
    Liu, Bing
    VISUAL COMPUTER, 2024, 40 (03): : 1565 - 1582