Cross-modal collaborative propagation for RGB-T saliency detection

被引:2
|
作者
Yu, Xiaosheng [1 ]
Pang, Yu [2 ]
Chi, Jianning [1 ]
Qi, Qi [3 ]
机构
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China
[2] Shenyang Univ Technol, Sch Artificial Intelligence, Shenyang 110870, Peoples R China
[3] Liaoning Prov Party Comm, Party Sch, Dept Decis Consulting, Shenyang 110004, Peoples R China
来源
VISUAL COMPUTER | 2024年 / 40卷 / 06期
关键词
Saliency detection; Collaborative learning; Propagation mechanism; Deep features optimization; Multi-modal integration; IMAGE;
D O I
10.1007/s00371-023-03085-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently, RGB-T saliency detection becomes gradually a hot topic due to the fact that RGB-T multi-modal data could overcome the limitation of conventional RGB data in some cases. However, existing RGB-T saliency detection methods usually fail to take both advantages of two modalities and cannot boost performance effectively. Therefore, we achieve RGB-T saliency detection via a novel method, namely cross-modal collaborative propagation (CMCP), which contains a novel saliency propagation mechanism and a novel cross-modal collaborative learning framework relied on the proposed propagation mechanism. More specifically, we firstly propose a novel saliency propagation method and then, respectively, regard two modalities as inputs to generate RGB-induced and thermal-induced propagation mechanisms. To bridge RGB-T modalities, a novel cross-modal collaborative learning framework between RGB-induced and thermal-induced propagation mechanisms is devised to optimize, respectively, two propagation results. In other words, two modalities constantly extract supervision information to help the opposite side to refine propagation result until attaining a stable state. Finally, we integrate two modalities-induced propagation results into a refined saliency map. We compare our model with the state-of-the-art RGB-T and RGB saliency detection algorithms on three benchmark datasets, and experimental results show that the proposed CMCP achieves the significant improvement.
引用
收藏
页码:4337 / 4354
页数:18
相关论文
共 50 条
  • [1] Cross-modal co-feedback cellular automata for RGB-T saliency detection
    Pang, Yu
    Wu, Hao
    Wu, Chengdong
    PATTERN RECOGNITION, 2023, 135
  • [2] Cross-Modal Pattern-Propagation for RGB-T Tracking
    Wang, Chaoqun
    Xu, Chunyan
    Cui, Zhen
    Zhou, Ling
    Zhang, Tong
    Zhang, Xiaoya
    Yang, Jian
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 7062 - 7071
  • [3] Learning cross-modal interaction for RGB-T tracking
    Xu, Chunyan
    Cui, Zhen
    Wang, Chaoqun
    Zhou, Chuanwei
    Yang, Jian
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (01)
  • [4] Learning cross-modal interaction for RGB-T tracking
    Chunyan XU
    Zhen CUI
    Chaoqun WANG
    Chuanwei ZHOU
    Jian YANG
    Science China(Information Sciences), 2023, 66 (01) : 320 - 321
  • [5] Learning cross-modal interaction for RGB-T tracking
    Chunyan Xu
    Zhen Cui
    Chaoqun Wang
    Chuanwei Zhou
    Jian Yang
    Science China Information Sciences, 2023, 66
  • [6] Asymmetric cross-modal activation network for RGB-T salient object detection
    Xu, Chang
    Li, Qingwu
    Zhou, Qingkai
    Jiang, Xiongbiao
    Yu, Dabing
    Zhou, Yaqin
    KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [7] RGB-T Saliency Detection Based on Multiscale Modal Reasoning Interaction
    Wu, Yunhe
    Jia, Tong
    Chang, Xingya
    Wang, Hao
    Chen, Dongyue
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [8] Effectiveness Guided Cross-Modal Information Sharing for Aligned RGB-T Object Detection
    An, Zijia
    Liu, Chunlei
    Han, Yuqi
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2562 - 2566
  • [9] RGB-T Image Saliency Detection via Collaborative Graph Learning
    Tu, Zhengzheng
    Xia, Tian
    Li, Chenglong
    Wang, Xiaoxiao
    Ma, Yan
    Tang, Jin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (01) : 160 - 173
  • [10] Fast RGB-T Tracking via Cross-Modal Correlation Filters
    Zhai, Sulan
    Shao, Pengpeng
    Liang, Xinyan
    Wang, Xin
    NEUROCOMPUTING, 2019, 334 : 172 - 181