Cross-modal collaborative propagation for RGB-T saliency detection

Cited by: 2
Authors
Yu, Xiaosheng [1]
Pang, Yu [2]
Chi, Jianning [1]
Qi, Qi [3]
Affiliations
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China
[2] Shenyang Univ Technol, Sch Artificial Intelligence, Shenyang 110870, Peoples R China
[3] Liaoning Prov Party Comm, Party Sch, Dept Decis Consulting, Shenyang 110004, Peoples R China
Source
VISUAL COMPUTER | 2024, Vol. 40, Issue 06
Keywords
Saliency detection; Collaborative learning; Propagation mechanism; Deep features optimization; Multi-modal integration; IMAGE;
DOI
10.1007/s00371-023-03085-5
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
Recently, RGB-T saliency detection has gradually become a hot topic because RGB-T multi-modal data can overcome the limitations of conventional RGB data in some cases. However, existing RGB-T saliency detection methods usually fail to exploit the advantages of both modalities and therefore cannot boost performance effectively. We therefore address RGB-T saliency detection with a novel method, cross-modal collaborative propagation (CMCP), which comprises a novel saliency propagation mechanism and a novel cross-modal collaborative learning framework that relies on the proposed propagation mechanism. More specifically, we first propose a novel saliency propagation method and then take each of the two modalities as input to generate RGB-induced and thermal-induced propagation mechanisms, respectively. To bridge the RGB and thermal modalities, a novel cross-modal collaborative learning framework between the RGB-induced and thermal-induced propagation mechanisms is devised to optimize the two propagation results. In other words, the two modalities repeatedly extract supervision information to help each other refine the propagation results until a stable state is reached. Finally, we integrate the two modality-induced propagation results into a refined saliency map. We compare our model with state-of-the-art RGB-T and RGB saliency detection algorithms on three benchmark datasets, and experimental results show that the proposed CMCP achieves a significant improvement.
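The abstract gives no formulas, so the following is only a toy illustration of the general idea it describes: two modality-specific propagations that feed each other supervision until they stabilize, followed by fusion. It uses generic graph label propagation, which is not the CMCP formulation. The function names, the mixing weights `alpha` and `beta`, the stopping tolerance, and the averaging fusion are all assumptions made for this sketch.

```python
# Toy sketch, NOT the paper's method: generic label propagation on a graph,
# run once per modality, with each modality's seed partly replaced by the
# other modality's current result. All names and constants are hypothetical.

def row_normalize(w):
    """Scale each row of an affinity matrix so that it sums to 1."""
    out = []
    for row in w:
        total = sum(row)
        out.append([v / total if total else 0.0 for v in row])
    return out

def propagate(w, seed, alpha=0.5, steps=50):
    """Iterate s <- alpha * W s + (1 - alpha) * seed for a fixed number of steps."""
    s = list(seed)
    for _ in range(steps):
        s = [alpha * sum(wij * sj for wij, sj in zip(row, s)) + (1 - alpha) * y
             for row, y in zip(w, seed)]
    return s

def mix(own, other, beta):
    """Blend a modality's own initial seed with the other modality's result."""
    return [beta * a + (1 - beta) * b for a, b in zip(own, other)]

def co_propagate(w_rgb, w_t, seed_rgb, seed_t, beta=0.5, tol=1e-6, max_rounds=100):
    """Alternate mutual refinement until the results stop changing, then fuse."""
    w_rgb, w_t = row_normalize(w_rgb), row_normalize(w_t)
    s_rgb, s_t = propagate(w_rgb, seed_rgb), propagate(w_t, seed_t)
    for _ in range(max_rounds):
        # Each modality is re-propagated with supervision from the other one.
        new_rgb = propagate(w_rgb, mix(seed_rgb, s_t, beta))
        new_t = propagate(w_t, mix(seed_t, s_rgb, beta))
        change = max(abs(a - b) for a, b in zip(new_rgb + new_t, s_rgb + s_t))
        s_rgb, s_t = new_rgb, new_t
        if change < tol:  # treated here as the "stable state"
            break
    # Fusion by simple averaging (an assumption; the abstract does not specify it).
    return [(a + b) / 2 for a, b in zip(s_rgb, s_t)]

if __name__ == "__main__":
    # Four regions in a chain; both modalities mark the left end as salient.
    chain = [[0, 1, 0, 0], [1, 0, 1, 0], [0, 1, 0, 1], [0, 0, 1, 0]]
    print(co_propagate(chain, chain, [1, 0, 0, 0], [1, 1, 0, 0]))
```

Keeping a share `beta` of each modality's own initial seed in every round is a design choice of this sketch: it makes the alternating update a contraction, so the loop settles on a non-trivial map instead of drifting toward a uniform one.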
Pages: 4337-4354
Page count: 18