Cross-modal collaborative propagation for RGB-T saliency detection

被引:2
|
作者
Yu, Xiaosheng [1 ]
Pang, Yu [2 ]
Chi, Jianning [1 ]
Qi, Qi [3 ]
机构
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China
[2] Shenyang Univ Technol, Sch Artificial Intelligence, Shenyang 110870, Peoples R China
[3] Liaoning Prov Party Comm, Party Sch, Dept Decis Consulting, Shenyang 110004, Peoples R China
来源
VISUAL COMPUTER | 2024年 / 40卷 / 06期
关键词
Saliency detection; Collaborative learning; Propagation mechanism; Deep features optimization; Multi-modal integration; IMAGE;
D O I
10.1007/s00371-023-03085-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently, RGB-T saliency detection becomes gradually a hot topic due to the fact that RGB-T multi-modal data could overcome the limitation of conventional RGB data in some cases. However, existing RGB-T saliency detection methods usually fail to take both advantages of two modalities and cannot boost performance effectively. Therefore, we achieve RGB-T saliency detection via a novel method, namely cross-modal collaborative propagation (CMCP), which contains a novel saliency propagation mechanism and a novel cross-modal collaborative learning framework relied on the proposed propagation mechanism. More specifically, we firstly propose a novel saliency propagation method and then, respectively, regard two modalities as inputs to generate RGB-induced and thermal-induced propagation mechanisms. To bridge RGB-T modalities, a novel cross-modal collaborative learning framework between RGB-induced and thermal-induced propagation mechanisms is devised to optimize, respectively, two propagation results. In other words, two modalities constantly extract supervision information to help the opposite side to refine propagation result until attaining a stable state. Finally, we integrate two modalities-induced propagation results into a refined saliency map. We compare our model with the state-of-the-art RGB-T and RGB saliency detection algorithms on three benchmark datasets, and experimental results show that the proposed CMCP achieves the significant improvement.
引用
收藏
页码:4337 / 4354
页数:18
相关论文
共 50 条
  • [21] Complementarity-aware cross-modal feature fusion network for RGB-T semantic segmentation
    Wu, Wei
    Chu, Tao
    Liu, Qiong
    PATTERN RECOGNITION, 2022, 131
  • [22] CMPNet: A cross-modal multi-scale perception network for RGB-T crowd counting
    Zhang, Shihui
    Chen, Kun
    Zhai, Gangzheng
    Li, He
    Han, Shaojie
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 164
  • [23] Saliency Prototype for RGB-D and RGB-T Salient Object Detection
    Zhang, Zihao
    Wang, Jie
    Han, Yahong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3696 - 3705
  • [24] Vehicle Detection Based on Adaptive Multimodal Feature Fusion and Cross-Modal Vehicle Index Using RGB-T Images
    Wu, Yuanfeng
    Guan, Xinran
    Zhao, Boya
    Ni, Li
    Huang, Min
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 8166 - 8177
  • [25] CACFNet: Cross-Modal Attention Cascaded Fusion Network for RGB-T Urban Scene Parsing
    Zhou, Wujie
    Dong, Shaohua
    Fang, Meixin
    Yu, Lu
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1919 - 1929
  • [26] RGB-T Saliency Detection via Low-Rank Tensor Learning and Unified Collaborative Ranking
    Huang, Liming
    Song, Kechen
    Gong, Aojun
    Liu, Chuang
    Yan, Yunhui
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1585 - 1589
  • [27] RGB-D Saliency Detection based on Cross-Modal and Multi-scale Feature Fusion
    Zhu, Xuxing
    Wu, Jin
    Zhu, Lei
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 6154 - 6160
  • [28] Cross-Modal Adaptation for RGB-D Detection
    Hoffman, Judy
    Gupta, Saurabh
    Leong, Jian
    Guadarrama, Sergio
    Darrell, Trevor
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 5032 - 5039
  • [29] Cross-modal Collaborative Manifold Propagation for Image Recommendation
    Jian, Meng
    Jia, Ting
    Yang, Xun
    Wu, Lifang
    Huo, Lina
    ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 344 - 348
  • [30] Learning Static-Adaptive Graphs for RGB-T Image Saliency Detection
    Xu, Zhengmei
    Tang, Jin
    Zhou, Aiwu
    Liu, Huaming
    INFORMATION, 2022, 13 (02)