Cross-modal collaborative propagation for RGB-T saliency detection

被引:2
|
作者
Yu, Xiaosheng [1 ]
Pang, Yu [2 ]
Chi, Jianning [1 ]
Qi, Qi [3 ]
机构
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China
[2] Shenyang Univ Technol, Sch Artificial Intelligence, Shenyang 110870, Peoples R China
[3] Liaoning Prov Party Comm, Party Sch, Dept Decis Consulting, Shenyang 110004, Peoples R China
来源
VISUAL COMPUTER | 2024年 / 40卷 / 06期
关键词
Saliency detection; Collaborative learning; Propagation mechanism; Deep features optimization; Multi-modal integration; IMAGE;
D O I
10.1007/s00371-023-03085-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently, RGB-T saliency detection becomes gradually a hot topic due to the fact that RGB-T multi-modal data could overcome the limitation of conventional RGB data in some cases. However, existing RGB-T saliency detection methods usually fail to take both advantages of two modalities and cannot boost performance effectively. Therefore, we achieve RGB-T saliency detection via a novel method, namely cross-modal collaborative propagation (CMCP), which contains a novel saliency propagation mechanism and a novel cross-modal collaborative learning framework relied on the proposed propagation mechanism. More specifically, we firstly propose a novel saliency propagation method and then, respectively, regard two modalities as inputs to generate RGB-induced and thermal-induced propagation mechanisms. To bridge RGB-T modalities, a novel cross-modal collaborative learning framework between RGB-induced and thermal-induced propagation mechanisms is devised to optimize, respectively, two propagation results. In other words, two modalities constantly extract supervision information to help the opposite side to refine propagation result until attaining a stable state. Finally, we integrate two modalities-induced propagation results into a refined saliency map. We compare our model with the state-of-the-art RGB-T and RGB saliency detection algorithms on three benchmark datasets, and experimental results show that the proposed CMCP achieves the significant improvement.
引用
收藏
页码:4337 / 4354
页数:18
相关论文
共 50 条
  • [31] Modal complementary fusion network for RGB-T salient object detection
    Ma, Shuai
    Song, Kechen
    Dong, Hongwen
    Tian, Hongkun
    Yan, Yunhui
    APPLIED INTELLIGENCE, 2023, 53 (08) : 9038 - 9055
  • [32] Modal complementary fusion network for RGB-T salient object detection
    Shuai Ma
    Kechen Song
    Hongwen Dong
    Hongkun Tian
    Yunhui Yan
    Applied Intelligence, 2023, 53 : 9038 - 9055
  • [33] RGB-D Saliency Detection Based on Attention Mechanism and Multi-Scale Cross-Modal Fusion
    Cui Z.
    Feng Z.
    Wang F.
    Liu Q.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (06): : 893 - 902
  • [34] Learning Multiscale Deep Features and SVM Regressors for Adaptive RGB-T Saliency Detection
    Ma, Yunpeng
    Sun, Dengdi
    Meng, Qianqian
    Ding, Zhuanlian
    Li, Chenglong
    2017 10TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL. 1, 2017, : 389 - 392
  • [35] C4Net: Excavating Cross-Modal Context- and Content-Complementarity for RGB-T Semantic Segmentation
    Zhao, Shenlu
    Li, Jingyi
    Zhang, Qiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1347 - 1361
  • [36] CSA-Net: Cross-modal scale-aware attention-aggregated network for RGB-T crowd counting
    Li, He
    Zhang, Junge
    Kong, Weihang
    Shen, Jienan
    Shao, Yuguang
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [37] RGB-D Saliency Detection with 3D Cross-modal Fusion and Mid-level Integration
    Liu, Taoqi
    Li, Bo
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1328 - 1335
  • [38] Multi-modal adapter for RGB-T tracking
    Wang, He
    Xu, Tianyang
    Tang, Zhangyong
    Wu, Xiao-Jun
    Kittler, Josef
    INFORMATION FUSION, 2025, 118
  • [39] Cross-Modal Collaborative Communications
    Zhou, Liang
    Wu, Dan
    Chen, Jianxin
    Wei, Xin
    IEEE WIRELESS COMMUNICATIONS, 2020, 27 (02) : 112 - 117
  • [40] Cross-modal feature extraction and integration based RGBD saliency detection
    Pan, Liang
    Zhou, Xiaofei
    Shi, Ran
    Zhang, Jiyong
    Yan, Chenggang
    IMAGE AND VISION COMPUTING, 2020, 101