Cross-modal collaborative propagation for RGB-T saliency detection

被引：2

作者：

Yu, Xiaosheng ^{[1
]}

Pang, Yu ^{[2
]}

Chi, Jianning ^{[1
]}

Qi, Qi ^{[3
]}

机构：

[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China

[2] Shenyang Univ Technol, Sch Artificial Intelligence, Shenyang 110870, Peoples R China

[3] Liaoning Prov Party Comm, Party Sch, Dept Decis Consulting, Shenyang 110004, Peoples R China

来源：

VISUAL COMPUTER | 2024年 / 40卷 / 06期

关键词：

Saliency detection; Collaborative learning; Propagation mechanism; Deep features optimization; Multi-modal integration; IMAGE;

D O I：

10.1007/s00371-023-03085-5

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Recently, RGB-T saliency detection becomes gradually a hot topic due to the fact that RGB-T multi-modal data could overcome the limitation of conventional RGB data in some cases. However, existing RGB-T saliency detection methods usually fail to take both advantages of two modalities and cannot boost performance effectively. Therefore, we achieve RGB-T saliency detection via a novel method, namely cross-modal collaborative propagation (CMCP), which contains a novel saliency propagation mechanism and a novel cross-modal collaborative learning framework relied on the proposed propagation mechanism. More specifically, we firstly propose a novel saliency propagation method and then, respectively, regard two modalities as inputs to generate RGB-induced and thermal-induced propagation mechanisms. To bridge RGB-T modalities, a novel cross-modal collaborative learning framework between RGB-induced and thermal-induced propagation mechanisms is devised to optimize, respectively, two propagation results. In other words, two modalities constantly extract supervision information to help the opposite side to refine propagation result until attaining a stable state. Finally, we integrate two modalities-induced propagation results into a refined saliency map. We compare our model with the state-of-the-art RGB-T and RGB saliency detection algorithms on three benchmark datasets, and experimental results show that the proposed CMCP achieves the significant improvement.

引用

页码：4337 / 4354

页数：18

共 50 条

[21] Complementarity-aware cross-modal feature fusion network for RGB-T semantic segmentation
Wu, Wei
Chu, Tao
Liu, Qiong
PATTERN RECOGNITION, 2022, 131
[22] CMPNet: A cross-modal multi-scale perception network for RGB-T crowd counting
Zhang, Shihui
Chen, Kun
Zhai, Gangzheng
Li, He
Han, Shaojie
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 164
[23] Saliency Prototype for RGB-D and RGB-T Salient Object Detection
Zhang, Zihao
Wang, Jie
Han, Yahong
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3696 - 3705
[24] Vehicle Detection Based on Adaptive Multimodal Feature Fusion and Cross-Modal Vehicle Index Using RGB-T Images
Wu, Yuanfeng
Guan, Xinran
Zhao, Boya
Ni, Li
Huang, Min
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 8166 - 8177
[25] CACFNet: Cross-Modal Attention Cascaded Fusion Network for RGB-T Urban Scene Parsing
Zhou, Wujie
Dong, Shaohua
Fang, Meixin
Yu, Lu
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1919 - 1929
[26] RGB-T Saliency Detection via Low-Rank Tensor Learning and Unified Collaborative Ranking
Huang, Liming
Song, Kechen
Gong, Aojun
Liu, Chuang
Yan, Yunhui
IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1585 - 1589
[27] RGB-D Saliency Detection based on Cross-Modal and Multi-scale Feature Fusion
Zhu, Xuxing
Wu, Jin
Zhu, Lei
2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 6154 - 6160
[28] Cross-Modal Adaptation for RGB-D Detection
Hoffman, Judy
Gupta, Saurabh
Leong, Jian
Guadarrama, Sergio
Darrell, Trevor
2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 5032 - 5039
[29] Cross-modal Collaborative Manifold Propagation for Image Recommendation
Jian, Meng
Jia, Ting
Yang, Xun
Wu, Lifang
Huo, Lina
ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 344 - 348
[30] Learning Static-Adaptive Graphs for RGB-T Image Saliency Detection
Xu, Zhengmei
Tang, Jin
Zhou, Aiwu
Liu, Huaming
INFORMATION, 2022, 13 (02)

← 1 2 3 4 5 →