Category-Aware Saliency Enhance Learning Based on CLIP for Weakly Supervised Salient Object Detection

被引：0

作者：

Yunde Zhang

Zhili Zhang

Tianshan Liu

Jun Kong

机构：

[1] Jiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education)

[2] Anhui University,School of Computer Science and Technology

[3] The Hong Kong Polytechnic University,Department of Electronic and Information Engineering

来源：

Neural Processing Letters | / 56卷

关键词：

Weakly supervised; Salient object detection; Category-aware Saliency Enhance Learning; CLIP;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Weakly supervised salient object detection (SOD) using image-level category labels has been proposed to reduce the annotation cost of pixel-level labels. However, existing methods mostly train a classification network to generate a class activation map, which suffers from coarse localization and difficult pseudo-label updating. To address these issues, we propose a novel Category-aware Saliency Enhance Learning (CSEL) method based on contrastive vision-language pre-training (CLIP), which can perform image-text classification and pseudo-label updating simultaneously. Our proposed method transforms image-text classification into pixel-text matching and generates a category-aware saliency map, which is evaluated by the classification accuracy. Moreover, CSEL assesses the quality of the category-aware saliency map and the pseudo saliency map, and uses the quality confidence scores as weights to update the pseudo labels. The two maps mutually enhance each other to guide the pseudo saliency map in the correct direction. Our SOD network can be trained jointly under the supervision of the updated pseudo saliency maps. We test our model on various well-known RGB-D and RGB SOD datasets. Our model achieves an S-measure of 87.6%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} on the RGB-D NLPR dataset and 84.3%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} on the RGB ECSSD dataset. Additionally, we obtain satisfactory performance on the weakly supervised E-measure, F-measure, and mean absolute error metrics for other datasets. These results demonstrate the effectiveness of our model.

引用

共 50 条

[31] Weakly Supervised Salient Object Detection by Learning A Classifier-Driven Map Generator
Hsu, Kuang-Jui
Lin, Yen-Yu
Chuang, Yung-Yu
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (11) : 5435 - 5449
[32] Cross-frame feature-saliency mutual reinforcing for weakly supervised video salient object detection
Wang, Jian
Yu, Siyue
Zhang, Bingfeng
Zhao, Xinqiao
Garcia-Fernandez, Angel F.
Lim, Eng Gee
Xiao, Jimin
PATTERN RECOGNITION, 2024, 150
[33] Weakly-Supervised Salient Object Detection on Light Fields
Liang, Zijian
Wang, Pengjie
Xu, Ke
Zhang, Pingping
Lau, Rynson W. H.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6295 - 6305
[34] A complementary dual model for weakly supervised salient object detection
Chen, Liyuan
Zhang, Dawei
Wang, Xiao
Wan, Chang
Jin, Shan
Zheng, Zhonglong
PATTERN RECOGNITION, 2025, 163
[35] Weakly Supervised Salient Object Detection Using Image Labels
Li, Guanbin
Xie, Yuan
Lin, Liang
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7024 - 7031
[36] Weakly Supervised Salient Object Detection by Hierarchically Enhanced Scribbles
Wang, Xiongying
Al-Huda, Zaid
Peng, Bo
Tang, Xin
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (02)
[37] Scribble-based boundary-aware network for weakly supervised salient object detection in remote sensing images
Huang, Zhou
Xiang, Tian-Zhu
Chen, Huai-Xin
Dai, Hang
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 191 : 290 - 301
[38] Weakly supervised salient object detection via double object proposals guidance
Zhou, Zhiheng
Guo, Yongfan
Dai, Ming
Huang, Junchu
Li, Xiangwei
IET IMAGE PROCESSING, 2021, 15 (09) : 1957 - 1970
[39] Category-Aware Transformer Network for Better Human-Object Interaction Detection
Dong, Leizhen
Li, Zhimin
Xu, Kunlun
Zhang, Zhijun
Yan, Luxin
Zhong, Sheng
Zou, Xu
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19516 - 19525
[40] Salient Contour-Aware Based Twice Learning Strategy for Saliency Detection
Zhu, Chunbiao
Yan, Wei
Liu, Shan
Li, Thomas
Li, Ge
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2541 - 2548

← 1 2 3 4 5 →