Category-Aware Saliency Enhance Learning Based on CLIP for Weakly Supervised Salient Object Detection

Cited by: 0
Authors
Yunde Zhang
Zhili Zhang
Tianshan Liu
Jun Kong
Affiliations
[1] Jiangnan University, Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education)
[2] Anhui University, School of Computer Science and Technology
[3] The Hong Kong Polytechnic University, Department of Electronic and Information Engineering
Source
Keywords
Weakly supervised; Salient object detection; Category-aware Saliency Enhance Learning; CLIP
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Weakly supervised salient object detection (SOD) using image-level category labels has been proposed to reduce the annotation cost of pixel-level labels. However, most existing methods train a classification network to generate a class activation map, which suffers from coarse localization and makes pseudo-label updating difficult. To address these issues, we propose a novel Category-aware Saliency Enhance Learning (CSEL) method based on contrastive vision-language pre-training (CLIP), which performs image-text classification and pseudo-label updating simultaneously. The proposed method transforms image-text classification into pixel-text matching and generates a category-aware saliency map, which is evaluated by classification accuracy. Moreover, CSEL assesses the quality of the category-aware saliency map and of the pseudo saliency map, and uses the quality confidence scores as weights to update the pseudo labels. The two maps mutually enhance each other, guiding the pseudo saliency map in the correct direction. Our SOD network can be trained jointly under the supervision of the updated pseudo saliency maps. We test our model on various well-known RGB-D and RGB SOD datasets. Our model achieves an S-measure of 87.6% on the RGB-D NLPR dataset and 84.3% on the RGB ECSSD dataset. Additionally, among weakly supervised methods, it obtains satisfactory E-measure, F-measure, and mean absolute error results on other datasets. These results demonstrate the effectiveness of our model.
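The two steps named in the abstract, pixel-text matching and confidence-weighted pseudo-label updating, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the softmax over categories, the temperature value, and the simple normalized weighting are all assumptions made for illustration, and the abstract does not say how the quality confidence scores are computed, so they are taken here as given inputs. Plain Python lists stand in for feature tensors.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-8)


def pixel_text_matching(pixel_feats, text_feats, label_idx, temperature=0.07):
    """Hypothetical pixel-text matching.

    pixel_feats: H x W x D nested lists of per-pixel image embeddings.
    text_feats:  K x D list of text embeddings, one per category.
    label_idx:   index of the image-level category label.
    Returns an H x W map holding, for each pixel, the softmax probability
    of the labelled category (an assumed form of a category-aware map).
    """
    saliency = []
    for row in pixel_feats:
        out_row = []
        for feat in row:
            logits = [cosine(feat, t) / temperature for t in text_feats]
            peak = max(logits)  # subtract the max for numerical stability
            exps = [math.exp(x - peak) for x in logits]
            out_row.append(exps[label_idx] / sum(exps))
        saliency.append(out_row)
    return saliency


def update_pseudo_label(category_map, pseudo_map, conf_category, conf_pseudo):
    """Hypothetical update: blend two maps using their confidence scores.

    The scores are normalized to sum to one and used as blending weights.
    """
    total = conf_category + conf_pseudo
    if total <= 0:
        raise ValueError("confidence scores must sum to a positive value")
    w = conf_category / total
    return [
        [w * c + (1.0 - w) * p for c, p in zip(c_row, p_row)]
        for c_row, p_row in zip(category_map, pseudo_map)
    ]


# Toy example: a 1 x 2 "image" with 2-D embeddings and two categories.
text_feats = [[1.0, 0.0], [0.0, 1.0]]
pixel_feats = [[[1.0, 0.0], [0.0, 1.0]]]
category_map = pixel_text_matching(pixel_feats, text_feats, label_idx=0)
updated = update_pseudo_label([[1.0, 0.0]], [[0.0, 0.0]], 3.0, 1.0)
```

In the toy example the first pixel aligns with the labelled category's text embedding and receives a score near 1, while the second aligns with the other category and receives a score near 0. A map with a higher confidence score contributes proportionally more to the updated pseudo label.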
Related papers
50 in total
  • [31] Weakly Supervised Salient Object Detection by Learning A Classifier-Driven Map Generator
    Hsu, Kuang-Jui
    Lin, Yen-Yu
    Chuang, Yung-Yu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (11) : 5435 - 5449
  • [32] Cross-frame feature-saliency mutual reinforcing for weakly supervised video salient object detection
    Wang, Jian
    Yu, Siyue
    Zhang, Bingfeng
    Zhao, Xinqiao
    Garcia-Fernandez, Angel F.
    Lim, Eng Gee
    Xiao, Jimin
    PATTERN RECOGNITION, 2024, 150
  • [33] Weakly-Supervised Salient Object Detection on Light Fields
    Liang, Zijian
    Wang, Pengjie
    Xu, Ke
    Zhang, Pingping
    Lau, Rynson W. H.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6295 - 6305
  • [34] A complementary dual model for weakly supervised salient object detection
    Chen, Liyuan
    Zhang, Dawei
    Wang, Xiao
    Wan, Chang
    Jin, Shan
    Zheng, Zhonglong
    PATTERN RECOGNITION, 2025, 163
  • [35] Weakly Supervised Salient Object Detection Using Image Labels
    Li, Guanbin
    Xie, Yuan
    Lin, Liang
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7024 - 7031
  • [36] Weakly Supervised Salient Object Detection by Hierarchically Enhanced Scribbles
    Wang, Xiongying
    Al-Huda, Zaid
    Peng, Bo
    Tang, Xin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (02)
  • [37] Scribble-based boundary-aware network for weakly supervised salient object detection in remote sensing images
    Huang, Zhou
    Xiang, Tian-Zhu
    Chen, Huai-Xin
    Dai, Hang
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 191 : 290 - 301
  • [38] Weakly supervised salient object detection via double object proposals guidance
    Zhou, Zhiheng
    Guo, Yongfan
    Dai, Ming
    Huang, Junchu
    Li, Xiangwei
    IET IMAGE PROCESSING, 2021, 15 (09) : 1957 - 1970
  • [39] Category-Aware Transformer Network for Better Human-Object Interaction Detection
    Dong, Leizhen
    Li, Zhimin
    Xu, Kunlun
    Zhang, Zhijun
    Yan, Luxin
    Zhong, Sheng
    Zou, Xu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19516 - 19525
  • [40] Salient Contour-Aware Based Twice Learning Strategy for Saliency Detection
    Zhu, Chunbiao
    Yan, Wei
    Liu, Shan
    Li, Thomas
    Li, Ge
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2541 - 2548