Category-Aware Saliency Enhance Learning Based on CLIP for Weakly Supervised Salient Object Detection

被引:0
|
作者
Yunde Zhang
Zhili Zhang
Tianshan Liu
Jun Kong
机构
[1] Jiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education)
[2] Anhui University,School of Computer Science and Technology
[3] The Hong Kong Polytechnic University,Department of Electronic and Information Engineering
来源
关键词
Weakly supervised; Salient object detection; Category-aware Saliency Enhance Learning; CLIP;
D O I
暂无
中图分类号
学科分类号
摘要
Weakly supervised salient object detection (SOD) using image-level category labels has been proposed to reduce the annotation cost of pixel-level labels. However, existing methods mostly train a classification network to generate a class activation map, which suffers from coarse localization and difficult pseudo-label updating. To address these issues, we propose a novel Category-aware Saliency Enhance Learning (CSEL) method based on contrastive vision-language pre-training (CLIP), which can perform image-text classification and pseudo-label updating simultaneously. Our proposed method transforms image-text classification into pixel-text matching and generates a category-aware saliency map, which is evaluated by the classification accuracy. Moreover, CSEL assesses the quality of the category-aware saliency map and the pseudo saliency map, and uses the quality confidence scores as weights to update the pseudo labels. The two maps mutually enhance each other to guide the pseudo saliency map in the correct direction. Our SOD network can be trained jointly under the supervision of the updated pseudo saliency maps. We test our model on various well-known RGB-D and RGB SOD datasets. Our model achieves an S-measure of 87.6%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} on the RGB-D NLPR dataset and 84.3%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} on the RGB ECSSD dataset. Additionally, we obtain satisfactory performance on the weakly supervised E-measure, F-measure, and mean absolute error metrics for other datasets. These results demonstrate the effectiveness of our model.
引用
收藏
相关论文
共 50 条
  • [21] Weakly Supervised Salient Object Detection with Box Annotation
    Jiang, Zhentao
    Chen, Qiang
    Jiang, Bo
    Leng, Cong
    Cheng, Jian
    PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 197 - 211
  • [22] Local saliency consistency-based label inference for weakly supervised salient object detection using scribble annotations
    Zhao, Shuo
    Cui, Peng
    Shen, Jing
    Liu, Haibo
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (01) : 239 - 249
  • [23] A Weakly Supervised Learning Framework for Salient Object Detection via Hybrid Labels
    Cong, Runmin
    Qin, Qi
    Zhang, Chen
    Jiang, Qiuping
    Wang, Shiqi
    Zhao, Yao
    Kwong, Sam
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (02) : 534 - 548
  • [24] Noise-Sensitive Adversarial Learning for Weakly Supervised Salient Object Detection
    Piao, Yongri
    Wu, Wei
    Zhang, Miao
    Jiang, Yongyao
    Lu, Huchuan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2888 - 2897
  • [25] WUSL-SOD: Joint weakly supervised, unsupervised and supervised learning for salient object detection
    Liu, Yan
    Zhang, Yunzhou
    Wang, Zhenyu
    Ma, Rong
    Qiu, Feng
    Coleman, Sonya
    Kerr, Dermot
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (21): : 15837 - 15856
  • [26] Weakly Supervised Object Localization with Latent Category Learning
    Wang, Chong
    Ren, Weiqiang
    Huang, Kaiqi
    Tan, Tieniu
    COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 431 - 445
  • [27] Weakly Supervised Object Detection Based on Active Learning
    Xiao Wang
    Xiang Xiang
    Baochang Zhang
    Xuhui Liu
    Jianying Zheng
    QingLei Hu
    Neural Processing Letters, 2022, 54 : 5169 - 5183
  • [28] CaT: Weakly Supervised Object Detection with Category Transfer
    Cao, Tianyue
    Du, Lianyu
    Zhang, Xiaoyun
    Chen, Siheng
    Zhang, Ya
    Wang, Yan-Feng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3050 - 3059
  • [29] Weakly Supervised Object Detection Based on Active Learning
    Wang, Xiao
    Xiang, Xiang
    Zhang, Baochang
    Liu, Xuhui
    Zheng, Jianying
    Hu, Qinglei
    NEURAL PROCESSING LETTERS, 2022, 54 (06) : 5169 - 5183
  • [30] Weakly Supervised Real-time Object Detection Based on Saliency Map
    Li Y.
    Wang P.
    Liu Y.
    Liu G.-J.
    Wang C.-Y.
    Liu X.-Y.
    Guo M.-Z.
    Liu, Yang (yliu76@hit.edu.cn), 1600, Science Press (46): : 242 - 255