Webly-supervised learning for salient object detection

Cited by: 18
Authors
Luo, Ao [1]
Li, Xin [2]
Yang, Fan [2]
Jiao, Zhicheng [3]
Cheng, Hong [1]
Affiliations
[1] Univ Elect Sci & Technol China, Chengdu 611731, Peoples R China
[2] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
[3] Univ N Carolina, Chapel Hill, NC 27599 USA
Keywords
Salient object detection; Webly-supervised learning; Deep learning; Optimization; Framework; Fusion
DOI
10.1016/j.patcog.2020.107308
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
End-to-end training of a deep CNN-based model for salient object detection usually requires a huge number of training samples with pixel-level annotations, which are costly and time-consuming to obtain. In this paper, we propose an approach that uses large amounts of web data to learn a deep salient object detection model. With thousands of images collected from the Web, we first employ several bottom-up saliency detection techniques to generate salient object masks for all images, and then use a novel quality evaluation method to pick out a subset of images with reliable masks for training. After that, we develop a self-training approach to boost the performance of our initial network, which iterates between training the network and updating the training set. Importantly, unlike existing webly-supervised or weakly-supervised methods, our approach automatically selects reliable images for network training without requiring any human intervention (e.g., dividing images into different difficulty levels). Extensive experiments on several widely used benchmarks demonstrate that our method achieves state-of-the-art performance. It significantly outperforms existing unsupervised and weakly-supervised salient object detection methods, and achieves competitive or even better performance than fully supervised approaches. © 2020 Elsevier Ltd. All rights reserved.
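
The pipeline described in the abstract (generate candidate masks, keep only images with reliable masks, then alternate between training and updating the training set) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not specify the quality evaluation method, the fusion rule, or the update rule, so mean pairwise IoU, majority voting, and the agreement check below are stand-ins. All names (`agreement_score`, `fuse`, `select_reliable`, `self_train`, `train_fn`, `predict_fn`) and the `threshold` value are hypothetical.

```python
from itertools import combinations


def iou(a, b):
    """Intersection-over-union of two binary masks given as flat 0/1 lists."""
    inter = sum(1 for x, y in zip(a, b) if x and y)
    union = sum(1 for x, y in zip(a, b) if x or y)
    return inter / union if union else 1.0


def agreement_score(masks):
    """Assumed quality proxy: mean pairwise IoU across candidate masks."""
    pairs = list(combinations(masks, 2))
    if not pairs:
        return 0.0
    return sum(iou(a, b) for a, b in pairs) / len(pairs)


def fuse(masks):
    """Assumed fusion rule: per-pixel majority vote over candidate masks."""
    return [1 if 2 * sum(px) > len(masks) else 0 for px in zip(*masks)]


def select_reliable(candidates, threshold=0.6):
    """Keep images whose candidate masks agree; return {image_id: fused mask}."""
    return {
        img: fuse(masks)
        for img, masks in candidates.items()
        if agreement_score(masks) >= threshold
    }


def self_train(candidates, train_fn, predict_fn, rounds=3, threshold=0.6):
    """Alternate between training the model and updating the training set."""
    train_set = select_reliable(candidates, threshold)
    model = None
    for _ in range(rounds):
        model = train_fn(train_set)
        for img, masks in candidates.items():
            pred = predict_fn(model, img)
            # Assumed update rule: admit or refresh an image when the model's
            # prediction agrees with its bottom-up candidate masks.
            if agreement_score(masks + [pred]) >= threshold:
                train_set[img] = pred
    return model, train_set
```

Here `candidates` maps each image identifier to the list of masks produced by the bottom-up detectors, while `train_fn` and `predict_fn` are placeholders for whatever network training and inference routines are used.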
Pages: 12