S³Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection

Cited by: 6
Authors
Zhu, Lei [1 ,2 ]
Wang, Xiaoqiang [3 ]
Li, Ping [4 ]
Yang, Xin [5 ]
Zhang, Qing [6 ]
Wang, Weiming [7 ]
Schonlieb, Carola-Bibiane [8 ]
Chen, C. L. Philip [9 ,10 ,11 ]
Affiliations
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Univ Cambridge, Dept Appl Math & Theoret Phys DAMTP, Cambridge CB3 0WA, England
[3] Zhejiang Univ, Coll Comp Sci & Technol, Shatin, Hangzhou 310058, Peoples R China
[4] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong 00852, Peoples R China
[5] Dalian Univ Technol, Dept Comp Sci, Dalian 116024, Peoples R China
[6] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
[7] Hong Kong Metropolitan Univ, Sch Sci & Technol, Ho Man Tin, Hong Kong 00852, Peoples R China
[8] Univ Cambridge, Dept Appl Math & Theoret Phys DAMTP, Cambridge CB3 0WA, England
[9] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[10] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
[11] Univ Macau, Fac Sci & Technol, Macau 999078, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Saliency detection; Feature extraction; Convolutional neural networks; Task analysis; Detectors; Object detection; Training; RGB-D salient object detection; self-supervised learning; semi-supervised learning; cross-model and cross-level feature aggregation; SEGMENTATION; FUSION;
DOI
10.1109/TMM.2021.3129730
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
RGB-D salient object detection aims to detect visually distinctive objects or regions from a paired RGB image and depth image. State-of-the-art RGB-D saliency detectors are mainly based on convolutional neural networks, but almost all of them share an intrinsic limitation: they rely on labeled data, which degrades detection accuracy in complex cases. In this work, we present a self-supervised self-ensembling network (S³Net) for semi-supervised RGB-D salient object detection that leverages unlabeled data and explores a self-supervised learning mechanism. Specifically, we first build a self-guided convolutional neural network (SG-CNN) as a baseline model by developing a series of three-layer cross-model feature fusion (TCF) modules to leverage complementary information between the depth and RGB modalities, and by formulating an auxiliary task that predicts a self-supervised image rotation angle. Then, to further exploit the knowledge in unlabeled data, we use SG-CNN as both a student network and a teacher network, and encourage the saliency predictions and self-supervised rotation predictions from these two networks to be consistent on the unlabeled data. Experimental results on seven widely used benchmark datasets demonstrate that our network quantitatively and qualitatively outperforms state-of-the-art methods.
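The abstract names three training ingredients: a rotation-prediction auxiliary task, a student and a teacher copy of the same network, and consistency between their predictions on unlabeled data. The following is only a rough sketch of how such pieces are commonly wired together, not the authors' implementation. The four-angle rotation set, the exponential-moving-average teacher update, the mean-squared consistency measure, and every function name here are assumptions made for illustration; the abstract does not specify any of them.

```python
import random

# Assumed candidate angles for the auxiliary task (not stated in the abstract).
ROTATIONS = (0, 90, 180, 270)


def rotate90(img, k):
    """Rotate a 2-D list-of-lists image counter-clockwise by k * 90 degrees."""
    for _ in range(k % 4):
        # Transpose, then reverse the row order: one 90-degree CCW turn.
        img = [list(row) for row in zip(*img)][::-1]
    return img


def make_rotation_sample(img, rng):
    """Return (rotated image, index of the angle) as a free training label."""
    k = rng.randrange(len(ROTATIONS))
    return rotate90(img, k), k


def ema_update(teacher, student, alpha=0.99):
    """Assumed teacher update: moving average of the student's weights."""
    return [alpha * t + (1.0 - alpha) * s for t, s in zip(teacher, student)]


def consistency_loss(student_pred, teacher_pred):
    """Mean squared difference between two flat prediction lists."""
    n = len(student_pred)
    return sum((s - t) ** 2 for s, t in zip(student_pred, teacher_pred)) / n


def unlabeled_loss(student_sal, teacher_sal, student_rot, teacher_rot):
    """Sum of saliency consistency and rotation-prediction consistency."""
    return (consistency_loss(student_sal, teacher_sal)
            + consistency_loss(student_rot, teacher_rot))
```

In this sketch the rotation label costs nothing to produce, which is what lets unlabeled image pairs contribute a training signal; the teacher is never trained directly and only tracks the student.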
Pages: 676-689
Number of pages: 14
Related papers
50 in total
  • [21] Self-supervised Bernoulli Autoencoders for Semi-supervised Hashing
    Nanculef, Ricardo
    Mena, Francisco
    Macaluso, Antonio
    Lodi, Stefano
    Sartori, Claudio
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2021, 2021, 12702 : 258 - 268
  • [22] S6: SEMI-SUPERVISED SELF-SUPERVISED SEMANTIC SEGMENTATION
    Soliman, Moamen
    Lehman, Charles
    AlRegib, Ghassan
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1861 - 1865
  • [23] S4L: Self-Supervised Semi-Supervised Learning
    Zhai, Xiaohua
    Oliver, Avital
    Kolesnikov, Alexander
    Beyer, Lucas
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1476 - 1485
  • [24] Joint Semantic Mining for Weakly Supervised RGB-D Salient Object Detection
    Li, Jingjing
    Ji, Wei
    Bi, Qi
    Yan, Cheng
    Zhang, Miao
    Piao, Yongri
    Lu, Huchuan
    Cheng, Li
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [25] A Self-ensembling Framework for Semi-supervised Knee Cartilage Defects Assessment with Dual-Consistency
    Huo, Jiayu
    Si, Liping
    Ouyang, Xi
    Xuan, Kai
    Yao, Weiwu
    Xue, Zhong
    Wang, Qian
    Shen, Dinggang
    Zhang, Lichi
    PREDICTIVE INTELLIGENCE IN MEDICINE, PRIME 2020, 2020, 12329 : 200 - 209
  • [26] Self-Supervised Implicit 3D Reconstruction via RGB-D Scans
    Yang, Hongji
    Liu, Jiao
    Lu, Shaoping
    Ren, Bo
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1115 - 1120
  • [27] Self-Supervised Assisted Semi-Supervised Residual Network for Hyperspectral Image Classification
    Song, Liangliang
    Feng, Zhixi
    Yang, Shuyuan
    Zhang, Xinyu
    Jiao, Licheng
    REMOTE SENSING, 2022, 14 (13)
  • [28] RGB-D mutual guidance for semi-supervised defocus blur detection
    Li, Huaguang
    Qian, Wenhua
    Nie, Rencan
    Cao, Jinde
    Liu, Peng
    Xu, Dan
    KNOWLEDGE-BASED SYSTEMS, 2022, 255
  • [29] A semi-supervised recurrent neural network for video salient object detection
    Aditya Kompella
    Raghavendra V. Kulkarni
    Neural Computing and Applications, 2021, 33 : 2065 - 2083
  • [30] A semi-supervised recurrent neural network for video salient object detection
    Kompella, Aditya
    Kulkarni, Raghavendra, V
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (06): : 2065 - 2083