S 3 Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection

被引:4
|
作者
Zhu, Lei [1 ,2 ]
Wang, Xiaoqiang [3 ]
Li, Ping [4 ]
Yang, Xin [5 ]
Zhang, Qing [6 ]
Wang, Weiming [7 ]
Schonlieb, Carola-Bibiane [8 ]
Chen, C. L. Philip [9 ,10 ,11 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Univ Cambridge, Dept Appl Math & Theoret Phys DAMTP, Cambridge CB3 0WA, England
[3] Zhejiang Univ, Coll Comp Sci & Technol, Shatin, Hangzhou 310058, Peoples R China
[4] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong 00852, Peoples R China
[5] Dalian Univ Technol, Dept Comp Sci, Dalian 116024, Peoples R China
[6] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
[7] Hong Kong Metropolitan Univ, Sch Sci & Technol, Ho Man Tin, Hong Kong 00852, Peoples R China
[8] Univ Cambridge, Dept Appl Math & Theoret Phys DAMTP, Cambridge CB3 0WA, England
[9] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[10] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
[11] Univ Macau, Fac Sci & Technol, Macau 999078, Peoples R China
基金
中国国家自然科学基金;
关键词
Saliency detection; Feature extraction; Convolutional neural networks; Task analysis; Detectors; Object detection; Training; RGB-D salient object detection; self-supervised learning; semi-supervised learning; and cross-model and cross-level feature aggregation; SEGMENTATION; FUSION;
D O I
10.1109/TMM.2021.3129730
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
RGB-D salient object detection aims to detect visually distinctive objects or regions from a pair of the RGB image and the depth image. State-of-the-art RGB-D saliency detectors are mainly based on convolutional neural networks but almost suffer from an intrinsic limitation relying on the labeled data, thus degrading detection accuracy in complex cases. In this work, we present a self-supervised self-ensembling network (S-3 Net) for semi-supervised RGB-D salient object detection by leveraging the unlabeled data and exploring a self-supervised learning mechanism. To be specific, we first build a self-guided convolutional neural network (SG-CNN) as a baseline model by developing a series of three-layer cross-model feature fusion (TCF) modules to leverage complementary information among depth and RGB modalities and formulating an auxiliary task that predicts a self-supervised image rotation angle. After that, to further explore the knowledge from unlabeled data, we assign SG-CNN to a student network and a teacher network, and encourage the saliency predictions and self-supervised rotation predictions from these two networks to be consistent on the unlabeled data. Experimental results on seven widely-used benchmark datasets demonstrate that our network quantitatively and qualitatively outperforms the state-of-the-art methods.
引用
收藏
页码:676 / 689
页数:14
相关论文
共 50 条
  • [31] Self-Supervised Learning for Semi-Supervised Temporal Language Grounding
    Luo, Fan
    Chen, Shaoxiang
    Chen, Jingjing
    Wu, Zuxuan
    Jiang, Yu-Gang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7747 - 7757
  • [32] Self-Supervised Learning for Semi-Supervised Temporal Action Proposal
    Wang, Xiang
    Zhang, Shiwei
    Qing, Zhiwu
    Shao, Yuanjie
    Gao, Changxin
    Sang, Nong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1905 - 1914
  • [33] Semi-supervised learning made simple with self-supervised clustering
    Fini, Enrico
    Astolfi, Pietro
    Alahari, Karteek
    Alameda-Meda, Xavier
    Mairal, Julien
    Nabi, Moin
    Ricci, Elisa
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 3187 - 3197
  • [34] Mutual Information Regularization for Weakly-Supervised RGB-D Salient Object Detection
    Li, Aixuan
    Mao, Yuxin
    Zhang, Jing
    Dai, Yuchao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 397 - 410
  • [35] Self-Ensembling Co-Training Framework for Semi-Supervised COVID-19 CT Segmentation
    Li, Caizi
    Dong, Li
    Dou, Qi
    Lin, Fan
    Zhang, Kebao
    Feng, Zuxin
    Si, Weixin
    Deng, Xuesong
    Deng, Zhe
    Heng, Pheng-Ann
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (11) : 4140 - 4151
  • [36] Deep Hypersphere Feature Regularization for Weakly Supervised RGB-D Salient Object Detection
    Liu, Zhiyu
    Hayat, Munawar
    Yang, Hong
    Peng, Duo
    Lei, Yinjie
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5423 - 5437
  • [37] SSAL-Net: Semi-supervised network integrating self-supervised adversarial learning for diagnosing subtypes of pulmonary nodules
    Fu, Yu
    Liu, Changli
    Wang, Shaoqiang
    Dong, Enqing
    Xia, Hui
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 96
  • [38] Semi-supervised self-training of object detection models
    Rosenberg, C
    Hebert, M
    Schneiderman, H
    WACV 2005: SEVENTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2005, : 29 - 36
  • [39] SEMI-SUPERVISED AND SELF-SUPERVISED COLLABORATIVE LEARNING FOR PROSTATE 3D MR IMAGE SEGMENTATION
    Osman, Yousuf Babiker M.
    Li, Cheng
    Huang, Weijian
    Elsayed, Nazik
    Ying, Leslie
    Zheng, Hairong
    Wang, Shanshan
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [40] InverseTime: A Self-Supervised Technique for Semi-Supervised Classification of Time Series
    Goyo, Manuel Alejandro
    Nanculef, Ricardo
    Valle, Carlos
    IEEE Access, 2024, 12 : 165081 - 165093