Monocular Depth Estimation via Self-Supervised Self-Distillation

被引:1
|
作者
Hu, Haifeng [1 ]
Feng, Yuyang [1 ]
Li, Dapeng [1 ]
Zhang, Suofei [2 ]
Zhao, Haitao [2 ,3 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing 210003, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Internet Things, Nanjing 210003, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Engn Res Ctr Hlth Serv Syst Based Ubiquitous Wirel, Minist Educ, Nanjing 210003, Peoples R China
基金
中国国家自然科学基金;
关键词
monocular depth estimation; self-distillation; self-supervised learning; normal estimate;
D O I
10.3390/s24134090
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Self-supervised monocular depth estimation can exhibit excellent performance in static environments due to the multi-view consistency assumption during the training process. However, it is hard to maintain depth consistency in dynamic scenes when considering the occlusion problem caused by moving objects. For this reason, we propose a method of self-supervised self-distillation for monocular depth estimation (SS-MDE) in dynamic scenes, where a deep network with a multi-scale decoder and a lightweight pose network are designed to predict depth in a self-supervised manner via the disparity, motion information, and the association between two adjacent frames in the image sequence. Meanwhile, in order to improve the depth estimation accuracy of static areas, the pseudo-depth images generated by the LeReS network are used to provide the pseudo-supervision information, enhancing the effect of depth refinement in static areas. Furthermore, a forgetting factor is leveraged to alleviate the dependency on the pseudo-supervision. In addition, a teacher model is introduced to generate depth prior information, and a multi-view mask filter module is designed to implement feature extraction and noise filtering. This can enable the student model to better learn the deep structure of dynamic scenes, enhancing the generalization and robustness of the entire model in a self-distillation manner. Finally, on four public data datasets, the performance of the proposed SS-MDE method outperformed several state-of-the-art monocular depth estimation techniques, achieving an accuracy (delta 1) of 89% while minimizing the error (AbsRel) by 0.102 in NYU-Depth V2 and achieving an accuracy (delta 1) of 87% while minimizing the error (AbsRel) by 0.111 in KITTI.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Self-supervised monocular depth estimation with self-distillation and dense skip connection
    Xiang, Xuezhi
    Li, Wei
    Wang, Yao
    El Saddik, Abdulmotaleb
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 246
  • [2] Edge Devices Friendly Self-Supervised Monocular Depth Estimation via Knowledge Distillation
    Gao, Wei
    Rao, Di
    Yang, Yang
    Chen, Jie
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (12) : 8470 - 8477
  • [3] Multi-resolution distillation for self-supervised monocular depth estimation
    Lee, Sebin
    Im, Woobin
    Yoon, Sung-Eui
    [J]. PATTERN RECOGNITION LETTERS, 2023, 176 : 215 - 222
  • [4] Multi-resolution distillation for self-supervised monocular depth estimation
    Lee, Sebin
    Im, Woobin
    Yoon, Sung-Eui
    [J]. PATTERN RECOGNITION LETTERS, 2023, 176
  • [5] Digging Into Self-Supervised Monocular Depth Estimation
    Godard, Clement
    Mac Aodha, Oisin
    Firman, Michael
    Brostow, Gabriel
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3827 - 3837
  • [6] On the uncertainty of self-supervised monocular depth estimation
    Poggi, Matteo
    Aleotti, Filippo
    Tosi, Fabio
    Mattoccia, Stefano
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3224 - 3234
  • [7] Revisiting Self-supervised Monocular Depth Estimation
    Kim, Ue-Hwan
    Lee, Gyeong-Min
    Kim, Jong-Hwan
    [J]. ROBOT INTELLIGENCE TECHNOLOGY AND APPLICATIONS 6, 2022, 429 : 336 - 350
  • [8] Self-supervised monocular depth estimation in fog
    Tao, Bo
    Hu, Jiaxin
    Jiang, Du
    Li, Gongfa
    Chen, Baojia
    Qian, Xinbo
    [J]. OPTICAL ENGINEERING, 2023, 62 (03)
  • [9] Self-Supervised Monocular Depth Estimation With Self-Reference Distillation and Disparity Offset Refinement
    Liu, Zhong
    Li, Ran
    Shao, Shuwei
    Wu, Xingming
    Chen, Weihai
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7565 - 7577
  • [10] Self-distillation framework for indoor and outdoor monocular depth estimation
    Meng Pan
    Huanrong Zhang
    Jiahao Wu
    Zhi Jin
    [J]. Multimedia Tools and Applications, 2022, 81 : 35899 - 35913