Self-Supervised Learning for Monocular Depth Estimation on Minimally Invasive Surgery Scenes

被引:6
|
作者
Shao, Shuwei [1 ]
Pei, Zhongcai [1 ]
Chen, Weihai [1 ]
Zhang, Baochang [1 ,2 ]
Wu, Xingming [1 ]
Sun, Dianmin [3 ]
Doermann, David [4 ]
机构
[1] Beihang Univ, Beijing, Peoples R China
[2] Shenzhen Acad Aerosp Technol, Shenzhen, Peoples R China
[3] Shandong Univ, Shandong Canc Hosp, Jinan, Peoples R China
[4] Univ Buffalo, Buffalo, NY USA
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICRA48506.2021.9561508
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised learning algorithms that compute depth map from monocular videos have achieved remarkable performance on urban scenes and have been applied extensively. These techniques still face significant challenges, however, when applied directly to endoscopic videos because of the brightness variations from frame to frame and inadequate representation learning during the training phase. Inspired by the optical flow for motion alignment between adjacent frames, we design a AFNet with structural stability loss and residual-based smoothness loss to learn the appearance flow across adjacent frames, which handles the brightness inconsistency issue efficaciously. In addition, we propose a novel self-attention mechanism named feature scaling module to alleviate the inadequate representation learning problem. In a comparison study to the current state-of-the-art self-supervised methods explored for urban videos on the SCARED dataset, the developed model surpasses existing methods by a large margin.
引用
收藏
页码:7159 / 7165
页数:7
相关论文
共 50 条
  • [1] Self-supervised monocular depth estimation in dynamic scenes based on deep learning
    Cheng, Binbin
    Yu, Ying
    Zhang, Lei
    Wang, Ziquan
    Jiang, Zhipeng
    [J]. National Remote Sensing Bulletin, 2024, 28 (09) : 2170 - 2186
  • [2] Self-supervised monocular image depth learning and confidence estimation
    Chen, Long
    Tang, Wen
    Wan, Tao Ruan
    John, Nigel W.
    [J]. NEUROCOMPUTING, 2020, 381 : 272 - 281
  • [3] Self-supervised Learning for Dense Depth Estimation in Monocular Endoscopy
    Liu, Xingtong
    Sinha, Ayushi
    Unberath, Mathias
    Ishii, Masaru
    Hager, Gregory D.
    Taylor, Russell H.
    Reiter, Austin
    [J]. OR 2.0 CONTEXT-AWARE OPERATING THEATERS, COMPUTER ASSISTED ROBOTIC ENDOSCOPY, CLINICAL IMAGE-BASED PROCEDURES, AND SKIN IMAGE ANALYSIS, OR 2.0 2018, 2018, 11041 : 128 - 138
  • [4] Self-Supervised Multi-Frame Monocular Depth Estimation for Dynamic Scenes
    Wu, Guanghui
    Liu, Hao
    Wang, Longguang
    Li, Kunhong
    Guo, Yulan
    Chen, Zengping
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4989 - 5001
  • [5] Self-supervised monocular depth estimation in dynamic scenes with moving instance loss
    Yue, Min
    Fu, Guangyuan
    Wu, Ming
    Zhang, Xin
    Gu, Hongyang
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 112
  • [6] Digging Into Self-Supervised Monocular Depth Estimation
    Godard, Clement
    Mac Aodha, Oisin
    Firman, Michael
    Brostow, Gabriel
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3827 - 3837
  • [7] On the uncertainty of self-supervised monocular depth estimation
    Poggi, Matteo
    Aleotti, Filippo
    Tosi, Fabio
    Mattoccia, Stefano
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3224 - 3234
  • [8] Revisiting Self-supervised Monocular Depth Estimation
    Kim, Ue-Hwan
    Lee, Gyeong-Min
    Kim, Jong-Hwan
    [J]. ROBOT INTELLIGENCE TECHNOLOGY AND APPLICATIONS 6, 2022, 429 : 336 - 350
  • [9] Self-supervised monocular depth estimation in fog
    Tao, Bo
    Hu, Jiaxin
    Jiang, Du
    Li, Gongfa
    Chen, Baojia
    Qian, Xinbo
    [J]. OPTICAL ENGINEERING, 2023, 62 (03)
  • [10] Self-supervised monocular depth estimation on water scenes via specular reflection prior
    Lu, Zhengyang
    Chen, Ying
    [J]. DIGITAL SIGNAL PROCESSING, 2024, 149