Geometric Pretraining for Monocular Depth Estimation

被引:0
|
作者
Wang, Kaixuan [1 ]
Chen, Yao [2 ]
Guo, Hengkai [2 ]
Wen, Linfu [2 ]
Shen, Shaojie [1 ]
机构
[1] HKUST, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[2] ByteDance AI Lab, Beijing, Peoples R China
关键词
D O I
10.1109/icra40945.2020.9196847
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
ImageNet-pretrained networks have been widely used in transfer learning for monocular depth estimation. These pretrained networks are trained with classification losses for which only semantic information is exploited while spatial information is ignored. However, both semantic and spatial information is important for per-pixel depth estimation. In this paper, we design a novel self-supervised geometric pretraining task that is tailored for monocular depth estimation using uncalibrated videos. The designed task decouples the structure information from input videos by a simple yet effective conditional autoencoder-decoder structure. Using almost unlimited videos from the internet, networks are pretrained to capture a variety of structures of the scene and can be easily transferred to depth estimation tasks using calibrated images. Extensive experiments are used to demonstrate that the proposed geometric-pretrained networks perform better than ImageNet-pretrained networks in terms of accuracy, few-shot learning and generalization ability. Using existing learning methods, geometric-transferred networks achieve new state-of-the-art results by a large margin. The pretrained networks will be open source soon(1).
引用
收藏
页码:4782 / 4788
页数:7
相关论文
共 50 条
  • [1] Self-Supervised Monocular Depth Estimation With Extensive Pretraining
    Choi, Hyukdoo
    IEEE ACCESS, 2021, 9 : 157236 - 157246
  • [2] Self-Supervised Monocular Depth Estimation with Extensive Pretraining
    Choi, Hyukdoo
    IEEE Access, 2021, 9 : 157236 - 157246
  • [3] Monocular Depth Estimation with Adaptive Geometric Attention
    Naderi, Taher
    Sadovnik, Amir
    Hayward, Jason
    Qi, Hairong
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 617 - 627
  • [4] The Monocular Depth Estimation Challenge
    Spencer, Jaime
    Qian, C. Stella
    Russell, Chris
    Hadfield, Simon
    Graf, Erich
    Adams, Wendy
    Schofield, Andrew J.
    Elder, James
    Bowden, Richard
    Cong, Heng
    Mattoccia, Stefano
    Poggi, Matteo
    Suri, Zeeshan Khan
    Tang, Yang
    Tosi, Fabio
    Wang, Hao
    Zhang, Youmin
    Zhang, Yusheng
    Zhao, Chaoqiang
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2023, : 623 - 632
  • [5] Perceptual Monocular Depth Estimation
    Pan, Janice
    Bovik, Alan C.
    NEURAL PROCESSING LETTERS, 2021, 53 (02) : 1205 - 1228
  • [6] Perceptual Monocular Depth Estimation
    Janice Pan
    Alan C. Bovik
    Neural Processing Letters, 2021, 53 : 1205 - 1228
  • [7] Sparse depth densification for monocular depth estimation
    Zhen Liang
    Tiyu Fang
    Yanzhu Hu
    Yingjian Wang
    Multimedia Tools and Applications, 2024, 83 : 14821 - 14838
  • [8] Depth Map Decomposition for Monocular Depth Estimation
    Jun, Jinyoung
    Lee, Jae-Han
    Lee, Chul
    Kim, Chang-Su
    COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 18 - 34
  • [9] Sparse depth densification for monocular depth estimation
    Liang, Zhen
    Fang, Tiyu
    Hu, Yanzhu
    Wang, Yingjian
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) : 14821 - 14838
  • [10] Unsupervised Monocular Depth and Camera Pose Estimation with Multiple Masks and Geometric Consistency Constraints
    Zhang, Xudong
    Zhao, Baigan
    Yao, Jiannan
    Wu, Guoqing
    SENSORS, 2023, 23 (11)