Self-Supervised Human Depth Estimation from Monocular Videos

被引:19
|
作者
Tan, Feitong [1 ]
Zhu, Hao [2 ]
Cui, Zhaopeng [3 ]
Zhu, Siyu [4 ]
Pollefeys, Marc [3 ]
Tan, Ping [1 ]
机构
[1] Simon Fraser Univ, Burnaby, BC, Canada
[2] Nanjing Univ, Nanjing, Jiangsu, Peoples R China
[3] Swiss Fed Inst Technol, Zurich, Switzerland
[4] Alibaba AI Labs, Hangzhou, Zhejiang, Peoples R China
关键词
SHAPE;
D O I
10.1109/CVPR42600.2020.00073
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous methods on estimating detailed human depth often require supervised training with 'ground truth' depth data. This paper presents a self-supervised method that can be trained on YouTube videos without known depth, which makes training data collection simple and improves the generalization of the learned network. The self-supervised learning is achieved by minimizing a photo-consistency loss, which is evaluated between a video frame and its neighboring frames warped according to the estimated depth and the 3D non-rigid motion of the human body. To solve this non-rigid motion, we first estimate a rough SMPL model at each video frame and compute the non-rigid body motion accordingly, which enables self-supervised learning on estimating the shape details. Experiments demonstrate that our method enjoys better generalization and performs much better on data in the wild.
引用
收藏
页码:647 / 656
页数:10
相关论文
共 50 条
  • [1] Adaptive Self-supervised Depth Estimation in Monocular Videos
    Mendoza, Julio
    Pedrini, Helio
    IMAGE AND GRAPHICS (ICIG 2021), PT III, 2021, 12890 : 687 - 699
  • [2] Self-supervised monocular depth estimation from oblique UAV videos
    Madhuanand, Logambal
    Nex, Francesco
    Yang, Michael Ying
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 176 : 1 - 14
  • [3] Learning Effective Geometry Representation from Videos for Self-Supervised Monocular Depth Estimation
    Zhao, Hailiang
    Kong, Yongyi
    Zhang, Chonghao
    Zhang, Haoji
    Zhao, Jiansen
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2024, 13 (06)
  • [4] Digging Into Self-Supervised Monocular Depth Estimation
    Godard, Clement
    Mac Aodha, Oisin
    Firman, Michael
    Brostow, Gabriel
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3827 - 3837
  • [5] Self-supervised monocular depth estimation in fog
    Tao, Bo
    Hu, Jiaxin
    Jiang, Du
    Li, Gongfa
    Chen, Baojia
    Qian, Xinbo
    OPTICAL ENGINEERING, 2023, 62 (03)
  • [6] On the uncertainty of self-supervised monocular depth estimation
    Poggi, Matteo
    Aleotti, Filippo
    Tosi, Fabio
    Mattoccia, Stefano
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3224 - 3234
  • [7] Revisiting Self-supervised Monocular Depth Estimation
    Kim, Ue-Hwan
    Lee, Gyeong-Min
    Kim, Jong-Hwan
    ROBOT INTELLIGENCE TECHNOLOGY AND APPLICATIONS 6, 2022, 429 : 336 - 350
  • [8] Transferring knowledge from monocular completion for self-supervised monocular depth estimation
    Sun, Lin
    Li, Yi
    Liu, Bingzheng
    Xu, Liying
    Zhang, Zhe
    Zhu, Jie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42485 - 42495
  • [9] Transferring knowledge from monocular completion for self-supervised monocular depth estimation
    Lin Sun
    Yi Li
    Bingzheng Liu
    Liying Xu
    Zhe Zhang
    Jie Zhu
    Multimedia Tools and Applications, 2022, 81 : 42485 - 42495
  • [10] Semantically guided self-supervised monocular depth estimation
    Lu, Xiao
    Sun, Haoran
    Wang, Xiuling
    Zhang, Zhiguo
    Wang, Haixia
    IET IMAGE PROCESSING, 2022, 16 (05) : 1293 - 1304