OnionNet: Single-View Depth Prediction and Camera Pose Estimation for Unlabeled Video

被引:4
|
作者
Gu, Tianhao [1 ,2 ]
Wang, Zhe [1 ,2 ]
Li, Dongdong [2 ]
Yang, Hai [2 ]
Du, Wenli [1 ]
Zhou, Yangming [2 ]
机构
[1] East China Univ Sci & Technol, Key Lab Adv Control & Optimizat Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
[2] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China
基金
美国国家科学基金会;
关键词
Cameras; Training; Pose estimation; Geometry; Robustness; Task analysis; Decoding; Camera pose estimation; multitask learning; single-view depth prediction; unsupervised learning; LOCALIZATION; SLAM;
D O I
10.1109/TCDS.2020.3042521
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In real scenes, humans can easily infer their positions and distances from other objects with their own eyes. To make the robots have the same visual ability, this article presents an unsupervised OnionNet framework, including LeafNet and ParachuteNet, for single-view depth prediction and camera pose estimation. In OnionNet, for speeding up OnionNet's convergence and concretizing objects against the gradient locality and moving objects in videos, LeafNet adopts two decoders and enhanced upconvolution modules. Meanwhile, for improving the robustness of fast camera movement and rotation, ParachuteNet uses and integrates three pose networks to estimate multiview camera pose parameters by combining with the modified image preprocess. Different from existing methods, single-view depth prediction and camera pose estimation are trained view by view, where the variations between views is gradual reduction of view range and outer pixels disappear in next view, similar to onion peeling. Moreover, the LeafNet is optimized with pose parameter from each pose network in turn. Experimental results on the KITTI data set show the outstanding effectiveness of our method: single-view depth performs better than most supervised and unsupervised methods which contain two same subtasks, and pose estimation gets the state-of-the-art performance compared with existing methods under the comparable input settings.
引用
收藏
页码:995 / 1009
页数:15
相关论文
共 50 条
  • [41] Comparison of a single-view and a double-view aerosol optical depth retrieval algorithm
    Henderson, BG
    Chylek, P
    OPTICAL SPECTROSCOPIC TECHNIQUES AND INSTRUMENTATION FOR ATMOSPHERIC AND SPACE RESEARCH V, 2003, 5157 : 116 - 123
  • [42] Single-View Fluoroscopic X-Ray Pose Estimation: A Comparison of Alternative Loss Functions and Volumetric Scene Representations
    Zhou, Chaochao
    Faruqui, Syed Hasib Akhter
    An, Dayeong
    Patel, Abhinav
    Abdalla, Ramez N.
    Hurley, Michael C.
    Shaibani, Ali
    Potts, Matthew B.
    Jahromi, Babak S.
    Ansari, Sameer A.
    Cantrell, Donald R.
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024,
  • [43] Bayesian Deep Neural Networks for Supervised Learning of Single-View Depth
    Rodriguez-Puigvert, Javier
    Martinez-Cantin, Ruben
    Civera, Javier
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 2565 - 2572
  • [44] AN ADAPTIVE PYRAMID SINGLE-VIEW DEPTH LOOKUP TABLE CODING METHOD
    Cai, Yangang
    Wang, Ronggang
    Gu, Song
    Zhang, Jian
    Gao, Wen
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1940 - 1944
  • [45] DynOcc: Learning Single-View Depth from Dynamic Occlusion Cues
    Wang, Yifan
    Luo, Linjie
    Shen, Xiaohui
    Mei, Xing
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 514 - 523
  • [46] Single-View Food Portion Estimation Based on Geometric Models
    Fang, Shaobo
    Liu, Chang
    Zhu, Fengqing
    Delp, Edward J.
    Boushey, Carol J.
    2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 385 - 390
  • [47] Intrinsic Image Diffusion for Indoor Single-view Material Estimation
    Kocsis, Peter
    Sitzmann, Vincent
    Niessner, Matthias
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 5198 - 5208
  • [48] Absolute Human Pose Estimation with Depth Prediction Network
    Veges, Marton
    Lorincz, Andras
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [49] Iterative Camera Motion and Depth Estimation in a Video Sequence
    Dibos, Francoise
    Jonchery, Claire
    Koepfler, Georges
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2009, 5702 : 1028 - +
  • [50] Single-camera pose estimation using mirage
    Singhirunnusorn, Khomsun
    Fahimi, Farbod
    Aygun, Ramazan
    IET COMPUTER VISION, 2018, 12 (05) : 720 - 727