Full Surround Monodepth From Multiple Cameras

被引:13
|
作者
Guizilini, Vitor [1 ]
Vasiljevic, Igor [2 ]
Ambrus, Rares [1 ]
Shakhnarovich, Greg [2 ]
Gaidon, Adrien [1 ]
机构
[1] Toyota Res Inst TRI, Los Altos, CA 95051 USA
[2] Toyota Technol Inst Chicago, Chicago, IL 60194 USA
关键词
Computer vision; machine learning; autonomous automobiles;
D O I
10.1109/LRA.2022.3150884
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Self-supervised monocular depth and ego-motion estimation is a promising approach to replace or supplement expensive depth sensors such as LiDAR for robotics applications like autonomous driving. However, most research in this area focuses on a single monocular camera or stereo pairs that cover only a fraction of the scene around the vehicle. In this work, we extend monocular self-supervised depth and ego-motion estimation to large-baseline multi-camera rigs. Using generalized spatio-temporal contexts, pose consistency constraints, and carefully designed photometric loss masking, we learn a single network generating dense, consistent, and scale-aware point clouds that cover the same full surround 360 degrees field of view as a typical LiDAR scanner. We also propose a new scale-consistent evaluation metric more suitable to multicamera settings. Experiments on two challenging benchmarks illustrate the benefits of our approach over strong baselines.
引用
收藏
页码:5397 / 5404
页数:8
相关论文
共 50 条
  • [21] Data fusion from multiple cameras for automatic disassembly
    Gil, P
    Puente, ST
    Torres, F
    Pomares, J
    Candelas, FA
    INTELLIGENT ASSEMBLY AND DISASSEMBLY 2001, 2002, : 85 - 90
  • [22] Acquisition of sharp depth map from multiple cameras
    Park, JI
    Inoue, S
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 1998, 14 (1-2) : 7 - 19
  • [23] FCam for Multiple Cameras
    Troccoli, Alejandro
    Pajak, Dawid
    Pulli, Kari
    MULTIMEDIA ON MOBILE DEVICES 2012 AND MULTIMEDIA CONTENT ACCESS: ALGORITHMS AND SYSTEMS VI, 2012, 8304
  • [24] Real-Time, Full 3-D Reconstruction of Moving Foreground Objects From Multiple Consumer Depth Cameras
    Alexiadis, Dimitrios S.
    Zarpalas, Dimitrios
    Daras, Petros
    IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (02) : 339 - 358
  • [25] Cameras which throw the full sight
    不详
    BIOFUTUR, 2013, (345) : 18 - 18
  • [26] MULTIPLE GUINIER CAMERAS
    DEWOLFF, PM
    ACTA CRYSTALLOGRAPHICA, 1948, 1 (1-6): : 207 - &
  • [27] Hough Forests Revisited: An Approach to Multiple Instance Tracking from Multiple Cameras
    Poier, Georg
    Schulter, Samuel
    Sternig, Sabine
    Roth, Peter M.
    Bischof, Horst
    PATTERN RECOGNITION, GCPR 2014, 2014, 8753 : 499 - 510
  • [28] Machine Learning-based Stereo Vision Algorithm for Surround View Fisheye Cameras
    Akita, Tokihiko
    Yamauchi, Yuji
    Fujiyoshi, Hironobu
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 1103 - 1108
  • [29] FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving
    Liu, Yuxuan
    Xu, Zhenhua
    Huang, Huaiyang
    Wang, Lujia
    Liu, Ming
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 21 (03) : 1 - 11
  • [30] Estimation of human motion from multiple cameras for gesture recognition
    Tominaga, Masafumi
    Hongo, Hitoshi
    Koshimizu, Hiroyasu
    Niwa, Yoshinori
    Yamamoto, Kazuhiko
    Proceedings - International Conference on Pattern Recognition, 2002, 16 (01): : 401 - 404