Full Surround Monodepth From Multiple Cameras

Cited by: 13
Authors
Guizilini, Vitor [1]
Vasiljevic, Igor [2]
Ambrus, Rares [1]
Shakhnarovich, Greg [2]
Gaidon, Adrien [1]
Affiliations
[1] Toyota Res Inst TRI, Los Altos, CA 95051 USA
[2] Toyota Technol Inst Chicago, Chicago, IL 60194 USA
Keywords
Computer vision; machine learning; autonomous automobiles
DOI
10.1109/LRA.2022.3150884
Chinese Library Classification number
TP24 [Robotics]
Discipline classification codes
080202; 1405
Abstract
Self-supervised monocular depth and ego-motion estimation is a promising approach to replace or supplement expensive depth sensors such as LiDAR for robotics applications like autonomous driving. However, most research in this area focuses on a single monocular camera or stereo pairs that cover only a fraction of the scene around the vehicle. In this work, we extend monocular self-supervised depth and ego-motion estimation to large-baseline multi-camera rigs. Using generalized spatio-temporal contexts, pose consistency constraints, and carefully designed photometric loss masking, we learn a single network generating dense, consistent, and scale-aware point clouds that cover the same full-surround 360-degree field of view as a typical LiDAR scanner. We also propose a new scale-consistent evaluation metric more suitable to multi-camera settings. Experiments on two challenging benchmarks illustrate the benefits of our approach over strong baselines.
Pages: 5397-5404
Number of pages: 8