Online supervised attention-based recurrent depth estimation from monocular video

被引:0
|
作者
Maslov D. [1 ]
Makarov I. [1 ,2 ]
机构
[1] School of Data Analysis and Artificial Intelligence, HSE University, Moscow
[2] Samsung-PDMI Joint AI Center, St. Petersburg Department of Steklov Institute of Mathematics, St. Petersburg
来源
Maslov, Dmitrii (dvmaslov@edu.hse.ru) | 1600年 / PeerJ Inc.卷 / 06期
关键词
Augmented Reality; Autonomous Vehicles; Computer Science Methods; Computer Vision; Deep Convolutional Neural Networks; Depth Reconstruction; Recurrent Neural Networks;
D O I
10.7717/PEERJ-CS.317
中图分类号
学科分类号
摘要
Autonomous driving highly depends on depth information for safe driving. Recently, major improvements have been taken towards improving both supervised and self-supervised methods for depth reconstruction. However, most of the current approaches focus on single frame depth estimation, where quality limit is hard to beat due to limitations of supervised learning of deep neural networks in general. One of the way to improve quality of existing methods is to utilize temporal information from frame sequences. In this paper, we study intelligent ways of integrating recurrent block in common supervised depth estimation pipeline. We propose a novel method, which takes advantage of the convolutional gated recurrent unit (convGRU) and convolutional long short-term memory (convLSTM). We compare use of convGRU and convLSTM blocks and determine the best model for real-time depth estimation task. We carefully study training strategy and provide new deep neural networks architectures for the task of depth estimation from monocular video using information from past frames based on attention mechanism. We demonstrate the efficiency of exploiting temporal information by comparing our best recurrent method with existing image-based and video-based solutions for monocular depth reconstruction. © 2020. Maslov and Makarov. All Rights Reserved.
引用
收藏
页码:1 / 22
页数:21
相关论文
共 50 条
  • [41] Recurrent Neural Network for (Un-)supervised Learning of Monocular Video Visual Odometry and Depth
    Wang, Rui
    Pizer, Stephen M.
    Frahm, Jan-Michael
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5647 - 5656
  • [42] Semi-Supervised Monocular Depth Estimation Based on Semantic Supervision
    Yue, Min
    Fu, Guangyuan
    Wu, Ming
    Wang, Hongqiao
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 100 (02) : 455 - 463
  • [43] Semi-Supervised Monocular Depth Estimation Based on Semantic Supervision
    Min Yue
    Guangyuan Fu
    Ming Wu
    Hongqiao Wang
    Journal of Intelligent & Robotic Systems, 2020, 100 : 455 - 463
  • [44] Realtime depth estimation and obstacle detection from monocular video
    Wedel, Andreas
    Franke, Uwe
    Klappstein, Jens
    Brox, Thomas
    Cremers, Daniel
    PATTERN RECOGNITION, PROCEEDINGS, 2006, 4174 : 475 - 484
  • [45] ADU-Depth: Attention-based Distillation with Uncertainty Modeling for Depth Estimation
    Wu, Zizhang
    Li, Zhuozheng
    Fan, Zhi-Gang
    Wu, Yunzhe
    Wang, Xiaoquan
    Tang, Rui
    Pu, Jian
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [46] Self-Supervised Human Depth Estimation from Monocular Videos
    Tan, Feitong
    Zhu, Hao
    Cui, Zhaopeng
    Zhu, Siyu
    Pollefeys, Marc
    Tan, Ping
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 647 - 656
  • [47] Visual Attention-Based Self-Supervised Absolute Depth Estimation Using Geometric Priors in Autonomous Driving
    Xiang, Jie
    Wang, Yun
    An, Lifeng
    Liu, Haiyang
    Wang, Zijun
    Liu, Jian
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 11998 - 12005
  • [48] EdgeConv with Attention Module for Monocular Depth Estimation
    Lee, Minhyeok
    Hwang, Sangwon
    Park, Chaewon
    Lee, Sangyoun
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2364 - 2373
  • [49] MonoDVPS: A Self-Supervised Monocular Depth Estimation Approach to Depth-aware Video Panoptic Segmentation
    Petrovai, Andra
    Nedevschi, Sergiu
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3076 - 3085
  • [50] Bidirectional Attention Network for Monocular Depth Estimation
    Aich, Shubhra
    Vianney, Jean Marie Uwabeza
    Islam, Md Amirul
    Kaur, Mannat
    Liu, Bingbing
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 11746 - 11752