Online supervised attention-based recurrent depth estimation from monocular video

被引:0
|
作者
Maslov D. [1 ]
Makarov I. [1 ,2 ]
机构
[1] School of Data Analysis and Artificial Intelligence, HSE University, Moscow
[2] Samsung-PDMI Joint AI Center, St. Petersburg Department of Steklov Institute of Mathematics, St. Petersburg
来源
Maslov, Dmitrii (dvmaslov@edu.hse.ru) | 1600年 / PeerJ Inc.卷 / 06期
关键词
Augmented Reality; Autonomous Vehicles; Computer Science Methods; Computer Vision; Deep Convolutional Neural Networks; Depth Reconstruction; Recurrent Neural Networks;
D O I
10.7717/PEERJ-CS.317
中图分类号
学科分类号
摘要
Autonomous driving highly depends on depth information for safe driving. Recently, major improvements have been taken towards improving both supervised and self-supervised methods for depth reconstruction. However, most of the current approaches focus on single frame depth estimation, where quality limit is hard to beat due to limitations of supervised learning of deep neural networks in general. One of the way to improve quality of existing methods is to utilize temporal information from frame sequences. In this paper, we study intelligent ways of integrating recurrent block in common supervised depth estimation pipeline. We propose a novel method, which takes advantage of the convolutional gated recurrent unit (convGRU) and convolutional long short-term memory (convLSTM). We compare use of convGRU and convLSTM blocks and determine the best model for real-time depth estimation task. We carefully study training strategy and provide new deep neural networks architectures for the task of depth estimation from monocular video using information from past frames based on attention mechanism. We demonstrate the efficiency of exploiting temporal information by comparing our best recurrent method with existing image-based and video-based solutions for monocular depth reconstruction. © 2020. Maslov and Makarov. All Rights Reserved.
引用
收藏
页码:1 / 22
页数:21
相关论文
共 50 条
  • [1] Online supervised attention-based recurrent depth estimation from monocular video
    Maslov, Dmitrii
    Makarov, Ilya
    PEERJ COMPUTER SCIENCE, 2020,
  • [2] ATTENTION-BASED SELF-SUPERVISED LEARNING MONOCULAR DEPTH ESTIMATION WITH EDGE REFINEMENT
    Jiang, Chenweinan
    Liu, Haichun
    Li, Lanzhen
    Pan, Changchun
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3218 - 3222
  • [3] Attention-Based Grasp Detection With Monocular Depth Estimation
    Xuan Tan, Phan
    Hoang, Dinh-Cuong
    Nguyen, Anh-Nhat
    Nguyen, Van-Thiep
    Vu, Van-Duc
    Nguyen, Thu-Uyen
    Hoang, Ngoc-Anh
    Phan, Khanh-Toan
    Tran, Duc-Thanh
    Vu, Duy-Quang
    Ngo, Phuc-Quan
    Duong, Quang-Tri
    Ho, Ngoc-Trung
    Tran, Cong-Trinh
    Duong, Van-Hiep
    Mai, Anh-Truong
    IEEE ACCESS, 2024, 12 : 65041 - 65057
  • [4] Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation
    Yan, Jiaxing
    Zhao, Hong
    Bu, Penghui
    Jin, YuSheng
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 464 - 473
  • [5] Attention-based context aggregation network for monocular depth estimation
    Chen, Yuru
    Zhao, Haitao
    Hu, Zhengwei
    Peng, Jingchao
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (06) : 1583 - 1596
  • [6] Attention-based context aggregation network for monocular depth estimation
    Yuru Chen
    Haitao Zhao
    Zhengwei Hu
    Jingchao Peng
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 1583 - 1596
  • [7] Attention-Based Dense Decoding Network for Monocular Depth Estimation
    Wang, Jianrong
    Zhang, Ge
    Yu, Mei
    Xu, Tianyi
    Luo, Tao
    IEEE ACCESS, 2020, 8 (08): : 85802 - 85812
  • [8] Self-Supervised Monocular Depth Estimation Based on Channel Attention
    Tao, Bo
    Chen, Xinbo
    Tong, Xiliang
    Jiang, Du
    Chen, Baojia
    PHOTONICS, 2022, 9 (06)
  • [9] MLDA-Net: Multi-Level Dual Attention-Based Network for Self-Supervised Monocular Depth Estimation
    Song, Xibin
    Li, Wei
    Zhou, Dingfu
    Dai, Yuchao
    Fang, Jin
    Li, Hongdong
    Zhang, Liangjun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4691 - 4705
  • [10] Self-Supervised Monocular Depth Estimation With Frequency-Based Recurrent Refinement
    Li, Rui
    Xue, Danna
    Zhu, Yu
    Wu, Hao
    Sun, Jinqiu
    Zhang, Yanning
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5626 - 5637