Online supervised attention-based recurrent depth estimation from monocular video

Cited by: 0

Authors:

Maslov D. [1]

Makarov I. [1, 2]

Affiliations:

[1] School of Data Analysis and Artificial Intelligence, HSE University, Moscow

[2] Samsung-PDMI Joint AI Center, St. Petersburg Department of Steklov Institute of Mathematics, St. Petersburg

Source:

Maslov, Dmitrii (dvmaslov@edu.hse.ru) | PeerJ Inc. / 2020 / Issue 06

Keywords:

Augmented Reality; Autonomous Vehicles; Computer Science Methods; Computer Vision; Deep Convolutional Neural Networks; Depth Reconstruction; Recurrent Neural Networks;

DOI:

10.7717/PEERJ-CS.317

Abstract:

Autonomous driving depends heavily on depth information for safe operation. Recently, major advances have been made in both supervised and self-supervised methods for depth reconstruction. However, most current approaches focus on single-frame depth estimation, where the quality limit is hard to beat due to the limitations of supervised learning of deep neural networks in general. One way to improve the quality of existing methods is to utilize temporal information from frame sequences. In this paper, we study intelligent ways of integrating a recurrent block into a common supervised depth estimation pipeline. We propose a novel method that takes advantage of the convolutional gated recurrent unit (convGRU) and convolutional long short-term memory (convLSTM). We compare the use of convGRU and convLSTM blocks and determine the best model for the real-time depth estimation task. We carefully study the training strategy and provide new deep neural network architectures for depth estimation from monocular video that use information from past frames based on an attention mechanism. We demonstrate the efficiency of exploiting temporal information by comparing our best recurrent method with existing image-based and video-based solutions for monocular depth reconstruction. © 2020. Maslov and Makarov. All Rights Reserved.


Pages: 1 / 22

Page count: 21

50 records in total

[1] Online supervised attention-based recurrent depth estimation from monocular video
Maslov, Dmitrii
Makarov, Ilya
PEERJ COMPUTER SCIENCE, 2020
[2] ATTENTION-BASED SELF-SUPERVISED LEARNING MONOCULAR DEPTH ESTIMATION WITH EDGE REFINEMENT
Jiang, Chenweinan
Liu, Haichun
Li, Lanzhen
Pan, Changchun
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3218 - 3222
[3] Attention-Based Grasp Detection With Monocular Depth Estimation
Xuan Tan, Phan
Hoang, Dinh-Cuong
Nguyen, Anh-Nhat
Nguyen, Van-Thiep
Vu, Van-Duc
Nguyen, Thu-Uyen
Hoang, Ngoc-Anh
Phan, Khanh-Toan
Tran, Duc-Thanh
Vu, Duy-Quang
Ngo, Phuc-Quan
Duong, Quang-Tri
Ho, Ngoc-Trung
Tran, Cong-Trinh
Duong, Van-Hiep
Mai, Anh-Truong
IEEE ACCESS, 2024, 12 : 65041 - 65057
[4] Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation
Yan, Jiaxing
Zhao, Hong
Bu, Penghui
Jin, YuSheng
2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 464 - 473
[5] Attention-based context aggregation network for monocular depth estimation
Chen, Yuru
Zhao, Haitao
Hu, Zhengwei
Peng, Jingchao
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (06) : 1583 - 1596
[6] Attention-based context aggregation network for monocular depth estimation
Yuru Chen
Haitao Zhao
Zhengwei Hu
Jingchao Peng
International Journal of Machine Learning and Cybernetics, 2021, 12 : 1583 - 1596
[7] Attention-Based Dense Decoding Network for Monocular Depth Estimation
Wang, Jianrong
Zhang, Ge
Yu, Mei
Xu, Tianyi
Luo, Tao
IEEE ACCESS, 2020, 8 (08) : 85802 - 85812
[8] Self-Supervised Monocular Depth Estimation Based on Channel Attention
Tao, Bo
Chen, Xinbo
Tong, Xiliang
Jiang, Du
Chen, Baojia
PHOTONICS, 2022, 9 (06)
[9] MLDA-Net: Multi-Level Dual Attention-Based Network for Self-Supervised Monocular Depth Estimation
Song, Xibin
Li, Wei
Zhou, Dingfu
Dai, Yuchao
Fang, Jin
Li, Hongdong
Zhang, Liangjun
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4691 - 4705
[10] Self-Supervised Monocular Depth Estimation With Frequency-Based Recurrent Refinement
Li, Rui
Xue, Danna
Zhu, Yu
Wu, Hao
Sun, Jinqiu
Zhang, Yanning
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5626 - 5637
