Efficient unsupervised monocular depth estimation using attention guided generative adversarial network

Cited by: 4

Authors:

Bhattacharyya, Sumanta [1]

Shen, Ju [3]

Welch, Stephen [1]

Chen, Chen [2]

Affiliations:

[1] Univ N Carolina, Charlotte, NC 28223 USA

[2] Univ N Carolina, Dept Elect & Comp Engn, Charlotte, NC 28223 USA

[3] Univ Dayton, Comp Sci, 300 Coll Pk, Dayton, OH 45469 USA

Source:

JOURNAL OF REAL-TIME IMAGE PROCESSING | 2021 / Vol. 18 / Issue 04

Funding:

National Science Foundation (USA);

Keywords:

Attention; Efficient GAN; Unsupervised depth estimation; Convolution factorization;

DOI:

10.1007/s11554-021-01092-0

Chinese Library Classification:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Deep-learning-based approaches to depth estimation are advancing rapidly and outperform traditional computer vision approaches across many domains. However, for many critical applications, cutting-edge deep-learning-based approaches require too much computational overhead to be operationally feasible. This is especially true for depth-estimation methods that leverage adversarial learning, such as Generative Adversarial Networks (GANs). In this paper, we propose a computationally efficient GAN for unsupervised monocular depth estimation using factorized convolutions and an attention mechanism. Specifically, we leverage the Extremely Efficient Spatial Pyramid of Depth-wise Dilated Separable Convolutions (EESP) module of ESPNetv2 inside the network, leading to a total reduction of 22.8%, 35.37%, and 31.5% in the number of model parameters, FLOPs, and inference time, respectively, compared to the previous unsupervised GAN approach. Finally, we propose a context-aware attention architecture to generate detail-oriented depth images. We demonstrate superior performance of our proposed model on two benchmark datasets, KITTI and Cityscapes. Additional qualitative examples are provided at the end of the paper (Fig. 8).
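The abstract attributes part of the efficiency gain to factorized (depth-wise separable, dilated) convolutions. As a rough illustration of why such a factorization reduces the parameter count, the following sketch compares the weight count of a standard convolution with that of a depth-wise convolution followed by a 1 x 1 point-wise convolution. The function names and the layer sizes (64 channels, 3 x 3 kernel) are assumptions chosen for illustration; the sketch does not reproduce the EESP module itself or the reductions reported in the abstract.

```python
def standard_conv_params(c_in, c_out, k):
    """Weights in a standard k x k convolution (bias omitted)."""
    return c_in * c_out * k * k


def depthwise_separable_params(c_in, c_out, k):
    """Weights in a depth-wise k x k convolution followed by a
    1 x 1 point-wise convolution (bias omitted)."""
    depthwise = c_in * k * k      # one k x k filter per input channel
    pointwise = c_in * c_out      # 1 x 1 convolution mixing the channels
    return depthwise + pointwise


def dilated_extent(k, dilation):
    """Side length of the input window covered by a dilated k x k kernel.
    Dilation widens the receptive field without adding weights."""
    return dilation * (k - 1) + 1


# Hypothetical layer sizes, chosen only for illustration.
c_in, c_out, k = 64, 64, 3
std = standard_conv_params(c_in, c_out, k)
sep = depthwise_separable_params(c_in, c_out, k)
saving = 1 - sep / std

print(f"standard: {std} weights, separable: {sep} weights")
print(f"relative saving for this single layer: {saving:.1%}")
print(f"3 x 3 kernel with dilation 2 covers {dilated_extent(k, 2)} x {dilated_extent(k, 2)} pixels")
```

The saving for a single layer in isolation is much larger than the network-level figures quoted in the abstract, which is expected: a full network also contains layers and components that are not factorized.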


Pages: 1357 - 1368

Page count: 12
