Efficient unsupervised monocular depth estimation using attention guided generative adversarial network

被引:4
|
作者
Bhattacharyya, Sumanta [1 ]
Shen, Ju [3 ]
Welch, Stephen [1 ]
Chen, Chen [2 ]
机构
[1] Univ N Carolina, Charlotte, NC 28223 USA
[2] Univ N Carolina, Dept Elect & Comp Engn, Charlotte, NC 28223 USA
[3] Univ Dayton, Comp Sci, 300 Coll Pk, Dayton, OH 45469 USA
基金
美国国家科学基金会;
关键词
Attention; Efficient GAN; Unsupervised depth estimation; Convolution factorization;
D O I
10.1007/s11554-021-01092-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep-learning-based approaches to depth estimation are rapidly advancing, offering better performance over traditional computer vision approaches across many domains. However, for many critical applications, cutting-edge deep-learning based approaches require too much computational overhead to be operationally feasible. This is especially true for depth-estimation methods that leverage adversarial learning, such as Generative Adversarial Networks (GANs). In this paper, we propose a computationally efficient GAN for unsupervised monocular depth estimation using factorized convolutions and an attention mechanism. Specifically, we leverage the Extremely Efficient Spatial Pyramid of Depth-wise Dilated Separable Convolutions (EESP) module of ESPNetv2 inside the network, leading to a total reduction of 22.8%, 35.37%, and 31.5% in the number of model parameters, FLOPs, and inference time respectively, as compared to the previous unsupervised GAN approach. Finally, we propose a context-aware attention architecture to generate detail-oriented depth images. We demonstrate superior performance of our proposed model on two benchmark datasets KITTI and Cityscapes. We have also provided more qualitative examples (Fig. 8) at the end of this paper.
引用
收藏
页码:1357 / 1368
页数:12
相关论文
共 50 条
  • [1] Efficient unsupervised monocular depth estimation using attention guided generative adversarial network
    Sumanta Bhattacharyya
    Ju Shen
    Stephen Welch
    Chen Chen
    [J]. Journal of Real-Time Image Processing, 2021, 18 : 1357 - 1368
  • [2] Unsupervised Monocular Depth Estimation and Visual Odometry Based on Generative Adversarial Network and Self-attention Mechanism
    Ye, Xingyu
    He, Yuanlie
    Ru, Shaonan
    [J]. Jiqiren/Robot, 2021, 43 (02): : 203 - 213
  • [3] Structured Coupled Generative Adversarial Networks for Unsupervised Monocular Depth Estimation
    Puscas, Mihai Marian
    Xu, Dan
    Pilzer, Andrea
    Sebe, Niculae
    [J]. 2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 18 - 26
  • [4] Generative Adversarial Networks for Unsupervised Monocular Depth Prediction
    Aleotti, Filippo
    Tosi, Fabio
    Poggi, Matteo
    Mattoccia, Stefano
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 337 - 354
  • [5] GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks
    Almalioglu, Yasin
    Saputra, Muhamad Risqi U.
    de Gusmao, Pedro P. B.
    Markham, Andrew
    Trigoni, Niki
    [J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 5474 - 5480
  • [6] Unsupervised Adversarial Depth Estimation using Cycled Generative Networks
    Pilzer, Andrea
    Xu, Dan
    Puscas, Mihai Marian
    Ricci, Elisa
    Sebe, Nicu
    [J]. 2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 587 - 595
  • [7] Monocular Image Depth Estimation Using a Conditional Generative Adversarial Net
    Zhang, Xiaofeng
    Chen, Shuo
    Xu, Qingyang
    Zhang, Xiaoxue
    [J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9176 - 9180
  • [8] Structured Adversarial Training for Unsupervised Monocular Depth Estimation
    Mehta, Ishit
    Sakurikar, Parikshit
    Narayanan, P. J.
    [J]. 2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 314 - 323
  • [9] Unsupervised Monocular Depth Estimation With Channel and Spatial Attention
    Wang, Zhuping
    Dai, Xinke
    Guo, Zhanyu
    Huang, Chao
    Zhang, Hao
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 7860 - 7870
  • [10] Monocular Depth Prediction using Generative Adversarial Networks
    Kumar, Arun C. S.
    Bhandarkar, Suchendra M.
    Prasad, Mukta
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 413 - 421