Efficient unsupervised monocular depth estimation using attention guided generative adversarial network

被引:4
|
作者
Bhattacharyya, Sumanta [1 ]
Shen, Ju [3 ]
Welch, Stephen [1 ]
Chen, Chen [2 ]
机构
[1] Univ N Carolina, Charlotte, NC 28223 USA
[2] Univ N Carolina, Dept Elect & Comp Engn, Charlotte, NC 28223 USA
[3] Univ Dayton, Comp Sci, 300 Coll Pk, Dayton, OH 45469 USA
基金
美国国家科学基金会;
关键词
Attention; Efficient GAN; Unsupervised depth estimation; Convolution factorization;
D O I
10.1007/s11554-021-01092-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep-learning-based approaches to depth estimation are rapidly advancing, offering better performance over traditional computer vision approaches across many domains. However, for many critical applications, cutting-edge deep-learning based approaches require too much computational overhead to be operationally feasible. This is especially true for depth-estimation methods that leverage adversarial learning, such as Generative Adversarial Networks (GANs). In this paper, we propose a computationally efficient GAN for unsupervised monocular depth estimation using factorized convolutions and an attention mechanism. Specifically, we leverage the Extremely Efficient Spatial Pyramid of Depth-wise Dilated Separable Convolutions (EESP) module of ESPNetv2 inside the network, leading to a total reduction of 22.8%, 35.37%, and 31.5% in the number of model parameters, FLOPs, and inference time respectively, as compared to the previous unsupervised GAN approach. Finally, we propose a context-aware attention architecture to generate detail-oriented depth images. We demonstrate superior performance of our proposed model on two benchmark datasets KITTI and Cityscapes. We have also provided more qualitative examples (Fig. 8) at the end of this paper.
引用
收藏
页码:1357 / 1368
页数:12
相关论文
共 50 条
  • [21] DIFA-GAN: Differential Attention-Guided Generative Adversarial Network for Unsupervised Video Forecasting
    Jin, Beibei
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1795 - 1799
  • [22] Unsupervised Monocular Estimation of Depth and Visual Odometry Using Attention and Depth-Pose Consistency Loss
    Song, Xiaogang
    Hu, Haoyue
    Liang, Li
    Shi, Weiwei
    Xie, Guo
    Lu, Xiaofeng
    Hei, Xinhong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3517 - 3529
  • [23] STRUCTURE GENERATION AND GUIDANCE NETWORK FOR UNSUPERVISED MONOCULAR DEPTH ESTIMATION
    Wang, Chaoqun
    Chen, Xuejin
    Min, Shaobo
    Wu, Feng
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1264 - 1269
  • [24] Efficient Unsupervised Monocular Depth Estimation with Inter-Frame Depth Interpolation
    Zhang, Min
    Li, Jianhua
    [J]. IMAGE AND GRAPHICS (ICIG 2021), PT III, 2021, 12890 : 729 - 741
  • [25] Lightweight Monocular Depth Estimation with an Edge Guided Network
    Dong, Xingshuai
    Garratt, Matthew A.
    Anavatti, Sreenatha G.
    Abbass, Hussein A.
    Dong, Junyu
    [J]. 2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 204 - 210
  • [26] Spatial Correspondence with Generative Adversarial Network: Learning Depth from Monocular Videos
    Wu, Zhenyao
    Wu, Xinyi
    Zhang, Xiaoping
    Wang, Song
    Ju, Lili
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7493 - 7503
  • [27] DerainAttentionGAN: unsupervised single-image deraining using attention-guided generative adversarial networks
    Guo, ZhaoKang
    Hou, Mingzheng
    Sima, Mingjun
    Feng, ZiLiang
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (01) : 185 - 192
  • [28] DerainAttentionGAN: unsupervised single-image deraining using attention-guided generative adversarial networks
    ZhaoKang Guo
    Mingzheng Hou
    Mingjun Sima
    ZiLiang Feng
    [J]. Signal, Image and Video Processing, 2022, 16 : 185 - 192
  • [29] An Efficient Monocular Depth Prediction Network Using Coordinate Attention and Feature Fusion
    Xu, Huihui
    Li, Fei
    [J]. JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2022, 18 (06): : 794 - 802
  • [30] Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation
    Tang, Hao
    Xu, Dan
    Sebel, Nicu
    Yan, Yan
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,