Monocular Depth Estimation with Adaptive Geometric Attention

Cited by: 8
Authors
Naderi, Taher [1]
Sadovnik, Amir [1]
Hayward, Jason [2]
Qi, Hairong [1]
Affiliations
[1] Univ Tennessee, Dept Elect Engn & Comp Sci, Knoxville, TN 37996 USA
[2] Univ Tennessee, Dept Nucl Engn, Knoxville, TN 37996 USA
Keywords
SEGMENTATION; NETWORK;
DOI
10.1109/WACV51458.2022.00069
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Single-image depth estimation is an ill-posed problem: the third dimension (depth) cannot be uniquely recovered from a single 2D image. Additional constraints are therefore needed to restrict the solution space. In this paper, we explore constraining the model by exploiting the similarity between the RGB image and the corresponding depth map at the geometric edges of the 3D scene, with the aim of more accurate depth estimation. We propose a general, lightweight adaptive geometric attention module that uses the cross-correlation between the encoder and the decoder as a measure of this similarity. More precisely, we use the cosine similarity between the local embedded features in the encoder and the decoder at each spatial point. The proposed module is trained end-to-end together with the encoder-decoder network and achieves performance superior to or competitive with other state-of-the-art methods. In addition, adding our module to the base encoder-decoder model increases the parameter count by only 0.03% (a fraction of 0.0003). The module can therefore be added to any base encoder-decoder network without changing its structure, whatever the task at hand.
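The similarity measure named in the abstract, cosine similarity between encoder and decoder features at each spatial point, can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the authors' implementation: the function names `cosine_similarity_map` and `gate_skip_features`, the `(C, H, W)` layout, the `eps` guard, and the choice to multiply the encoder features by the similarity map are all assumptions made for this example. The abstract does not specify how the similarity is turned into attention or which parameters are learned.

```python
import numpy as np


def cosine_similarity_map(enc, dec, eps=1e-8):
    """Per-pixel cosine similarity between two (C, H, W) feature maps.

    Hypothetical helper: the channel vector at each spatial location of
    `enc` is compared with the channel vector at the same location of `dec`.
    Returns an (H, W) array with values in [-1, 1].
    """
    if enc.shape != dec.shape:
        raise ValueError("encoder and decoder features must share a shape")
    # Dot product over the channel axis, one value per spatial location.
    dot = np.sum(enc * dec, axis=0)
    # Product of the two channel-vector norms; eps avoids division by zero.
    norms = np.linalg.norm(enc, axis=0) * np.linalg.norm(dec, axis=0)
    return dot / np.maximum(norms, eps)


def gate_skip_features(enc, dec):
    """Assumed usage: scale encoder skip features by the similarity map."""
    sim = cosine_similarity_map(enc, dec)
    # Broadcast the (H, W) map across all C channels.
    return enc * sim[None, :, :]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    enc = rng.normal(size=(16, 8, 8))
    dec = rng.normal(size=(16, 8, 8))
    print(cosine_similarity_map(enc, dec).shape)
    print(gate_skip_features(enc, dec).shape)
```

The similarity map itself has no trainable weights, which is at least consistent with the abstract's claim of a very small parameter overhead. Any learned, adaptive component of the actual module is not represented here.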
Pages: 617-627
Page count: 11