Pyramid frequency network with spatial attention residual refinement module for monocular depth estimation

被引:9
|
作者
Lu, Zhengyang [1 ]
Chen, Ying [1 ]
机构
[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
monocular depth estimation; three-dimensional reconstruction; frequency domain; convolutional neural network;
D O I
10.1117/1.JEI.31.2.023005
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep-learning-based approaches to depth estimation are rapidly advancing, offering superior performance over existing methods. To estimate the depth in real-world scenarios, depth estimation models require the robustness of various noise environments. We propose a pyramid frequency network (PFN) with spatial attention residual refinement module (SARRM) to deal with the weak robustness of existing deep-learning methods. To reconstruct depth maps with accurate details, the SARRM constructs a residual fusion method with an attention mechanism to refine the blur depth. The frequency division strategy is designed, and the frequency pyramid network is developed to extract features from multiple frequency bands. With the frequency strategy, PFN achieves better visual accuracy than state-of-the-art methods in both indoor and outdoor scenes on Make3D, KITTI depth, and NYUv2 datasets. Additional experiments on the noisy NYUv2 dataset demonstrate that PFN is more reliable than existing deep-learning methods in high noise scenes. (C) 2022 SPIE and IS&T
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Attention Mechanism Used in Monocular Depth Estimation: An Overview
    Li, Yundong
    Wei, Xiaokun
    Fan, Hanlu
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (17):
  • [42] Residual-Shuffle Network with Spatial Pyramid Pooling Module for COVID-19 Screening
    Zulkifley, Mohd Asyraf
    Abdani, Siti Raihanah
    Zulkifley, Nuraisyah Hani
    Shahrimin, Mohamad Ibrani
    [J]. DIAGNOSTICS, 2021, 11 (08)
  • [43] ADAPTIVE WEIGHTED NETWORK WITH EDGE ENHANCEMENT MODULE FOR MONOCULAR SELF-SUPERVISED DEPTH ESTIMATION
    Liu, Hong
    Zhu, Ying
    Hua, Guoliang
    Huang, Weibo
    Ding, Runwei
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2340 - 2344
  • [44] Multi-Scale Spatial Attention-Guided Monocular Depth Estimation With Semantic Enhancement
    Xu, Xianfa
    Chen, Zhe
    Yin, Fuliang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 8811 - 8822
  • [45] Dynamic Guided Network for Monocular Depth Estimation
    Xing, Xiaoxia
    Cai, Yinghao
    Wang, Yanqing
    Lu, Tao
    Yang, Yiping
    Wen, Dayong
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5459 - 5465
  • [46] IterDepth: Iterative Residual Refinement for Outdoor Self-Supervised Multi-Frame Monocular Depth Estimation
    Feng, Cheng
    Chen, Zhen
    Zhang, Congxuan
    Hu, Weiming
    Li, Bing
    Lu, Feng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 329 - 341
  • [47] DTTNet: Depth Transverse Transformer Network for Monocular Depth Estimation
    Kamath, Shreyas K. M.
    Rajeev, Srijith
    Panetta, Karen
    Agaian, Sos S.
    [J]. MULTIMODAL IMAGE EXPLOITATION AND LEARNING 2022, 2022, 12100
  • [48] Attention based multilayer feature fusion convolutional neural network for unsupervised monocular depth estimation
    Lei, Zeyu
    Wang, Yan
    Li, Zijian
    Yang, Junyao
    [J]. NEUROCOMPUTING, 2021, 423 : 343 - 352
  • [49] Single-Stage Refinement CNN for Depth Estimation in Monocular Images
    Valdez Rodriguez, Jose E.
    Calvo, Hiram
    Felipe Riveron, Edgardo M.
    [J]. COMPUTACION Y SISTEMAS, 2020, 24 (02): : 439 - 451
  • [50] Monocular depth estimation with multi-view attention autoencoder
    Geunho Jung
    Sang Min Yoon
    [J]. Multimedia Tools and Applications, 2022, 81 : 33759 - 33770