Structure-Aware Residual Pyramid Network for Monocular Depth Estimation

被引:0
|
作者
Chen, Xiaotian [1 ]
Chen, Xuejin [1 ]
Zha, Zheng-Jun [1 ]
机构
[1] Univ Sci & Technol China, Natl Engn Lab Brain Inspired Intelligence Technol, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monocular depth estimation is an essential task for scene understanding. The underlying structure of objects and stuff in a complex scene is critical to recovering accurate and visually-pleasing depth maps. Global structure conveys scene layouts, while local structure reflects shape details. Recently developed approaches based on convolutional neural networks (CNNs) significantly improve the performance of depth estimation. However, few of them take into account multi-scale structures in complex scenes. In this paper, we propose a Structure-Aware Residual Pyramid Network (SARPN) to exploit multi-scale structures for accurate depth prediction. We propose a Residual Pyramid Decoder (RPD) which expresses global scene structure in upper levels to represent layouts, and local structure in lower levels to present shape details. At each level, we propose Residual Refinement Modules (RRM) that predict residual maps to progressively add finer structures on the coarser structure predicted at the upper level. In order to fully exploit multi-scale image features, an Adaptive Dense Feature Fusion (ADFF) module, which adaptively fuses effective features from all scales for inferring structures of each scale, is introduced. Experiment results on the challenging NYU-Depth v2 dataset demonstrate that our proposed approach achieves state-of-the-art performance in both qualitative and quantitative evaluation. The code is available at haps://github.com/Xt-Chen/SARPN.
引用
收藏
页码:694 / 700
页数:7
相关论文
共 50 条
  • [1] Multi-scale Residual Pyramid Attention Network for Monocular Depth Estimation
    Liu, Jing
    Zhang, Xiaona
    Li, Zhaoxin
    Mao, Tianlu
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5137 - 5144
  • [2] Pyramid frequency network with spatial attention residual refinement module for monocular depth estimation
    Lu, Zhengyang
    Chen, Ying
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (02)
  • [3] Structure-aware Priority Belief Propagation for Depth Estimation
    Ju, Kuanyu
    Wang, Botao
    Xiong, Hongkai
    [J]. 2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,
  • [4] DCPNet: A Densely Connected Pyramid Network for Monocular Depth Estimation
    Lai, Zhitong
    Tian, Rui
    Wu, Zhiguo
    Ding, Nannan
    Sun, Linjian
    Wang, Yanjie
    [J]. SENSORS, 2021, 21 (20)
  • [5] Structure-aware dehazing of sewer inspection images based on monocular depth cues
    Xia, Zixia
    Guo, Shuai
    Sun, Di
    Lv, Yaozhi
    Li, Honglie
    Pan, Gang
    [J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2023, 38 (06) : 762 - 778
  • [6] Promoting Monocular Depth Estimation by Multi-Scale Residual Laplacian Pyramid Fusion
    Zhang, Anmei
    Ma, Yunchao
    Liu, Jiangyu
    Sun, Jian
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 205 - 209
  • [7] Depth Estimation of Monocular Road Images Based on Pyramid Scene Analysis Network
    Zhou Wujie
    Pan Ting
    Gu Pengli
    Zhai Zhinian
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (10) : 2509 - 2515
  • [8] A geometry-aware deep network for depth estimation in monocular endoscopy
    Yang, Yongming
    Shao, Shuwei
    Yang, Tao
    Wang, Peng
    Yang, Zhuo
    Wu, Chengdong
    Liu, Hao
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [9] CASCADED DETAIL-AWARE NETWORK FOR UNSUPERVISED MONOCULAR DEPTH ESTIMATION
    Ye, Xinchen
    Zhang, Mingliang
    Fan, Xin
    Xu, Rui
    Pu, Juncheng
    Yan, Ruoke
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [10] SA-DPNet: Structure-aware dual pyramid network for salient object detection
    Xu, Xuemiao
    Chen, Jiaxing
    Zhang, Huaidong
    Han, Guoqiang
    [J]. PATTERN RECOGNITION, 2022, 127