Structure-Aware Residual Pyramid Network for Monocular Depth Estimation

被引:0
|
作者
Chen, Xiaotian [1 ]
Chen, Xuejin [1 ]
Zha, Zheng-Jun [1 ]
机构
[1] Univ Sci & Technol China, Natl Engn Lab Brain Inspired Intelligence Technol, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monocular depth estimation is an essential task for scene understanding. The underlying structure of objects and stuff in a complex scene is critical to recovering accurate and visually-pleasing depth maps. Global structure conveys scene layouts, while local structure reflects shape details. Recently developed approaches based on convolutional neural networks (CNNs) significantly improve the performance of depth estimation. However, few of them take into account multi-scale structures in complex scenes. In this paper, we propose a Structure-Aware Residual Pyramid Network (SARPN) to exploit multi-scale structures for accurate depth prediction. We propose a Residual Pyramid Decoder (RPD) which expresses global scene structure in upper levels to represent layouts, and local structure in lower levels to present shape details. At each level, we propose Residual Refinement Modules (RRM) that predict residual maps to progressively add finer structures on the coarser structure predicted at the upper level. In order to fully exploit multi-scale image features, an Adaptive Dense Feature Fusion (ADFF) module, which adaptively fuses effective features from all scales for inferring structures of each scale, is introduced. Experiment results on the challenging NYU-Depth v2 dataset demonstrate that our proposed approach achieves state-of-the-art performance in both qualitative and quantitative evaluation. The code is available at haps://github.com/Xt-Chen/SARPN.
引用
收藏
页码:694 / 700
页数:7
相关论文
共 50 条
  • [21] An Articulated Structure-aware Network for 3D Human Pose Estimation
    Tang, Zhenhua
    Zhang, Xiaoyan
    Hou, Junhui
    [J]. ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 48 - 63
  • [22] RADepthNet:Reflectance-aware monocular depth estimation
    Chuxuan LI
    Ran YI
    Saba Ghazanfar ALI
    Lizhuang MA
    Enhua WU
    Jihong WANG
    Lijuan MAO
    Bin SHENG
    [J]. 虚拟现实与智能硬件(中英文), 2022, 4 (05) : 418 - 431
  • [23] LA-Net: Layout-Aware Dense Network for Monocular Depth Estimation
    Zheng, Kecheng
    Zha, Zheng-Jun
    Cao, Yang
    Chen, Xuejin
    Wu, Feng
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1381 - 1388
  • [24] RADepthNet: Reflectance-Aware Monocular Depth Estimation
    Li, Chuxuan
    Yi, Ran
    Ali, Saba Ghazanfar
    Ma, Lizhuang
    Wu, Enhua
    Wang, Jihong
    Mao, Lijuan
    Sheng, Bin
    [J]. Virtual Reality and Intelligent Hardware, 2022, 4 (05): : 418 - 431
  • [25] Structure-aware attributed heterogeneous network embedding
    Hao Wei
    Gang Xiong
    Qiang Wei
    Weiquan Cao
    Xin Li
    [J]. Knowledge and Information Systems, 2023, 65 : 1769 - 1785
  • [26] SANet: Structure-Aware Network for Visual Tracking
    Fan, Heng
    Ling, Haibin
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 2217 - 2224
  • [27] Structure-aware attributed heterogeneous network embedding
    Wei, Hao
    Xiong, Gang
    Wei, Qiang
    Cao, Weiquan
    Li, Xin
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (04) : 1769 - 1785
  • [28] Depth Estimation From Light Field Using Graph-Based Structure-Aware Analysis
    Zhang, Yuchen
    Dai, Wenrui
    Xu, Mingxing
    Zou, Junni
    Zhang, Xiaopeng
    Xiong, Hongkai
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4269 - 4283
  • [29] Dynamic Guided Network for Monocular Depth Estimation
    Xing, Xiaoxia
    Cai, Yinghao
    Wang, Yanqing
    Lu, Tao
    Yang, Yiping
    Wen, Dayong
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5459 - 5465
  • [30] Bidirectional Attention Network for Monocular Depth Estimation
    Aich, Shubhra
    Vianney, Jean Marie Uwabeza
    Islam, Md Amirul
    Kaur, Mannat
    Liu, Bingbing
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 11746 - 11752