Structure-Aware Residual Pyramid Network for Monocular Depth Estimation

被引:0
|
作者
Chen, Xiaotian [1 ]
Chen, Xuejin [1 ]
Zha, Zheng-Jun [1 ]
机构
[1] Univ Sci & Technol China, Natl Engn Lab Brain Inspired Intelligence Technol, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monocular depth estimation is an essential task for scene understanding. The underlying structure of objects and stuff in a complex scene is critical to recovering accurate and visually-pleasing depth maps. Global structure conveys scene layouts, while local structure reflects shape details. Recently developed approaches based on convolutional neural networks (CNNs) significantly improve the performance of depth estimation. However, few of them take into account multi-scale structures in complex scenes. In this paper, we propose a Structure-Aware Residual Pyramid Network (SARPN) to exploit multi-scale structures for accurate depth prediction. We propose a Residual Pyramid Decoder (RPD) which expresses global scene structure in upper levels to represent layouts, and local structure in lower levels to present shape details. At each level, we propose Residual Refinement Modules (RRM) that predict residual maps to progressively add finer structures on the coarser structure predicted at the upper level. In order to fully exploit multi-scale image features, an Adaptive Dense Feature Fusion (ADFF) module, which adaptively fuses effective features from all scales for inferring structures of each scale, is introduced. Experiment results on the challenging NYU-Depth v2 dataset demonstrate that our proposed approach achieves state-of-the-art performance in both qualitative and quantitative evaluation. The code is available at haps://github.com/Xt-Chen/SARPN.
引用
收藏
页码:694 / 700
页数:7
相关论文
共 50 条
  • [31] Structure-Aware Slow Feature Analysis for Age Estimation
    He, Zhouzhou
    Li, Xi
    Zhang, Zhongfei
    Zhang, Yaqing
    Xiao, Jun
    Zhou, Xue
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (12) : 1702 - 1706
  • [32] Structure-Aware Cross-Modal Transformer for Depth Completion
    Zhao, Linqing
    Wei, Yi
    Li, Jiaxin
    Zhou, Jie
    Lu, Jiwen
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1016 - 1031
  • [33] Self-Supervised Monocular Depth Estimation by Direction-aware Cumulative Convolution Network
    Han, Wencheng
    Yin, Junbo
    Shen, Jianbing
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8579 - 8589
  • [34] DTTNet: Depth Transverse Transformer Network for Monocular Depth Estimation
    Kamath, Shreyas K. M.
    Rajeev, Srijith
    Panetta, Karen
    Agaian, Sos S.
    [J]. MULTIMODAL IMAGE EXPLOITATION AND LEARNING 2022, 2022, 12100
  • [35] Distortion-Aware Monocular Depth Estimation for Omnidirectional Images
    Chen, Hong-Xiang
    Li, Kunhong
    Fu, Zhiheng
    Li, Mengyi
    Chen, Zonghao
    Guo, Yulan
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 (28) : 334 - 338
  • [36] Edge-Aware Monocular Dense Depth Estimation with Morphology
    Li, Zhi
    Zhu, Xiaoyang
    Yu, Haitao
    Zhang, Qi
    Jiang, Yongshi
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2935 - 2942
  • [37] SALMNet: A Structure-Aware Lane Marking Detection Network
    Xu, Xuemiao
    Yu, Tianfei
    Hu, Xiaowei
    Ng, Wing W. Y.
    Heng, Pheng-Ann
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (08) : 4986 - 4997
  • [38] Relation Structure-Aware Heterogeneous Graph Neural Network
    Zhu, Shichao
    Zhou, Chuan
    Pan, Shirui
    Zhu, Xingquan
    Wang, Bin
    [J]. 2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 1534 - 1539
  • [39] Relation Structure-Aware Heterogeneous Information Network Embedding
    Lu, Yuanfu
    Shi, Chuan
    Hu, Linmei
    Liu, Zhiyuan
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4456 - 4463
  • [40] Structure-aware halftoning
    Pang, Wai-Man
    Qu, Yingge
    Wong, Tien-Tsin
    Cohen-Or, Daniel
    Heng, Pheng-Ann
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2008, 27 (03):