Neural Contourlet Network for Monocular 360° Depth Estimation

被引:4
|
作者
Shen, Zhijie [1 ,2 ]
Lin, Chunyu [1 ,2 ]
Nie, Lang [1 ,2 ]
Liao, Kang [1 ,2 ]
Zhao, Yao [1 ,2 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Beijing Key Lab Adv Informat Sci & Network Technol, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
Estimation; Distortion; Wavelet transforms; Feature extraction; Convolutional neural networks; Task analysis; Wavelet analysis; Monocular; 360; degrees; depth sstimation; distortion; contourlet;
D O I
10.1109/TCSVT.2022.3192283
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
For a monocular 360 degrees image, depth estimation is a challenging because the distortion increases along the latitude. To perceive the distortion, existing methods devote to designing a deep and complex network architecture. In this paper, we provide a new perspective that constructs an interpretable and sparse representation for a 360 degrees image. Considering the importance of the geometric structure in depth estimation, we utilize the contourlet transform to capture an explicit geometric cue in the spectral domain and integrate it with an implicit cue in the spatial domain. Specifically, we propose a neural contourlet network consisting of a convolutional neural network and a contourlet transform branch. In the encoder stage, we design a spatial-spectral fusion module to effectively fuse two types of cues. Contrary to the encoder, we employ the inverse contourlet transform with learned low-pass subbands and band-pass directional subbands to compose the depth in the decoder. Experiments on the three popular 360 degrees panoramic image datasets demonstrate that the proposed approach outperforms the state-of-the-art schemes with faster convergence. Code is available at https://github.com/zhijieshenbjtu/Neural-Contourlet-Network-for-MODE.
引用
收藏
页码:8574 / 8585
页数:12
相关论文
共 50 条
  • [31] Depth estimation from monocular vehicle infrared images based on KPCA and BP neural network
    Sun, S. (shysun@dhu.edu.cn), 1600, Chinese Society of Astronautics (42):
  • [32] Novel Hybrid Neural Network for Dense Depth Estimation using On-Board Monocular Images
    Jia, Shaocheng
    Pei, Xin
    Yang, Zi
    Tian, Shan
    Yue, Yun
    TRANSPORTATION RESEARCH RECORD, 2020, 2674 (12) : 312 - 323
  • [33] Depth estimation for monocular image based on convolutional neural networks
    Niu B.
    Tang M.
    Chen X.
    International Journal of Circuits, Systems and Signal Processing, 2021, 15 : 533 - 540
  • [34] CNNapsule: A Lightweight Network with Fusion Features for Monocular Depth Estimation
    Wang, Yinchu
    Zhu, Haijiang
    Liu, Mengze
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 507 - 518
  • [35] AFNet: Asymmetric fusion network for monocular panorama depth estimation
    Huang, Chengchao
    Shao, Feng
    Chen, Hangwei
    Mu, Baoyang
    Jiang, Qiuping
    DISPLAYS, 2024, 84
  • [36] STRUCTURE GENERATION AND GUIDANCE NETWORK FOR UNSUPERVISED MONOCULAR DEPTH ESTIMATION
    Wang, Chaoqun
    Chen, Xuejin
    Min, Shaobo
    Wu, Feng
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1264 - 1269
  • [37] DepthNet: A Recurrent Neural Network Architecture for Monocular Depth Prediction
    Kumar, Arun C. S.
    Bhandarkar, Suchendra M.
    Prasad, Mukta
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 396 - 404
  • [38] LW-Net: A Lightweight Network for Monocular Depth Estimation
    Feng, Cheng
    Zhang, Congxuan
    Chen, Zhen
    Li, Ming
    Chen, Hao
    Fan, Bingbing
    IEEE ACCESS, 2020, 8 : 196287 - 196298
  • [39] DENAO: Monocular Depth Estimation Network With Auxiliary Optical Flow
    Chen, Jingyu
    Yang, Xin
    Jia, Qizeng
    Liao, Chunyuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (08) : 2598 - 2610
  • [40] A contextual conditional random field network for monocular depth estimation
    Liu, Jun
    Li, Qing
    Cao, Rui
    Tang, Wenming
    Qiu, Guoping
    IMAGE AND VISION COMPUTING, 2020, 98