A Dual Encoder-Decoder Network for Self-Supervised Monocular Depth Estimation

被引:1
|
作者
Zheng, Mingkui [1 ,2 ]
Luo, Lin [1 ]
Zheng, Haifeng [3 ]
Ye, Zhangfan [3 ]
Su, Zhe [1 ]
机构
[1] Fuzhou Univ, Sch Adv Mfg, Quanzhou 362200, Peoples R China
[2] Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350108, Peoples R China
[3] Fuzhou Univ, Coll Phys & Informat Engn, Fujian Key Lab Intelligent Proc & Wireless Transm, Quanzhou 350108, Fujian, Peoples R China
关键词
Estimation; Feature extraction; Data mining; Fuses; Decoding; Convolutional neural networks; Training; Accuracy; dual encoder-decoder; global information; monocular depth estimation; self-supervised;
D O I
10.1109/JSEN.2023.3296497
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Depth estimation from a single image is a fundamental problem in the field of computer vision. With the great success of deep learning techniques, various self-supervised monocular depth estimation methods using encoder-decoder architectures have emerged. However, most previous approaches regress the depth map directly using a single encoder-decoder structure, which may not obtain sufficient features in the image and results in a depth map with low accuracy and blurred details. To improve the accuracy of self-supervised monocular depth estimation, we propose a simple but very effective scheme for depth estimation using a dual encoder-decoder structure network. Specifically, we introduce a novel global feature extraction network (GFN) to extract global features from images. GFN includes PoolAttentionFormer and ResBlock, which work together to extract and fuse hierarchical global features into the depth estimation network (DEN). To further improve the accuracy, we design two feature fusion mechanisms, including global feature fusion and multiscale fusion. The experimental results of various dual encoder-decoder combination schemes tested on the KITTI dataset show that our proposed one is effective in improving the accuracy of self-supervised monocular depth estimation, which reached 89.6% (delta < 1.25).
引用
收藏
页码:19747 / 19756
页数:10
相关论文
共 50 条
  • [1] Self-Supervised Monocular Depth Estimation Using Hybrid Transformer Encoder
    Hwang, Seung-Jun
    Park, Sung-Jun
    Baek, Joong-Hwan
    Kim, Byungkyu
    [J]. IEEE SENSORS JOURNAL, 2022, 22 (19) : 18762 - 18770
  • [2] Encoder-decoder with densely convolutional networks for monocular depth estimation
    Chen, Songnan
    Tang, Mengxia
    Kan, Jiangming
    [J]. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2019, 36 (10) : 1709 - 1718
  • [3] Digging Into Self-Supervised Monocular Depth Estimation
    Godard, Clement
    Mac Aodha, Oisin
    Firman, Michael
    Brostow, Gabriel
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3827 - 3837
  • [4] On the uncertainty of self-supervised monocular depth estimation
    Poggi, Matteo
    Aleotti, Filippo
    Tosi, Fabio
    Mattoccia, Stefano
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3224 - 3234
  • [5] Revisiting Self-supervised Monocular Depth Estimation
    Kim, Ue-Hwan
    Lee, Gyeong-Min
    Kim, Jong-Hwan
    [J]. ROBOT INTELLIGENCE TECHNOLOGY AND APPLICATIONS 6, 2022, 429 : 336 - 350
  • [6] Self-supervised monocular depth estimation in fog
    Tao, Bo
    Hu, Jiaxin
    Jiang, Du
    Li, Gongfa
    Chen, Baojia
    Qian, Xinbo
    [J]. OPTICAL ENGINEERING, 2023, 62 (03)
  • [7] Joint Encoder-Decoder Self-Supervised Pre-training for ASR
    Arunkumar, A.
    Umesh, S.
    [J]. INTERSPEECH 2022, 2022, : 3418 - 3422
  • [8] HQDec: Self-Supervised Monocular Depth Estimation Based on a High-Quality Decoder
    Wang, Fei
    Cheng, Jun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2453 - 2468
  • [9] Shufflemono: Rethinking Lightweight Network for Self-Supervised Monocular Depth Estimation
    Feng, Yingwei
    Hong, Zhiyong
    Xiong, Liping
    Zeng, Zhiqiang
    Li, Jingmin
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2024, 14 (03) : 191 - 205
  • [10] Semantically guided self-supervised monocular depth estimation
    Lu, Xiao
    Sun, Haoran
    Wang, Xiuling
    Zhang, Zhiguo
    Wang, Haixia
    [J]. IET IMAGE PROCESSING, 2022, 16 (05) : 1293 - 1304