FastMDE: A Fast CNN Architecture for Monocular Depth Estimation at High Resolution

被引:6
|
作者
Thien-Thanh Dao [1 ]
Quoc-Viet Pham [2 ]
Hwang, Won-Joo [3 ]
机构
[1] Pusan Natl Univ, Dept Comp Engn, Yangsan Si 50612, South Korea
[2] Pusan Natl Univ, Korean Southeast Ctr 4th Ind Revolut Leader Educ, Busan 46241, South Korea
[3] Pusan Natl Univ, Dept Biomed Convergence Engn, Yangsan Si 50612, South Korea
基金
新加坡国家研究基金会;
关键词
Estimation; Semantics; Decoding; Convolutional neural networks; Computer architecture; Computational modeling; Unsupervised learning; Efficient CNN; deep neural network; depth map; supervised learning; self-supervised learning; IMAGE; NETWORK;
D O I
10.1109/ACCESS.2022.3145969
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A depth map helps robots and autonomous vehicles (AVs) visualize the three-dimensional world to navigate and localize neighboring obstacles. However, it is difficult to develop a deep learning model that can estimate the depth map from a single image in real-time. This study proposes a fast monocular depth estimation model named FastMDE by optimizing the deep convolutional neural network according to the encoder-decoder architecture. The decoder needs to obtain partial and semantic feature maps from the encoding phase to improve the depth estimation accuracy. Therefore, we designed FastMDE with two effective strategies. The first one involved redesigning the skip connection with the features of the squeeze-excitation module to obtain partial and semantic feature maps of the encoding phase. The second strategy involved redesigning the decoder by using the fusion dense block to permit the usage of high-resolution features that were learned earlier in the network before upsampling. The proposed FastMDE model utilizes only 4.1 M parameters, which is much lesser than the parameters utilized by state-of-art models. Thus, FastDME has a higher accuracy and lower latency than previous models. This study also demonstrates that MDE can leverage deep neural networks in real-time (i.e., 30 fps) with the Linux embedded board Nvidia Jetson Xavier NX. The model can facilitate the development and applications with superior performances and easy deployment on an embedded platform.
引用
收藏
页码:16111 / 16122
页数:12
相关论文
共 50 条
  • [1] Nested DWT-Based CNN Architecture for Monocular Depth Estimation
    Paul, Sandip
    Mishra, Deepak
    Marimuthu, Senthil Kumar
    SENSORS, 2023, 23 (06)
  • [2] LightDepthNet: Lightweight CNN Architecture for Monocular Depth Estimation on Edge Devices
    Liu, Qingliang
    Zhou, Shuai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (04) : 2389 - 2393
  • [3] Fast Monocular Depth Estimation on an FPGA
    Sada, Youki
    Soga, Naoto
    Shimoda, Masayuki
    Jinguji, Akira
    Sato, Shimpei
    Nakahara, Hiroki
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 143 - 146
  • [4] Dual CNN Models for Unsupervised Monocular Depth Estimation
    Repala, Vamshi Krishna
    Dubey, Shiv Ram
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 209 - 217
  • [5] HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation
    Lyu, Xiaoyang
    Liu, Liang
    Wang, Mengmeng
    Kong, Xin
    Liu, Lina
    Liu, Yong
    Chen, Xinxin
    Yuan, Yi
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2294 - 2301
  • [6] Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation
    Zhang, Ning
    Nex, Francesco
    Vosselman, George
    Kerle, Norman
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18537 - 18546
  • [7] CNN Approach for Monocular Depth Estimation: Ear Case Study
    Magherini, Roberto
    Servi, Michaela
    Mussi, Elisa
    Furferi, Rocco
    Buonamici, Francesco
    Volpe, Yary
    DESIGN TOOLS AND METHODS IN INDUSTRIAL ENGINEERING II, ADM 2021, 2022, : 220 - 228
  • [8] 360MonoDepth: High-Resolution 360° Monocular Depth Estimation
    Rey-Area, Manuel
    Yuan, Mingze
    Richardt, Christian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3752 - 3762
  • [9] FastDepth: Fast Monocular Depth Estimation on Embedded Systems
    Wolk, Diana
    Ma, Fangchang
    Yang, Tien-Lu
    Karaman, Sertac
    Sze, Vivienne
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 6101 - 6108
  • [10] A NOVEL LIGHTWEIGHT NETWORK FOR FAST MONOCULAR DEPTH ESTIMATION
    Heydrich, Tim
    Yang, Yimin
    Ma, Xiangyu
    Liu, Yu
    Du, Shan
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2260 - 2264