FastMDE: A Fast CNN Architecture for Monocular Depth Estimation at High Resolution

被引：6

作者：

Thien-Thanh Dao ^{[1
]}

Quoc-Viet Pham ^{[2
]}

Hwang, Won-Joo ^{[3
]}

机构：

[1] Pusan Natl Univ, Dept Comp Engn, Yangsan Si 50612, South Korea

[2] Pusan Natl Univ, Korean Southeast Ctr 4th Ind Revolut Leader Educ, Busan 46241, South Korea

[3] Pusan Natl Univ, Dept Biomed Convergence Engn, Yangsan Si 50612, South Korea

来源：

IEEE ACCESS | 2022年 / 10卷

基金：

新加坡国家研究基金会;

关键词：

Estimation; Semantics; Decoding; Convolutional neural networks; Computer architecture; Computational modeling; Unsupervised learning; Efficient CNN; deep neural network; depth map; supervised learning; self-supervised learning; IMAGE; NETWORK;

D O I：

10.1109/ACCESS.2022.3145969

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

A depth map helps robots and autonomous vehicles (AVs) visualize the three-dimensional world to navigate and localize neighboring obstacles. However, it is difficult to develop a deep learning model that can estimate the depth map from a single image in real-time. This study proposes a fast monocular depth estimation model named FastMDE by optimizing the deep convolutional neural network according to the encoder-decoder architecture. The decoder needs to obtain partial and semantic feature maps from the encoding phase to improve the depth estimation accuracy. Therefore, we designed FastMDE with two effective strategies. The first one involved redesigning the skip connection with the features of the squeeze-excitation module to obtain partial and semantic feature maps of the encoding phase. The second strategy involved redesigning the decoder by using the fusion dense block to permit the usage of high-resolution features that were learned earlier in the network before upsampling. The proposed FastMDE model utilizes only 4.1 M parameters, which is much lesser than the parameters utilized by state-of-art models. Thus, FastDME has a higher accuracy and lower latency than previous models. This study also demonstrates that MDE can leverage deep neural networks in real-time (i.e., 30 fps) with the Linux embedded board Nvidia Jetson Xavier NX. The model can facilitate the development and applications with superior performances and easy deployment on an embedded platform.

引用

页码：16111 / 16122

页数：12

共 50 条

[1] Nested DWT-Based CNN Architecture for Monocular Depth Estimation
Paul, Sandip
Mishra, Deepak
Marimuthu, Senthil Kumar
SENSORS, 2023, 23 (06)
[2] LightDepthNet: Lightweight CNN Architecture for Monocular Depth Estimation on Edge Devices
Liu, Qingliang
Zhou, Shuai
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (04) : 2389 - 2393
[3] Fast Monocular Depth Estimation on an FPGA
Sada, Youki
Soga, Naoto
Shimoda, Masayuki
Jinguji, Akira
Sato, Shimpei
Nakahara, Hiroki
2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 143 - 146
[4] Dual CNN Models for Unsupervised Monocular Depth Estimation
Repala, Vamshi Krishna
Dubey, Shiv Ram
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 209 - 217
[5] HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation
Lyu, Xiaoyang
Liu, Liang
Wang, Mengmeng
Kong, Xin
Liu, Lina
Liu, Yong
Chen, Xinxin
Yuan, Yi
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2294 - 2301
[6] Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation
Zhang, Ning
Nex, Francesco
Vosselman, George
Kerle, Norman
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18537 - 18546
[7] CNN Approach for Monocular Depth Estimation: Ear Case Study
Magherini, Roberto
Servi, Michaela
Mussi, Elisa
Furferi, Rocco
Buonamici, Francesco
Volpe, Yary
DESIGN TOOLS AND METHODS IN INDUSTRIAL ENGINEERING II, ADM 2021, 2022, : 220 - 228
[8] 360MonoDepth: High-Resolution 360° Monocular Depth Estimation
Rey-Area, Manuel
Yuan, Mingze
Richardt, Christian
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3752 - 3762
[9] FastDepth: Fast Monocular Depth Estimation on Embedded Systems
Wolk, Diana
Ma, Fangchang
Yang, Tien-Lu
Karaman, Sertac
Sze, Vivienne
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 6101 - 6108
[10] A NOVEL LIGHTWEIGHT NETWORK FOR FAST MONOCULAR DEPTH ESTIMATION
Heydrich, Tim
Yang, Yimin
Ma, Xiangyu
Liu, Yu
Du, Shan
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2260 - 2264

← 1 2 3 4 5 →