Unsupervised Monocular Depth Estimation for Autonomous Flight of Drones

Cited by: 3
Authors
Zhao Shuanfeng [1]
Huang Tao [1]
Xu Qian [1]
Geng Longlong [1]
Affiliation
[1] Xian Univ Sci & Technol, Coll Mech Engn, Xian 710054, Shaanxi, Peoples R China
Keywords
image processing; non-supervision; neural network of automatic encoder; image reconstruction; monocular depth estimation; SHAPE;
DOI
10.3788/LOP57.021012
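Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract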
This study proposes an unsupervised monocular depth estimation model for autonomous drone flight. It addresses two limitations: the high cost and large size of binocular depth estimation hardware, and the large number of depth maps that supervised learning requires for training. The model first builds an image pyramid from the input image to reduce the impact of varying target sizes on the depth estimate. An autoencoder network for image reconstruction is then designed on top of ResNet-50, which serves as the feature extractor. Next, the corresponding right (or left) pyramid images are reconstructed from the left (or right) input images by bilinear sampling, and the corresponding pyramid disparity maps are generated. Finally, the training loss is computed as the combination of a disparity smoothness loss, an image reconstruction loss based on structural similarity, and a disparity consistency loss. Experimental results indicate that the model is more accurate and faster on KITTI and Make3D than other monocular depth estimation methods. When trained on KITTI, the model essentially meets the accuracy and real-time requirements of depth estimation for autonomous drone flight.
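The reconstruction and loss steps described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The function names, the sign convention of the disparity, and the loss weights are all hypothetical. A plain L1 photometric term stands in for the structural-similarity-based reconstruction loss, and the smoothness term is a simple gradient penalty. The sketch works at a single scale, whereas the model applies the loss across an image pyramid.

```python
import numpy as np

def warp_horizontal(src, disp):
    """Sample src at (x - disp) with linear interpolation along each row.

    src:  (H, W) grayscale image or disparity map
    disp: (H, W) disparity in pixels (sign convention is an assumption)
    """
    h, w = src.shape
    xs = np.arange(w, dtype=np.float64)[None, :] - disp
    xs = np.clip(xs, 0.0, w - 1.0)           # clamp samples to the image border
    x0 = np.floor(xs).astype(int)            # left neighbour
    x1 = np.minimum(x0 + 1, w - 1)           # right neighbour
    frac = xs - x0                           # interpolation weight
    rows = np.arange(h)[:, None]
    return (1.0 - frac) * src[rows, x0] + frac * src[rows, x1]

def smoothness(disp):
    """Mean absolute disparity gradient (not edge-aware; a simplification)."""
    dx = np.mean(np.abs(np.diff(disp, axis=1)))
    dy = np.mean(np.abs(np.diff(disp, axis=0)))
    return dx + dy

def total_loss(left, right, disp_left, disp_right,
               w_rec=1.0, w_smooth=0.1, w_lr=1.0):
    """Weighted sum of the three loss terms; the weights are hypothetical."""
    left_rec = warp_horizontal(right, disp_left)      # reconstruct left from right
    rec = np.mean(np.abs(left - left_rec))            # L1 stand-in for the SSIM-based term
    smooth = smoothness(disp_left)
    # Left-right consistency: the right disparity map, warped into the
    # left view, should agree with the left disparity map.
    lr = np.mean(np.abs(disp_left - warp_horizontal(disp_right, disp_left)))
    return w_rec * rec + w_smooth * smooth + w_lr * lr
```

Calling total_loss on a rectified stereo pair and two candidate disparity maps returns a scalar. In a trainable version the same arithmetic would be written in an automatic-differentiation framework so that gradients flow back to the network that predicts the disparities.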
Pages: 10