eViTBins: Edge-Enhanced Vision-Transformer Bins for Monocular Depth Estimation on Edge Devices

Authors
She, Yutong [1 ]
Li, Peng [1 ]
Wei, Mingqiang [1 ]
Liang, Dong [1 ]
Chen, Yiping [2 ]
Xie, Haoran [3 ]
Wang, Fu Lee [4 ]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Sch Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Sun Yat Sen Univ, Sch Geospatial Engn & Sci, Zhuhai 519082, Peoples R China
[3] Lingnan Univ, Sch Data Sci, Hong Kong, Peoples R China
[4] Hong Kong Metropolitan Univ, Sch Sci & Technol, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Edge-enhanced vision transformer; adaptive depth bins; monocular depth estimation; edge AI; unmanned aerial vehicle; traffic monitoring;
DOI
10.1109/TITS.2024.3480114
Chinese Library Classification (CLC) number
TU [Building Science];
Discipline classification code
0813 ;
Abstract
Monocular depth estimation (MDE) remains a fundamental yet not well-solved problem in computer vision. Current MDE methods often produce blurred or even indistinct depth boundaries, degrading the quality of vision-based intelligent transportation systems. This paper presents an edge-enhanced vision transformer bins network for monocular depth estimation, termed eViTBins. eViTBins has three core modules to predict monocular depth maps with exceptional smoothness, accuracy, and fidelity to scene structures and object edges. First, a multi-scale feature fusion module is proposed to circumvent the loss of depth information at various levels during depth regression. Second, an image-guided edge-enhancement module is proposed to accurately infer depth values around image boundaries. Third, a vision transformer-based depth discretization module is introduced to comprehend the global depth distribution. Meanwhile, unlike most MDE models that rely on high-performance GPUs, eViTBins is optimized for seamless deployment on edge devices, such as NVIDIA Jetson Nano and Google Coral SBC, making it well suited to real-time intelligent transportation system applications. Extensive experimental evaluations corroborate the superiority of eViTBins over competing methods, notably in terms of preserving depth edges and global depth representations.
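The abstract does not spell out how the depth discretization module turns bins into a depth map. As a rough illustration only, the sketch below follows the adaptive-bins formulation commonly used in this line of work (as in AdaBins): per-image bin widths are normalized, converted to bin centers over a depth range, and each pixel's depth is the probability-weighted sum of those centers. Whether eViTBins uses exactly this formulation is an assumption; the function names, the depth range, and the toy inputs are all hypothetical.

```python
import math


def bin_centers(widths, d_min, d_max):
    """Convert positive bin widths into bin centers inside [d_min, d_max].

    Assumption: widths are normalized to sum to 1 before being mapped
    onto the depth range, as in adaptive-bins style discretization.
    """
    total = sum(widths)
    normalized = [w / total for w in widths]
    centers, left_edge = [], 0.0
    for w in normalized:
        # The center sits half a bin width past the bin's left edge.
        centers.append(d_min + (d_max - d_min) * (left_edge + w / 2.0))
        left_edge += w
    return centers


def softmax(logits):
    """Numerically stable softmax over one pixel's bin logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    norm = sum(exps)
    return [e / norm for e in exps]


def pixel_depth(logits, centers):
    """Depth of one pixel: probability-weighted sum of the bin centers."""
    probs = softmax(logits)
    return sum(p * c for p, c in zip(probs, centers))


# Toy example: three bins over a hypothetical 0 to 8 metre range.
centers = bin_centers([1.0, 1.0, 2.0], d_min=0.0, d_max=8.0)
depth = pixel_depth([0.0, 0.0, 0.0], centers)  # uniform probabilities
print(centers, depth)
```

Because the final depth is a smooth blend of bin centers rather than a hard bin assignment, this style of discretization avoids the staircase artifacts of plain per-pixel classification.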
Pages: 20320 - 20334
Number of pages: 15
Related papers
50 in total
  • [1] Edge-Enhanced Dual-Stream Perception Network for Monocular Depth Estimation
    Liu, Zihang
    Wang, Quande
    ELECTRONICS, 2024, 13 (09)
  • [2] Monocular depth estimation with enhanced edge
    Wang Q.
    Wang Q.
    Cheng K.
    Liu Z.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (03) : 36 - 42
  • [3] Lightweight Monocular Depth Estimation on Edge Devices
    Liu, Siping
    Yang, Laurence Tianruo
    Tu, Xiaohan
    Li, Renfa
    Xu, Cheng
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (17) : 16168 - 16180
  • [4] Efficient Monocular Depth Estimation for Edge Devices in Internet of Things
    Tu, Xiaohan
    Xu, Cheng
    Liu, Siping
    Li, Renfa
    Xie, Guoqi
    Huang, Jing
    Yang, Laurence Tianruo
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (04) : 2821 - 2832
  • [5] Real-Time Monocular Depth Estimation Merging Vision Transformers on Edge Devices for AIoT
    Liu, Xihao
    Wei, Wei
    Liu, Cheng
    Peng, Yuyang
    Huang, Jinhao
    Li, Jun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [6] Depth Estimation from Monocular Vision using Image Edge Complexity
    Haris, Sallehuddin Mohamed
    Zakaria, Muhammad Khalid
    Nuawi, Mohd Zaki
    2011 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2011, : 868 - 873
  • [7] LightDepthNet: Lightweight CNN Architecture for Monocular Depth Estimation on Edge Devices
    Liu, Qingliang
    Zhou, Shuai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (04) : 2389 - 2393
  • [8] ET: Edge-Enhanced Transformer for Image Splicing Detection
    Sun, Yu
    Ni, Rongrong
    Zhao, Yao
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1232 - 1236
  • [9] The Constraints between Edge Depth and Uncertainty for Monocular Depth Estimation
    Wu, Shouying
    Li, Wei
    Liang, Binbin
    Huang, Guoxin
    ELECTRONICS, 2021, 10 (24)
  • [10] EDRNet: Edge-Enhanced Dynamic Routing Adaptive for Depth Completion
    Sun, Fuyun
    Li, Baoquan
    Zhang, Qiaomei
    MATHEMATICS, 2025, 13 (06)