Improving Monocular Depth Estimation by Semantic Pre-training

被引:1
|
作者
Rottmann, Peter [1 ]
Posewsky, Thorbjorn [1 ,2 ]
Milioto, Andres [1 ]
Stachniss, Cyrill [1 ]
Behley, Jens [1 ]
机构
[1] Univ Bonn, Bonn, Germany
[2] Ibeo Automot Syst GmbH, Hamburg, Germany
关键词
D O I
10.1109/IROS51168.2021.9636546
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Knowing the distance to nearby objects is crucial for autonomous cars to navigate safely in everyday traffic. In this paper, we investigate monocular depth estimation, which advanced substantially within the last years and is providing increasingly more accurate results while only requiring a single camera image as input. In line with recent work, we use an encoder-decoder structure with so-called packing layers to estimate depth values in a self-supervised fashion. We propose integrating a joint pre-training of semantic segmentation plus depth estimation on a dataset providing semantic labels. By using a separate semantic decoder that is only needed for pre-training, we can keep the network comparatively small. Our extensive experimental evaluation shows that the addition of such pre-training improves the depth estimation performance substantially. Finally, we show that we achieve competitive performance on the KITTI dataset despite using a much smaller and more efficient network.
引用
收藏
页码:5916 / 5923
页数:8
相关论文
共 50 条
  • [1] Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion
    Yan, Zhiqiang
    Li, Xiang
    Wang, Kun
    Zhang, Zhenyu
    Li, Jun
    Yang, Jian
    [J]. COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 378 - 395
  • [2] Improving Fractal Pre-training
    Anderson, Connor
    Farrell, Ryan
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2412 - 2421
  • [3] Improving fault localization with pre-training
    Zhang, Zhuo
    Li, Ya
    Xue, Jianxin
    Mao, Xiaoguang
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (01)
  • [4] Improving fault localization with pre-training
    Zhuo Zhang
    Ya Li
    Jianxin Xue
    Xiaoguang Mao
    [J]. Frontiers of Computer Science, 2024, 18
  • [5] Bootstrapped Self-Supervised Training with Monocular Video for Semantic Segmentation and Depth Estimation
    Zhang, Yihao
    Leonard, John J.
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 2420 - 2427
  • [6] SEEP: Semantic-enhanced question embeddings pre-training for improving knowledge tracing
    Wang, Wentao
    Ma, Huifang
    Zhao, Yan
    Yang, Fanyi
    Chang, Liang
    [J]. INFORMATION SCIENCES, 2022, 614 : 153 - 169
  • [7] SEEP: Semantic-enhanced question embeddings pre-training for improving knowledge tracing
    Wang, Wentao
    Ma, Huifang
    Zhao, Yan
    Yang, Fanyi
    Chang, Liang
    [J]. INFORMATION SCIENCES, 2022, 614 : 153 - 169
  • [8] MONOCULAR SEGMENT-WISE DEPTH: MONOCULAR DEPTH ESTIMATION BASED ON A SEMANTIC SEGMENTATION PRIOR
    Atapour-Abarghouei, Amir
    Breckon, Toby P.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 4295 - 4299
  • [9] Semantic Monocular Depth Estimation Based on Artificial Intelligence
    Gurram, Akhil
    Urfalioglu, Onay
    Halfaoui, Ibrahim
    Bouzaraa, Fahd
    Lopez, Antonio M.
    [J]. IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2021, 13 (04) : 99 - 103
  • [10] On the benefit of adversarial training for monocular depth estimation
    Groenendijk, Rick
    Karaoglu, Sezer
    Gevers, Theo
    Mensink, Thomas
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 190