Real-Time Joint Semantic Segmentation and Depth Estimation Using Asymmetric Annotations

被引:0
|
作者
Nekrasov, Vladimir [1 ]
Dharmasiri, Thanuja [2 ]
Spek, Andrew [2 ]
Drummond, Tom [2 ]
Shen, Chunhua [1 ]
Reid, Ian [1 ]
机构
[1] Univ Adelaide, Sch Comp Sci, Adelaide, SA, Australia
[2] Monash Univ, Clayton, Vic, Australia
基金
澳大利亚研究理事会;
关键词
D O I
10.1109/icra.2019.8794220
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deployment of deep learning models in robotics as sensory information extractors can be a daunting task to handle, even using generic GPU cards. Here, we address three of its most prominent hurdles, namely, i) the adaptation of a single model to perform multiple tasks at once (in this work, we consider depth estimation and semantic segmentation crucial for acquiring geometric and semantic understanding of the scene), while ii) doing it in real-time, and iii) using asymmetric datasets with uneven numbers of annotations per each modality. To overcome the first two issues, we adapt a recently proposed real-time semantic segmentation network, making changes to further reduce the number of floating point operations. To approach the third issue, we embrace a simple solution based on hard knowledge distillation under the assumption of having access to a powerful 'teacher' network. We showcase how our system can be easily extended to handle more tasks, and more datasets, all at once, performing depth estimation and segmentation both indoors and outdoors with a single model. Quantitatively, we achieve results equivalent to (or better than) current state-of-the-art approaches with one forward pass costing just 13ms and 6.5 GFLOPs on 640x480 inputs. This efficiency allows us to directly incorporate the raw predictions of our network into the SemanticFusion framework [1] for dense 3D semantic reconstruction of the scene.
引用
收藏
页码:7101 / 7107
页数:7
相关论文
共 50 条
  • [1] Driving Scene Perception Network: Real-time Joint Detection, Depth Estimation and Semantic Segmentation
    Chen, Liangfu
    Yang, Zeng
    Ma, Jianjun
    Luo, Zheng
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1283 - 1291
  • [2] DAABNet: depth-wise asymmetric attention bottleneck for real-time semantic segmentation
    Qingsong Tang
    Yingli Chen
    Minghui Zhao
    Shitong Min
    Wuming Jiang
    International Journal of Multimedia Information Retrieval, 2024, 13
  • [3] DAABNet: depth-wise asymmetric attention bottleneck for real-time semantic segmentation
    Tang, Qingsong
    Chen, Yingli
    Zhao, Minghui
    Min, Shitong
    Jiang, Wuming
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2024, 13 (01)
  • [4] Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation
    Sun, Peng
    Wu, Jiaxiang
    Li, Songyuan
    Lin, Peiwen
    Huang, Junzhou
    Li, Xi
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (05) : 1506 - 1525
  • [5] Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation
    Peng Sun
    Jiaxiang Wu
    Songyuan Li
    Peiwen Lin
    Junzhou Huang
    Xi Li
    International Journal of Computer Vision, 2021, 129 : 1506 - 1525
  • [6] EACNet: Enhanced Asymmetric Convolution for Real-Time Semantic Segmentation
    Li, Yaqian
    Li, Xiaokun
    Xiao, Cunjun
    Li, Haibin
    Zhang, Wenming
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 234 - 238
  • [7] Lightweight Asymmetric Dilation Network for Real-Time Semantic Segmentation
    Hu, Xuegang
    Gong, Yu
    IEEE ACCESS, 2021, 9 : 55630 - 55643
  • [8] Real-time Semantic Segmentation-based Depth Upsampling using Deep Learning
    Miclea, Vlad-Cristian
    Nedevschi, Sergiu
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 300 - 306
  • [9] Fast and Accurate Real-Time Semantic Segmentation with Dilated Asymmetric Convolutions
    Rosas-Arias, Leonel
    Benitez-Garcia, Gibran
    Portillo-Portillo, Jose
    Sanchez-Perez, Gabriel
    Yanai, Keiji
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2264 - 2271
  • [10] ADSCNet: asymmetric depthwise separable convolution for semantic segmentation in real-time
    Wang, Jiawei
    Xiong, Hongyun
    Wang, Haibo
    Nian, Xiaohong
    APPLIED INTELLIGENCE, 2020, 50 (04) : 1045 - 1056