Evaluation of Multimodal Semantic Segmentation using RGB-D Data

被引:1
|
作者
Hu, Jiesi [1 ]
Zhao, Ganning [1 ]
You, Suya [2 ]
Kuo, C. C. Jay [1 ]
机构
[1] Univ Southern Calif, 3551 Trousdale Pkwy, Los Angeles, CA 90089 USA
[2] Army Res Lab, 12025 E Waterfront Dr, Los Angeles, CA 90094 USA
关键词
RGB-D semantic segmentation; multiple datasets learning; autonomous driving; obstacle detection; OBSTACLE DETECTION;
D O I
10.1117/12.2587991
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Our goal is to develop stable, accurate, and robust semantic scene understanding methods for wide-area scene perception and understanding, especially in challenging outdoor environments. To achieve this, we are exploring and evaluating a range of related technology and solutions, including AI-driven multimodal scene perception, fusion, processing, and understanding. This work reports our efforts on the evaluation of a state-of-the-art approach for semantic segmentation with multiple RGB and depth sensing data. We employ four large datasets composed of diverse urban and terrain scenes and design various experimental methods and metrics. In addition, we also develop new strategies of multi-datasets learning to improve the detection and recognition of unseen objects. Extensive experiments, implementations, and results are reported in the paper.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A Fusion Network for Semantic Segmentation Using RGB-D Data
    Yuan, Jiahui
    Zhang, Kun
    Xia, Yifan
    Qi, Lin
    Dong, Junyu
    [J]. NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [2] Multimodal Neural Networks: RGB-D for Semantic Segmentation and Object Detection
    Schneider, Lukas
    Jasch, Manuel
    Froehlich, Bjoern
    Weber, Thomas
    Franke, Uwe
    Pollefeys, Marc
    Raetsch, Matthias
    [J]. IMAGE ANALYSIS, SCIA 2017, PT I, 2017, 10269 : 98 - 109
  • [3] RGB-D SEMANTIC SEGMENTATION: A REVIEW
    Hu, Yaosi
    Chen, Zhenzhong
    Lin, Weiyao
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [4] Gated-Residual Block for Semantic Segmentation Using RGB-D Data
    Qian, Yeqiang
    Deng, Liuyuan
    Li, Tianyi
    Wang, Chunxiang
    Yang, Ming
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 11836 - 11844
  • [5] Joining geometric and RGB features for RGB-D semantic segmentation
    Zhang, Shaopeng
    Zhong, Min
    Zeng, Gang
    Gan, Rui
    [J]. 2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2019, 11321
  • [6] Enhancing Deep Semantic Segmentation of RGB-D Data with Entangled Forests
    Terreran, Matteo
    Bonetto, Elia
    Ghidoni, Stefano
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4634 - 4641
  • [7] Safety monitoring for workers using RGB-D camera and semantic segmentation
    Hong, Hyosung
    Kim, Jeong-Jung
    Koh, Doo-Yeol
    Park, Jinseong
    Kim, Chang-Hyun
    Jeong, Hyeonho
    Park, Gyuha
    Won, Mooncheol
    [J]. Journal of Institute of Control, Robotics and Systems, 2019, 25 (08): : 722 - 728
  • [8] A brief survey on RGB-D semantic segmentation using deep learning*
    Wang, Changshuo
    Wang, Chen
    Li, Weijun
    Wang, Haining
    [J]. DISPLAYS, 2021, 70
  • [9] 2.5D CONVOLUTION FOR RGB-D SEMANTIC SEGMENTATION
    Xing, Yajie
    Wang, Jingbo
    Chen, Xiaokang
    Zeng, Gang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1410 - 1414
  • [10] DEPTH REMOVAL DISTILLATION FOR RGB-D SEMANTIC SEGMENTATION
    Fang, Tiyu
    Liang, Zhen
    Shao, Xiuli
    Dong, Zihao
    Li, Jinping
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2405 - 2409