Multi-modal dataset and fusion network for simultaneous semantic segmentation of on-road dynamic objects

被引:0
|
作者
Cho, Jieun [1 ]
Ha, Jinsu [1 ]
Song, Hamin [1 ]
Jang, Sungmoon [2 ]
Jo, Kichun [3 ]
机构
[1] Konkuk Univ, Dept Smart Vehicle Engn, Seoul 05029, South Korea
[2] Hyundai Motor Co, Automot R&D Div, Seoul 06182, South Korea
[3] Hanyang Univ, Dept Automot Engn, Seoul 04763, South Korea
基金
新加坡国家研究基金会;
关键词
Semantic segmentation; Sensor fusion; Perception; Deep learning; Autonomous driving;
D O I
10.1016/j.engappai.2025.110024
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An accurate and robust perception system is essential for autonomous vehicles to interact with various dynamic objects on the road. By applying semantic segmentation techniques to the data from the camera sensor and light detection and ranging sensor, dynamic objects can be classified at pixel and point levels respectively. However, there are challenges when using a single sensor, especially under adverse lighting conditions or with sparse point densities. To address these challenges, this paper proposes a network for simultaneous point cloud and image semantic segmentation based on sensor fusion. The proposed network adopts a modal- specific architecture to fully leverage the characteristics of sensor data and achieves geometrically accurate matching through the image, point, and voxel feature fusion module. Additionally, we introduce the dataset that provides semantic labels for synchronized images and point clouds. Experimental results show that the proposed fusion approach outperforms uni-modal based methods and demonstrates robust performance even in challenging real-world scenarios. The dataset is publicly available at https://github.com/ailab-konkuk/MultiModal-Dataset.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] SPFUSIONNET: SKETCH SEGMENTATION USING MULTI-MODAL DATA FUSION
    Wang, Fei
    Lin, Shujin
    Wu, Hefeng
    Li, Hanhui
    Wang, Ruomei
    Luo, Xiaonan
    He, Xiangjian
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1654 - 1659
  • [42] Multi-modal Action Segmentation in the Kitchen with a Feature Fusion Approach
    Kogure, Shunsuke
    Aoki, Yoshimitsu
    FIFTEENTH INTERNATIONAL CONFERENCE ON QUALITY CONTROL BY ARTIFICIAL VISION, 2021, 11794
  • [43] Gesture Recognition and Multi-modal Fusion on a New Hand Gesture Dataset
    Schak, Monika
    Gepperth, Alexander
    PATTERN RECOGNITION APPLICATIONS AND METHODS, ICPRAM 2021, ICPRAM 2022, 2023, 13822 : 76 - 97
  • [44] Effective Fusion of Multi-Modal Remote Sensing Data in a Fully Convolutional Network for Semantic Labeling
    Zhang, Wenkai
    Huang, Hai
    Schmitz, Matthias
    Sun, Xian
    Wang, Hongqi
    Mayer, Helmut
    REMOTE SENSING, 2018, 10 (01)
  • [45] 3M2RNet: Multi-Modal Multi-Resolution Refinement Network for Semantic Segmentation
    Fooladgar, Fahimeh
    Kasaei, Shohreh
    ADVANCES IN COMPUTER VISION, VOL 2, 2020, 944 : 544 - 557
  • [46] MGFNet: An MLP-dominated gated fusion network for semantic segmentation of high-resolution multi-modal remote sensing images
    Wei, Kan
    Dai, Jinkun
    Hong, Danfeng
    Ye, Yuanxin
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 135
  • [47] Unsupervised domain adaptation multi-level adversarial network for semantic segmentation based on multi-modal features
    Wang Z.
    Bu S.
    Huang W.
    Zheng Y.
    Wu Q.
    Chang H.
    Zhang X.
    Tongxin Xuebao/Journal on Communications, 2022, 43 (12): : 157 - 171
  • [48] Robust 3D Semantic Segmentation Based on Multi-Phase Multi-Modal Fusion for Intelligent Vehicles
    Ni, Peizhou
    Li, Xu
    Xu, Wang
    Kong, Dong
    Hu, Yue
    Wei, Kun
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1602 - 1614
  • [49] Multi-Modal Multi-Task (3MT) Road Segmentation
    Milli, Erkan
    Erkent, Ozgur
    Ylmaz, Asm Egemen
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (09) : 5408 - 5415
  • [50] Multi-modal simultaneous machine translation fusion of image information
    Huang, Yan
    Wanga, Zhanyang
    Zhang, TianYuan
    Xu, Chun
    Lianga, Hui
    JOURNAL OF ENGINEERING RESEARCH, 2023, 11 (02):