Late-Fusion Multimodal Human Detection based on RGB and Thermal Images for Robotic Perception

Cited by: 2
Authors
Sousa, Elisio [1, 2]
Mota, Kennedy O. S. [1, 2]
Gomes, Iago P. [1, 2]
Garrote, Luis [1, 2]
Wolf, Denis F. [1, 2]
Premebida, Cristiano [1, 2]
Affiliations
[1] Univ Coimbra, Inst Syst & Robot, Dept Elect & Comp Engn, Coimbra, Portugal
[2] Univ Sao Paulo, Inst Math & Comp Sci, Sao Paulo, Brazil
Keywords
PEDESTRIAN DETECTION
DOI
10.1109/ECMR59166.2023.10256301
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper addresses the problem of detecting humans in RGB and thermal (long-wave IR) images taken by cameras mounted onboard a mobile robot. Human/pedestrian detection is currently one of the most pertinent object detection problems, mainly due to safety concerns in autonomous vehicles. The majority of approaches apply deep-learning techniques based solely on RGB images. However, RGB images may not contain sufficient information in foggy weather, at night, or in other low-light scenarios. To address these issues, this work studies the use of thermal cameras as a complementary source of information for human detection in indoor and outdoor environments. The proposed approach uses YOLOv5 to detect pedestrians in both thermal and RGB images. Moreover, the different modalities are combined using early and late fusion techniques. The proposed approach is evaluated on the FLIR Aligned dataset and on a new in-house dataset. Results indicate that fusion techniques are a promising way to improve overall performance in this application domain.
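The abstract does not say which late-fusion rule is used, so the following is only a minimal sketch of one common decision-level scheme: pool the boxes returned by the RGB and thermal detectors and run non-maximum suppression over the pooled set. The detection format `(x1, y1, x2, y2, score)`, the function names, and the IoU threshold of 0.5 are all assumptions made for illustration, and the sketch presumes the two images are pixel-aligned so that boxes share one coordinate frame.

```python
from typing import List, Tuple

# Hypothetical detection format: (x1, y1, x2, y2, score).
Detection = Tuple[float, float, float, float, float]


def iou(a: Detection, b: Detection) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def late_fusion_nms(
    rgb: List[Detection],
    thermal: List[Detection],
    iou_thr: float = 0.5,  # assumed value, not taken from the paper
) -> List[Detection]:
    """Pool per-modality detections and keep the highest-scoring
    box among any group of boxes that overlap above iou_thr."""
    pooled = sorted(rgb + thermal, key=lambda d: d[4], reverse=True)
    kept: List[Detection] = []
    for det in pooled:
        # Keep a box only if it does not overlap an already kept box.
        if all(iou(det, k) < iou_thr for k in kept):
            kept.append(det)
    return kept


# One person seen by both cameras, one seen only by the thermal camera.
rgb_dets = [(10.0, 10.0, 50.0, 100.0, 0.9)]
thermal_dets = [(12.0, 11.0, 52.0, 98.0, 0.8), (200.0, 50.0, 240.0, 140.0, 0.7)]
fused = late_fusion_nms(rgb_dets, thermal_dets)
```

In this example the overlapping RGB and thermal boxes collapse into the higher-scoring one, while the thermal-only detection survives, which is the complementary behaviour the abstract motivates for low-light scenes.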
Pages: 381-386
Number of pages: 6
Related papers
50 in total
  • [41] Multiscale multilevel context and multimodal fusion for RGB-D salient object detection
    Wu, Junwei
    Zhou, Wujie
    Luo, Ting
    Yu, Lu
    Lei, Jingsheng
    SIGNAL PROCESSING, 2021, 178
  • [42] Joint Inpainting of RGB and Depth Images by Generative Adversarial Network with a Late Fusion approach
    Fujii, Ryo
    Hachiuma, Ryo
    Saito, Hideo
    ADJUNCT PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR-ADJUNCT 2019), 2019 : 203 - 204
  • [43] MFFNet: Multimodal feature fusion network for RGB-D transparent object detection
    Zhu, Li
    Li, Tuanjie
    Ning, Yuming
    Zhang, Yan
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2024, 21 (05):
  • [44] Efficient Fully Convolutional Network and Optimization Approach for Robotic Grasping Detection Based on RGB-D Images
    Nie, Wei
    Liang, Xinwu
    Journal of Shanghai Jiaotong University (Science), 2023,
  • [45] Multimodal Convolutional Neural Network for Object Detection Using RGB-D Images
    Mocanu, Irina
    Clapon, Cosmin
    2018 41ST INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2018 : 307 - 310
  • [46] Pedestrian Detection by Fusion of RGB and Infrared Images in Low-Light Environment
    Deng, Qing
    Tian, Wei
    Huang, Yuyao
    Xiong, Lu
    Bi, Xin
    2021 IEEE 24TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2021 : 286 - 293
  • [47] Indoor Human Detection using RGB-D images
    Li, Baopu
    Jin, Haoyang
    Zhang, Qi
    Xia, Wei
    Li, Huiyun
    2016 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (ICIA), 2016 : 1354 - 1360
  • [48] Multimodal Deep Learning-based Feature Fusion for Object Detection in Remote Sensing Images
    Yin, Shoulin
    Wang, Qunming
    Wang, Liguo
    Ivanovic, Mirjana
    Li, Hang
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2025, 22 (01) : 327 - 344
  • [49] Evaluating fusion of RGB-D and inertial sensors for multimodal human action recognition
    Javed Imran
    Balasubramanian Raman
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 189 - 208
  • [50] Evaluating fusion of RGB-D and inertial sensors for multimodal human action recognition
    Imran, Javed
    Raman, Balasubramanian
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (01) : 189 - 208