Late-Fusion Multimodal Human Detection based on RGB and Thermal Images for Robotic Perception

被引:2
|
作者
Sousa, Elisio [1 ,2 ]
Mota, Kennedy O. S. [1 ,2 ]
Gomes, Iago P. [1 ,2 ]
Garrote, Luis [1 ,2 ]
Wolf, Denis F. [1 ,2 ]
Premebida, Cristiano [1 ,2 ]
机构
[1] Univ Coimbra, Inst Syst & Robot, Dept Elect & Comp Engn, Coimbra, Portugal
[2] Univ Sao Paulo, Inst Math & Comp Sci, Sao Paulo, Brazil
关键词
PEDESTRIAN DETECTION;
D O I
10.1109/ECMR59166.2023.10256301
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the problem of detecting humans in RGB and Thermal (long-wave IR) images taken by cameras mounted onboard a mobile robot. Human/Pedestrian detection is currently one of the most pertinent object detection problems, mainly due to safety concerns in autonomous vehicles. The majority of approaches apply deep-learning techniques based solely on RGB images. However, they have a few shortcomings, namely that during foggy weather, nighttime, and low-light scenarios, these images may not contain sufficient information. To address these issues, this work studies the use of thermal cameras as a complementary source of information for human detection in indoor and outdoor environments. The proposed approach uses YOLOv5 to detect pedestrians in both thermal and RGB images. Moreover, the different modalities are combined using early and late fusion techniques. Evaluation of the proposed approach is carried out in the FLIR Aligned dataset and in a new in-house dataset. Results indicate that the use of fusion techniques highlights a promising way to improve the overall performance in this application domain.
引用
收藏
页码:381 / 386
页数:6
相关论文
共 50 条
  • [1] Saliency-based Object Discovery on RGB-D Data with a Late-Fusion Approach
    Garcia, German M.
    Potapova, Ekaterina
    Werner, Thomas
    Zillich, Michael
    Vincze, Markus
    Frintrop, Simone
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 1866 - 1873
  • [2] A Late-Fusion Approach to Community Detection in Attributed Networks
    Liu, Chang
    Largeron, Christine
    Zaiane, Osmar R.
    Gharaghooshi, Shiva Zamani
    ADVANCES IN INTELLIGENT DATA ANALYSIS XVIII, IDA 2020, 2020, 12080 : 300 - 312
  • [3] Pedestrian detection based on light perception fusion of visible and thermal images
    Li, Guofa
    Lai, Weijian
    Qu, Xingda
    OPTICS AND LASER TECHNOLOGY, 2022, 156
  • [4] Modality-specific Learning Rates for Effective Multimodal Additive Late-fusion
    Yao, Yiqun
    Mihalcea, Rada
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 1824 - 1834
  • [5] A multimodal fusion method for sarcasm detection based on late fusion
    Ding, Ning
    Tian, Sheng-wei
    Yu, Long
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (06) : 8597 - 8616
  • [6] A multimodal fusion method for sarcasm detection based on late fusion
    Ning Ding
    Sheng-wei Tian
    Long Yu
    Multimedia Tools and Applications, 2022, 81 : 8597 - 8616
  • [7] Fusion of thermal and RGB images for automated deep learning based crack detection in civil infrastructure
    Quincy G. Alexander
    Vedhus Hoskere
    Yasutaka Narazaki
    Andrew Maxwell
    Billie F. Spencer
    AI in Civil Engineering, 1 (1):
  • [8] A Multimodal Perception System for Detection of Human Operators in Robotic Work Cells
    Costanzo, Marco
    De Maria, Giuseppe
    Lettera, Gaetano
    Natale, Ciro
    Perrone, Dario
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 692 - 699
  • [9] Robotic Material Perception Using Active Multimodal Fusion
    Liu, Huaping
    Sun, Fuchun
    Zhang, Xinyu
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2019, 66 (12) : 9878 - 9886
  • [10] A Visual-based Late-Fusion Framework for Video Genre Classification
    Mironica, Ionut
    Ionescu, Bogdan
    Rasche, Christoph
    Lambert, Patrick
    2013 INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS), 2013,