Late-Fusion Multimodal Human Detection based on RGB and Thermal Images for Robotic Perception

被引:2
|
作者
Sousa, Elisio [1 ,2 ]
Mota, Kennedy O. S. [1 ,2 ]
Gomes, Iago P. [1 ,2 ]
Garrote, Luis [1 ,2 ]
Wolf, Denis F. [1 ,2 ]
Premebida, Cristiano [1 ,2 ]
机构
[1] Univ Coimbra, Inst Syst & Robot, Dept Elect & Comp Engn, Coimbra, Portugal
[2] Univ Sao Paulo, Inst Math & Comp Sci, Sao Paulo, Brazil
关键词
PEDESTRIAN DETECTION;
D O I
10.1109/ECMR59166.2023.10256301
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the problem of detecting humans in RGB and Thermal (long-wave IR) images taken by cameras mounted onboard a mobile robot. Human/Pedestrian detection is currently one of the most pertinent object detection problems, mainly due to safety concerns in autonomous vehicles. The majority of approaches apply deep-learning techniques based solely on RGB images. However, they have a few shortcomings, namely that during foggy weather, nighttime, and low-light scenarios, these images may not contain sufficient information. To address these issues, this work studies the use of thermal cameras as a complementary source of information for human detection in indoor and outdoor environments. The proposed approach uses YOLOv5 to detect pedestrians in both thermal and RGB images. Moreover, the different modalities are combined using early and late fusion techniques. Evaluation of the proposed approach is carried out in the FLIR Aligned dataset and in a new in-house dataset. Results indicate that the use of fusion techniques highlights a promising way to improve the overall performance in this application domain.
引用
收藏
页码:381 / 386
页数:6
相关论文
共 50 条
  • [21] Multimodal Heartbeat Rate Estimation from the Fusion of Facial RGB and Thermal Videos
    Johansen, Anders S.
    Henriksen, Jesper W.
    Haque, Mohammad A.
    Jahromi, Mohammad Naser Sabet
    Nasrollahi, Kamal
    Moeslund, Thomas B.
    ELEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2018), 2019, 11041
  • [22] RGB-D Visual Saliency Detection Algorithm Based on Information Guided and Multimodal Feature Fusion
    Xu, Lijuan
    Xu, Xuemiao
    IEEE ACCESS, 2024, 12 : 268 - 280
  • [23] A Transformer-based Late-Fusion Mechanism for Fine-Grained Object Recognition in Videos
    Koch, Jannik
    Wolf, Stefan
    Beyerer, Juergen
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2023, : 100 - 109
  • [24] LEARNING-BASED HUMAN DETECTION APPLIED TO RGB-D IMAGES
    Santoso, Patrisia Sherryl
    Hang, Hsueh-Ming
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3365 - 3369
  • [25] Human authentication based on fusion of thermal and visible face images
    Ayan Seal
    Chinmaya Panigrahy
    Multimedia Tools and Applications, 2019, 78 : 30373 - 30395
  • [26] Human authentication based on fusion of thermal and visible face images
    Seal, Ayan
    Panigrahy, Chinmaya
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (21) : 30373 - 30395
  • [27] A Potential Vision-Based Measurements Technology: Information Flow Fusion Detection Method Using RGB-Thermal Infrared Images
    Song, Kechen
    Bao, Yanqi
    Wang, Han
    Huang, Liming
    Yan, Yunhui
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [28] Enhanced Thermal-RGB Fusion for Robust Object Detection
    El Ahmar, Wassim
    Massoud, Yahya
    Kolhatkar, Dhanvin
    AlGhamdi, Hamzah
    Alja'afreh, Mohammad
    Hammoud, Riad
    Laganiere, Robert
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2023, : 365 - 374
  • [29] Recognition of Weeds in Wheat Fields Based on the Fusion of RGB Images and Depth Images
    Xu, Ke
    Li, Huaimin
    Cao, Weixing
    Zhu, Yan
    Chen, Rongjia
    Ni, Jun
    IEEE ACCESS, 2020, 8 : 110362 - 110370
  • [30] DCFNet: Dense Complementary Fusion for RGB-Thermal Urban Scene Perception
    Zhang, Yu-Wen Michael
    Zhang, Gang
    Hu, Xiaolin
    ADVANCES IN NEURAL NETWORKS-ISNN 2024, 2024, 14827 : 317 - 327