Early vs Late Fusion in Multimodal Convolutional Neural Networks

Authors
Gadzicki, Konrad [1]
Khamsehashari, Razieh [1]
Zetzsche, Christoph [1]
Affiliations
[1] University of Bremen, Cognitive Neuroinformatics, Bremen, Germany
Keywords
Multi-layer neural network; Activity recognition; Sensor fusion
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Combining machine learning in neural networks with multimodal fusion strategies offers interesting potential for classification tasks, but the optimum fusion strategies for many applications have yet to be determined. Here we address this issue in the context of human activity recognition, making use of a state-of-the-art convolutional network architecture (Inception I3D) and a large-scale dataset (NTU RGB+D). As modalities we consider RGB video, optical flow, and skeleton data. We determine whether the fusion of different modalities can provide an advantage over uni-modal approaches, and whether a more complex early fusion strategy can outperform the simpler late fusion strategy by making use of statistical correlations between the different modalities. Our results show a clear performance improvement from multi-modal fusion and a substantial advantage for an early fusion strategy.
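
The distinction between the two strategies named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' Inception I3D pipeline: the feature dimensions, the linear classifiers, the random weights, and the function names `late_fusion` and `early_fusion` are all hypothetical, chosen only to show where the modalities are combined. Late fusion is sketched here as averaging per-modality class probabilities; early fusion as concatenating features before a single joint classifier, which is what lets it exploit cross-modal correlations.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over the last axis.
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def late_fusion(features, weights):
    """One classifier per modality; the class probabilities are averaged."""
    probs = [softmax(x @ w) for x, w in zip(features, weights)]
    return np.mean(probs, axis=0)

def early_fusion(features, joint_weight):
    """Features are concatenated first; one classifier sees all modalities."""
    fused = np.concatenate(features, axis=-1)
    return softmax(fused @ joint_weight)

# Hypothetical sizes, for illustration only.
rng = np.random.default_rng(0)
batch, n_classes = 4, 5
dims = {"rgb": 8, "flow": 8, "skeleton": 6}

# Stand-ins for per-modality feature vectors and (untrained) classifier weights.
features = [rng.standard_normal((batch, d)) for d in dims.values()]
weights = [rng.standard_normal((d, n_classes)) for d in dims.values()]
joint_weight = rng.standard_normal((sum(dims.values()), n_classes))

late = late_fusion(features, weights)
early = early_fusion(features, joint_weight)
print(late.shape, early.shape)
```

Both functions return one probability distribution over classes per sample. In a trained network, the joint weights in the early-fusion variant can learn interactions between modalities, whereas the late-fusion variant only combines decisions that were made independently.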
Pages: 292-297
Page count: 6
Related papers
  • [21] Convolutional Neural Networks for Multimodal Remote Sensing Data Classification
    Wu, Xin
    Hong, Danfeng
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [22] Multimodal Convolutional Neural Networks for Sperm Motility and Concentration Predictions
    Goh, Voon Hueh
    Mansor, Muhammad Asraf
    As'ari, Muhammad Amir
    Ismail, Lukman Hakim
    MALAYSIAN JOURNAL OF FUNDAMENTAL AND APPLIED SCIENCES, 2024, 20 (02): 347 - 359
  • [23] Multimodal Emotion Recognition Using a Hierarchical Fusion Convolutional Neural Network
    Zhang, Yong
    Cheng, Cheng
    Zhang, Yidie
    IEEE ACCESS, 2021, 9 : 7943 - 7951
  • [24] Multilevel Features Fusion in Deep Convolutional Neural Networks
    Zhuo, Yi-Fan
    Wang, Yi-Lei
    CLOUD COMPUTING AND SECURITY, PT VI, 2018, 11068 : 600 - 610
  • [25] Infrared and visible image fusion with convolutional neural networks
    Liu, Yu
    Chen, Xun
    Cheng, Juan
    Peng, Hu
    Wang, Zengfu
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2018, 16 (03)
  • [26] ConvFusion: A Model for Layer Fusion in Convolutional Neural Networks
    Waeijen, Luc
    Sioutas, Savvas
    Peemen, Maurice
    Lindwer, Menno
    Corporaal, Henk
    IEEE ACCESS, 2021, 9 : 168245 - 168267
  • [27] Hierarchical fusion convolutional neural networks for SAR image
    Jiang, Yinyin
    Li, Ming
    Zhang, Peng
    Tan, Xiaofeng
    Song, Wanying
    PATTERN RECOGNITION LETTERS, 2021, 147 : 115 - 123
  • [28] Fusion based Heterogeneous Convolutional Neural Networks Architecture
    Kornish, David
    Ezekiel, Soundararajan
    Cornacchia, Maria
    2018 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2018
  • [29] PerceptionNet: A Deep Convolutional Neural Network for Late Sensor Fusion
    Kasnesis, Panagiotis
    Patrikakis, Charalampos Z.
    Venieris, Iakovos S.
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2019, 868 : 101 - 119
  • [30] Hand Gesture Recognition with Convolutional Neural Networks for the Multimodal UAV Control
    Ma, Yuntao
    Liu, Yuxuan
    Jin, Ruiyang
    Yuan, Xingyang
    Sekha, Raza
    Wilson, Samuel
    Vaidyanathan, Ravi
    2017 WORKSHOP ON RESEARCH, EDUCATION AND DEVELOPMENT OF UNMANNED AERIAL SYSTEMS (RED-UAS), 2017, : 198 - 203