Early vs Late Fusion in Multimodal Convolutional Neural Networks

Authors
Gadzicki, Konrad [1]
Khamsehashari, Razieh [1]
Zetzsche, Christoph [1]
Affiliation
[1] Univ Bremen, Cognit Neuroinformat, Bremen, Germany
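Keywords
Multi-layer neural network; Activity recognition; Sensor fusion
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract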
Combining machine learning in neural networks with multimodal fusion strategies offers interesting potential for classification tasks, but the optimal fusion strategies for many applications have yet to be determined. Here we address this issue in the context of human activity recognition, making use of a state-of-the-art convolutional network architecture (Inception I3D) and a large dataset (NTU RGB+D). As modalities we consider RGB video, optical flow, and skeleton data. We determine whether fusing different modalities provides an advantage over unimodal approaches, and whether a more complex early fusion strategy can outperform the simpler late fusion strategy by exploiting statistical correlations between the modalities. Our results show a clear performance improvement from multimodal fusion and a substantial advantage for the early fusion strategy.
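The distinction between the two fusion strategies named in the abstract can be illustrated with a toy sketch. This is a generic illustration only, not the architecture used in the paper: the linear classifiers, the feature vectors, the weights, and the function names `late_fusion` and `early_fusion` are all hypothetical. Late fusion classifies each modality on its own and then averages the class probabilities; early fusion concatenates the modality features and classifies the joint vector once, so a single classifier can weigh features across modalities.

```python
import math


def softmax(scores):
    """Convert raw class scores into probabilities."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, features):
    """One score per class: dot product of each weight row with the features."""
    return [sum(w * x for w, x in zip(row, features)) for row in weights]


def late_fusion(per_modality_weights, per_modality_features):
    """Classify each modality separately, then average the class probabilities."""
    probs = [
        softmax(linear(w, f))
        for w, f in zip(per_modality_weights, per_modality_features)
    ]
    n_classes = len(probs[0])
    return [sum(p[c] for p in probs) / len(probs) for c in range(n_classes)]


def early_fusion(joint_weights, per_modality_features):
    """Concatenate all modality features, then classify the joint vector once."""
    joint = [x for f in per_modality_features for x in f]
    return softmax(linear(joint_weights, joint))


# Hypothetical 2-dimensional features for three modalities (assumed values).
rgb, flow, skeleton = [1.0, 0.2], [0.4, 0.6], [0.1, 0.9]
features = [rgb, flow, skeleton]

# Hypothetical weights for a 2-class problem (assumed values).
w_rgb = [[1.0, 0.0], [0.0, 1.0]]
w_flow = [[0.5, 0.5], [0.2, 0.8]]
w_skeleton = [[0.3, 0.1], [0.1, 0.7]]
w_joint = [
    [1.0, 0.0, 0.5, 0.5, 0.3, 0.1],
    [0.0, 1.0, 0.2, 0.8, 0.1, 0.7],
]

late = late_fusion([w_rgb, w_flow, w_skeleton], features)
early = early_fusion(w_joint, features)
print("late fusion probabilities: ", late)
print("early fusion probabilities:", early)
```

In this sketch the joint weight matrix is just the per-modality weights placed side by side; in a trained network the early-fusion layers would be free to learn cross-modal interactions that independent per-modality classifiers cannot represent.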
Pages: 292-297
Number of pages: 6