Multi-Modal Convolutional Neural Networks for Activity Recognition

被引:118
|
作者
Ha, Sojeong [1 ]
Yun, Jeong-Min [1 ]
Choi, Seungjin [1 ]
机构
[1] Pohang Univ Sci & Technol, Dept Comp Sci & Engn, 77 Cheongam Ro, Pohang 790784, South Korea
关键词
Activity recognition; convolutional neural networks; deep learning;
D O I
10.1109/SMC.2015.525
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional neural network (CNN), which comprises one or more convolutional and pooling layers followed by one or more fully-connected layers, has gained popularity due to its ability to learn fruitful representations from images or speeches, capturing local dependency and slight-distortion invariance. CNN has recently been applied to the problem of activity recognition, where 1D kernels are applied to capture local dependency over time in a series of observations measured at inertial sensors (3-axis accelerometers and gyroscopes). In this paper we present a multi-modal CNN where we use 2D kernels in both convolutional and pooling layers, to capture local dependency over time as well as spatial dependency over sensors. Experiments on benchmark datasets demonstrate the high performance of our multi-modal CNN, compared to several state of the art methods.
引用
收藏
页码:3017 / 3022
页数:6
相关论文
共 50 条
  • [1] Multi-modal Information Extraction and Fusion with Convolutional Neural Networks
    Kumar, Dinesh
    Sharma, Dharmendra
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [2] Learning Confidence Measures by Multi-modal Convolutional Neural Networks
    Fu, Zehua
    Ardabilian Fard, Mohsen
    [J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1321 - 1330
  • [3] Multi-modal page stream segmentation with convolutional neural networks
    Wiedemann, Gregor
    Heyer, Gerhard
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2021, 55 (01) : 127 - 150
  • [4] Multi-Modal Reflection Removal Using Convolutional Neural Networks
    Sun, Jun
    Chang, Yakun
    Jung, Cheolkon
    Feng, Jiawei
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (07) : 1011 - 1015
  • [5] Multi-Modal Depth Estimation Using Convolutional Neural Networks
    Siddiqui, Sadique Adnan
    Vierling, Axel
    Berns, Karsten
    [J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR 2020), 2020, : 354 - 359
  • [6] Multi-modal page stream segmentation with convolutional neural networks
    Gregor Wiedemann
    Gerhard Heyer
    [J]. Language Resources and Evaluation, 2021, 55 : 127 - 150
  • [7] Activity Recognition from Multi-modal Sensor Data Using a Deep Convolutional Neural Network
    Taherkhani, Aboozar
    Cosma, Georgina
    Alani, Ali A.
    McGinnity, T. M.
    [J]. INTELLIGENT COMPUTING, VOL 2, 2019, 857 : 203 - 218
  • [8] Speech recognition with multi-modal features based on neural networks
    Kim, Myung Won
    Ryu, Joung Woo
    Kim, Eun Ju
    [J]. NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 489 - 498
  • [9] Multi-modal Brain Tumor Segmentation Utilizing Convolutional Neural Networks
    Jakab, Marek
    Stevuliak, Marek
    Benesova, Wanda
    [J]. TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [10] Multi-information Complementarity Neural Networks for Multi-Modal Action Recognition
    Ding, Chuan
    Tie, Yun
    Qi, Lin
    [J]. 2019 8TH INTERNATIONAL SYMPOSIUM ON NEXT GENERATION ELECTRONICS (ISNE), 2019,