Uncovering Human Multimodal Activity Recognition with a Deep Learning Approach

被引:8
|
作者
Ranieri, Caetano M. [1 ]
Vargas, Patricia A. [2 ]
Romero, Roseli A. F. [1 ]
机构
[1] Univ Sao Paulo, ICMC, Sao Carlos, SP, Brazil
[2] Heriot Watt Univ HWU, Edinburgh Ctr Robot ECR, Edinburgh, Midlothian, Scotland
基金
巴西圣保罗研究基金会;
关键词
Deep learning; CNN; LSTM; TCN; RNN; human activity recognition; human-robot-interaction; CHALLENGE; NETWORKS;
D O I
10.1109/ijcnn48605.2020.9207255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent breakthroughs on deep learning and computer vision have encouraged the use of multimodal human activity recognition aiming at applications in human-robot-interaction. The wide availability of videos at online platforms has made this modality one of the most promising for this task, whereas some researchers have tried to enhance the video data with wearable sensors attached to human subjects. However, temporal information on both video and inertial sensors are still under investigation. Most of the current work focusing on daily activities do not present comparative studies considering different temporal approaches. In this paper, we are proposing a new model build upon a Two-Stream ConvNet for action recognition, enhanced with Long Short-Term Memory (LSTM) and a Temporal Convolution Networks (TCN) to investigate the temporal information on videos and inertial sensors. A feature-level fusion approach prior to temporal modelling is also proposed and evaluated. Experiments have been conducted on the egocentric multimodal dataset and on the UTD-MHAD. LSTM and TCN showed competitive results, with the TCN performing slightly better for most applications. The feature-level fusion approach also performed well on the UTD-MHAD with some overfitting on the egocentric multimodal dataset. Overall the proposed model presented promising results on both datasets compatible with the state-of-the-art, providing insights on the use of deep learning for human-robot-interaction applications.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] A Deep Learning and Multimodal Ambient Sensing Framework for Human Activity Recognition
    Yachir, Ali
    Amamra, Abdenour
    Djamaa, Badis
    Zerrouki, Ali
    Amour, Ahmed KhierEddine
    [J]. PROCEEDINGS OF THE 2019 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2019, : 101 - 105
  • [2] A Deep Learning Approach for Human Activities Recognition From Multimodal Sensing Devices
    Ihianle, Isibor Kennedy
    Nwajana, Augustine O.
    Ebenuwa, Solomon Henry
    Otuka, Richard, I
    Owa, Kayode
    Orisatoki, Mobolaji O.
    [J]. IEEE ACCESS, 2020, 8 : 179028 - 179038
  • [3] Human Activity Recognition in Smart Home With Deep Learning Approach
    Mehr, Homay Danaei
    Polat, Huseyin
    [J]. 2019 7TH INTERNATIONAL ISTANBUL SMART GRIDS AND CITIES CONGRESS AND FAIR (ICSG ISTANBUL 2019), 2019, : 149 - 153
  • [4] A Classifier Approach using Deep Learning for Human Activity Recognition
    Rawat, Sarthak Singh
    Bisht, Abhishek
    Nijhawan, Rahul
    [J]. 2019 FIFTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP 2019), 2019, : 486 - 490
  • [5] A Multimodal Deep Learning Network for Group Activity Recognition
    Rossi, Silvia
    Capasso, Roberto
    Acampora, Giovanni
    Staffa, Mariacarla
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [6] A Multimodal Fusion Approach for Human Activity Recognition
    Koutrintzes, Dimitrios
    Spyrou, Evaggelos
    Mathe, Eirini
    Mylonas, Phivos
    [J]. INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2023, 33 (01)
  • [7] Deep learning based multimodal complex human activity recognition using wearable devices
    Ling Chen
    Xiaoze Liu
    Liangying Peng
    Menghan Wu
    [J]. Applied Intelligence, 2021, 51 : 4029 - 4042
  • [8] Deep learning based multimodal complex human activity recognition using wearable devices
    Chen, Ling
    Liu, Xiaoze
    Peng, Liangying
    Wu, Menghan
    [J]. APPLIED INTELLIGENCE, 2021, 51 (06) : 4029 - 4042
  • [9] Human Activity Recognition System Using Multimodal Sensor and Deep Learning Based on LSTM
    Shin, Soo-Yeun
    Cha, Joo-Heon
    [J]. TRANSACTIONS OF THE KOREAN SOCIETY OF MECHANICAL ENGINEERS A, 2018, 42 (02) : 111 - 121
  • [10] Deep learning for human activity recognition
    Li, Xiaoli
    Zhao, Peilin
    Wu, Min
    Chen, Zhenghua
    Zhang, Le
    [J]. Neurocomputing, 2021, 444 : 214 - 216