Real-time multimodal ADL recognition using convolution neural networks

Cited by: 0
Authors
Danushka Madhuranga
Rivindu Madushan
Chathuranga Siriwardane
Kutila Gunasekera
Affiliation
[1] University of Moratuwa, Department of Computer Science and Engineering
Source
The Visual Computer | 2021 / Vol. 37
Keywords
Activity recognition; Depth images; Video classification; Data fusion; Silhouette extraction;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Activities of daily living (ADLs) are the activities that humans perform every day of their lives. Walking, sleeping, eating and drinking are examples of ADLs. Compared to RGB video-based recognition, depth video-based activity recognition is less intrusive and eliminates many privacy concerns, which is crucial for applications such as life-logging and ambient assisted living systems. Existing methods rely on handcrafted features for depth video classification and ignore the importance of the audio stream. In this paper, we propose an ADL recognition system that relies on both audio and depth modalities. We propose to adapt popular convolutional neural network (CNN) architectures used for RGB video analysis to classify depth videos. The adaptation poses two challenges: (1) depth data are much noisier and (2) our depth dataset is much smaller than RGB video datasets. To tackle these challenges, we extract silhouettes from depth data prior to model training and alter the deep networks to be shallower. To the best of our knowledge, this is the first work to use a CNN to segment silhouettes from depth images and to fuse depth data with audio data to recognize ADLs. We further extend the proposed techniques to build a real-time ADL recognition system.
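The abstract does not state how the depth and audio streams are combined, so the following is only a minimal sketch of one common option, score-level (late) fusion, and should not be read as the authors' method. It assumes each modality has its own classifier that emits one logit per activity class. The function names, the `depth_weight` parameter and the example labels and logits are all hypothetical.

```python
import math


def softmax(logits):
    """Convert raw class scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_scores(depth_logits, audio_logits, depth_weight=0.5):
    """Late fusion (an assumption, not the paper's stated method):
    a weighted average of the per-class probabilities of the two modalities."""
    if len(depth_logits) != len(audio_logits):
        raise ValueError("both modalities must score the same set of classes")
    if not 0.0 <= depth_weight <= 1.0:
        raise ValueError("depth_weight must lie in [0, 1]")
    p_depth = softmax(depth_logits)
    p_audio = softmax(audio_logits)
    return [
        depth_weight * d + (1.0 - depth_weight) * a
        for d, a in zip(p_depth, p_audio)
    ]


def predict(labels, fused):
    """Return the label with the highest fused probability."""
    best = max(range(len(fused)), key=fused.__getitem__)
    return labels[best]


# Hypothetical labels and logits, for illustration only.
labels = ["walking", "eating", "drinking"]
depth_logits = [2.0, 1.0, 0.1]  # the depth model favours "walking"
audio_logits = [0.1, 3.0, 0.2]  # the audio model favours "eating"

fused = fuse_scores(depth_logits, audio_logits)
print(predict(labels, fused))
```

With equal weights the more confident audio prediction decides the result in this toy case. Setting `depth_weight=1.0` reduces the system to the depth model alone, which makes the contribution of the audio stream easy to inspect.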
Pages: 1263 - 1276
Page count: 13
Related papers
50 in total
  • [31] Gait recognition using multichannel convolution neural networks
    Wang, Xiuhui
    Zhang, Jiajia
    Yan, Wei Qi
    Neural Computing and Applications, 2020, 32 : 14275 - 14285
  • [32] Handwritten Digit Recognition using Convolution Neural Networks
    Rajput, Shailesh S.
    Choi, Yoonsuk
    2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 163 - 168
  • [33] Gait recognition using multichannel convolution neural networks
    Wang, Xiuhui
    Zhang, Jiajia
    Yan, Wei Qi
    Neural Computing and Applications, 2020, 32 (18) : 14275 - 14285
  • [34] Real-time construction of neural networks
    Li, Kang
    Peng, Jian Xun
    Fei, Minrui
    ARTIFICIAL NEURAL NETWORKS - ICANN 2006, PT 1, 2006, 4131 : 140 - 149
  • [35] Neural networks for real-time control
    Narendra, KS
    PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 1026 - 1031
  • [36] TimeConvNets: A Deep Time Windowed Convolution Neural Network Design for Real-time Video Facial Expression Recognition
    Lee, James Ren Hou
    Wong, Alexander
    2020 17TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2020), 2020, : 9 - 16
  • [37] REAL-TIME CONTROL OF A TOKAMAK PLASMA USING NEURAL NETWORKS
    BISHOP, CM
    HAYNES, PS
    SMITH, MEU
    TODD, TN
    TROTMAN, DL
    NEURAL COMPUTATION, 1995, 7 (01) : 206 - 217
  • [38] Schedulability checking in real-time systems using neural networks
    Davoli, Renzo
    Tamburini, Fabio
    Giachini, Luigi-Alberto
    Fiumana, Franca
    Journal of artificial neural networks, 1995, 2 (04) : 421 - 430
  • [39] Multimodal reaching-position prediction for ADL support using neural networks
    Takase, Yutaka
    Yamazaki, Kimitoshi
    ROBOMECH JOURNAL, 2024, 11 (01):
  • [40] Real-time head orientation estimation using neural networks
    Zhao, L
    Pingali, G
    Carlbom, I
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2002, : 297 - 300