Real-time multimodal ADL recognition using convolution neural networks

被引:0
|
作者
Danushka Madhuranga
Rivindu Madushan
Chathuranga Siriwardane
Kutila Gunasekera
机构
[1] University of Moratuwa,Department of Computer Science and Engineering
来源
The Visual Computer | 2021年 / 37卷
关键词
Activity recognition; Depth images; Video classification; Data fusion; Silhouette extraction;
D O I
暂无
中图分类号
学科分类号
摘要
Activities of daily living (ADLs) are the activities which humans perform every day of their lives. Walking, sleeping, eating, drinking and sleeping are examples for ADLs. Compared to RGB videos, depth video-based activity recognition is less intrusive and eliminates many privacy concerns, which are crucial for applications such as life-logging and ambient assisted living systems. Existing methods rely on handcrafted features for depth video classification and ignore the importance of audio stream. In this paper, we propose an ADL recognition system that relies on both audio and depth modalities. We propose to adopt popular convolutional neural network (CNN) architectures used for RGB video analysis to classify depth videos. The adaption poses two challenges: (1) depth data are much nosier and (2) our depth dataset is much smaller compared RGB video datasets. To tackle those challenges, we extract silhouettes from depth data prior to model training and alter deep networks to be shallower. As per our knowledge, we used CNN to segment silhouettes from depth images and fused depth data with audio data to recognize ADLs for the first time. We further extended the proposed techniques to build a real-time ADL recognition system.
引用
收藏
页码:1263 / 1276
页数:13
相关论文
共 50 条
  • [1] Real-time multimodal ADL recognition using convolution neural networks
    Madhuranga, Danushka
    Madushan, Rivindu
    Siriwardane, Chathuranga
    Gunasekera, Kutila
    VISUAL COMPUTER, 2021, 37 (06): : 1263 - 1276
  • [2] Real-time Activity Recognition on Smartphones Using Deep Neural Networks
    Zhang, Licheng
    Wu, Xihong
    Luo, Dingsheng
    IEEE 12TH INT CONF UBIQUITOUS INTELLIGENCE & COMP/IEEE 12TH INT CONF ADV & TRUSTED COMP/IEEE 15TH INT CONF SCALABLE COMP & COMMUN/IEEE INT CONF CLOUD & BIG DATA COMP/IEEE INT CONF INTERNET PEOPLE AND ASSOCIATED SYMPOSIA/WORKSHOPS, 2015, : 1236 - 1242
  • [3] Abnormal Gait Recognition in Real-Time using Recurrent Neural Networks
    Jinnovart, Thanaporn
    C, Xiongcai
    Thonglek, Kundjanasith
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 972 - 977
  • [4] Real-time motion artifact suppression using convolution neural networks with penalty in fNIRS
    Huang, Ruisen
    Hong, Keum-Shik
    Bao, Shi-Chun
    Gao, Fei
    FRONTIERS IN NEUROSCIENCE, 2024, 18
  • [5] Multimodal Facial Emotion Recognition Using Improved Convolution Neural Networks Model
    Udeh, Chinonso Paschal
    Chen, Luefeng
    Du, Sheng
    Li, Min
    Wu, Min
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2023, 27 (04) : 710 - 719
  • [6] Towards Real-time Speech Emotion Recognition using Deep Neural Networks
    Fayek, H. M.
    Lech, M.
    Cavedon, L.
    2015 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2015,
  • [7] A Real-Time Hand Posture Recognition System Using Deep Neural Networks
    Tang, Ao
    Lu, Ke
    Wang, Yufei
    Huang, Jie
    Li, Houqiang
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2015, 6 (02)
  • [9] Real-time compact optoelectronics neural networks for face recognition
    Javidi, B
    Li, J
    PHOTONIC COMPONENT ENGINEERING AND APPLICATIONS, 1996, 2749 : 195 - 206
  • [10] Real-time license plate detection and recognition using deep convolutional neural networks
    Silva, Sergio Montazzolli
    Jung, Claudio Rosito
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 71