Skeleton-Guided Action Recognition with Multistream 3D Convolutional Neural Network for Elderly-Care Robot

被引:0
|
作者
Zhang, Dawei [1 ]
Zhang, Yanmin [2 ]
Zhou, Meng [1 ]
机构
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Henan, Peoples R China
[2] Zhengzhou Univ, Sch Elect & Informat Engn, Zhengzhou 450001, Henan, Peoples R China
基金
中国国家自然科学基金;
关键词
action recognition; deep learning; service robots; 2-STREAM;
D O I
10.1002/aisy.202300326
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the arrival of a global aging society, elderly-care robots are becoming more and more attractive and can provide better caring services through action recognition. This article presents a skeleton-guided action recognition framework with multistream 3D convolutional neural network. Two parallel dual-stream lightweight networks are proposed to enhance the feature extraction ability of human action and meanwhile reduce computation. Two different modes of skeleton input video are constructed to improve the recognition accuracy by decision fusion. The backbone networks adopt Resnet-18, the feature fusion layer and sliding window mechanism are both designed, and two cross-entropy losses are used to supervise their training. A dataset (named elder care action recognition (EC-AR)) with different categories of action is built. The experimental results on HMDB-51 and EC-AR datasets both demonstrate that the proposed framework outperforms the existing methods. The developed method is also applied to a prototype of elderly-care robots, and the test results in home scenarios show that it still has high recognition accuracy and good real-time performance. This article presents a skeleton-guided action recognition framework with multistream 3D convolutional neural network for elderly-care robot. Two parallel dual-stream Light-SlowFast networks based on ResNet-18 are proposed to enhance the feature extraction ability of human action and meanwhile reduce computation. Two different modes of skeleton input video are constructed to improve the recognition accuracy by decision fusion.image & COPY; 2023 WILEY-VCH GmbH
引用
收藏
页数:11
相关论文
共 50 条
  • [21] An Improved Two-stream 3D Convolutional Neural Network for Human Action Recognition
    Chen, Jun
    Xu, Yuanping
    Zhang, Chaolong
    Xu, Zhijie
    Meng, Xiangxiang
    Wang, Jie
    2019 25TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC), 2019, : 135 - 140
  • [22] ENHANCED ACTION RECOGNITION WITH VISUAL ATTRIBUTE-AUGMENTED 3D CONVOLUTIONAL NEURAL NETWORK
    Wang, Yunfeng
    Zhou, Wengang
    Zhang, Qilin
    Li, Houqiang
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [23] Intrinsic Symmetry Detection on 3D Models with Skeleton-guided Combination of Extrinsic Symmetries
    Wang, Wencheng
    Ma, Junhui
    Xu, Panpan
    Chu, Yiyao
    COMPUTER GRAPHICS FORUM, 2019, 38 (07) : 617 - 628
  • [24] ASMGCN: Attention-Based Semantic-Guided Multistream Graph Convolution Network for Skeleton Action Recognition
    Zhang, Moyan
    Quan, Zhenzhen
    Wang, Wei
    Chen, Zhe
    Guo, Xiaoshan
    Li, Yujun
    IEEE SENSORS JOURNAL, 2024, 24 (12) : 20064 - 20075
  • [25] A 3D Tensor Representation of Speech and 3D Convolutional Neural Network for Emotion Recognition
    Mohammad Reza Falahzadeh
    Fardad Farokhi
    Ali Harimi
    Reza Sabbaghi-Nadooshan
    Circuits, Systems, and Signal Processing, 2023, 42 : 4271 - 4291
  • [26] A 3D Tensor Representation of Speech and 3D Convolutional Neural Network for Emotion Recognition
    Falahzadeh, Mohammad Reza
    Farokhi, Fardad
    Harimi, Ali
    Sabbaghi-Nadooshan, Reza
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (07) : 4271 - 4291
  • [27] Automatic 3D Pollen Recognition Based on Convolutional Neural Network
    Wang, Zhuo
    Wang, Zixuan
    Wang, Likai
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [28] 3D Convolutional Neural Network based on memristor for video recognition
    Liu, Jiaqi
    Li, Zhenghao
    Tang, Yongliang
    Hu, Wei
    Wu, Jun
    PATTERN RECOGNITION LETTERS, 2020, 130 (130) : 116 - 124
  • [29] Facial Expression Recognition Using 3D Convolutional Neural Network
    Byeon, Young-Hyen
    Kwak, Keun-Chang
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (12) : 107 - 112
  • [30] END-TO-END LEARNING OF DEEP CONVOLUTIONAL NEURAL NETWORK FOR 3D HUMAN ACTION RECOGNITION
    Li, Chao
    Sun, Shouqian
    Min, Xin
    Lin, Wenqian
    Nie, Binling
    Zhang, Xianfu
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,