Skeleton-Guided Action Recognition with Multistream 3D Convolutional Neural Network for Elderly-Care Robot

Authors
Zhang, Dawei [1]
Zhang, Yanmin [2]
Zhou, Meng [1]
Affiliations
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Henan, Peoples R China
[2] Zhengzhou Univ, Sch Elect & Informat Engn, Zhengzhou 450001, Henan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
action recognition; deep learning; service robots; 2-STREAM
DOI
10.1002/aisy.202300326
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the arrival of a global aging society, elderly-care robots are attracting growing interest, and action recognition allows them to provide better care services. This article presents a skeleton-guided action recognition framework with a multistream 3D convolutional neural network for elderly-care robots. Two parallel dual-stream lightweight (Light-SlowFast) networks with ResNet-18 backbones are proposed to enhance the extraction of human-action features while reducing computation. Two different modes of skeleton input video are constructed to improve recognition accuracy through decision fusion. A feature fusion layer and a sliding window mechanism are designed, and two cross-entropy losses supervise the training. A dataset named elder care action recognition (EC-AR), covering different categories of action, is built. Experimental results on the HMDB-51 and EC-AR datasets both show that the proposed framework outperforms existing methods. The method is also applied to a prototype elderly-care robot, and tests in home scenarios show that it retains high recognition accuracy and good real-time performance.
© 2023 WILEY-VCH GmbH
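The abstract names three mechanisms without detailing them: decision fusion over two skeleton input modes, a sliding window over the video, and cross-entropy supervision. The following is a minimal sketch of what these could look like, not the authors' implementation. The weighted averaging of softmax scores, the `weight_a` parameter, the window and stride values, and all function names are assumptions made for illustration only.

```python
import math


def softmax(logits):
    """Convert raw class scores into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_decisions(logits_a, logits_b, weight_a=0.5):
    """Late (decision-level) fusion of two streams.

    Assumption: the fusion rule is a weighted average of per-class
    probabilities; the source does not state the actual rule.
    """
    probs_a = softmax(logits_a)
    probs_b = softmax(logits_b)
    return [weight_a * a + (1.0 - weight_a) * b
            for a, b in zip(probs_a, probs_b)]


def sliding_windows(num_frames, window, stride):
    """Return (start, end) frame index pairs for fixed-length clips.

    Assumption: windows are fixed length and any trailing partial
    window is dropped.
    """
    return [(start, start + window)
            for start in range(0, num_frames - window + 1, stride)]


def cross_entropy(probs, label):
    """Cross-entropy loss for one sample given predicted probabilities."""
    return -math.log(probs[label])


# Hypothetical scores from the two skeleton-input networks for 3 classes.
fused = fuse_decisions([3.0, 0.0, 0.0], [1.0, 0.0, 2.0])
predicted_class = max(range(len(fused)), key=fused.__getitem__)
clips = sliding_windows(num_frames=10, window=4, stride=3)
```

In a deployment like the one the abstract describes, each clip produced by `sliding_windows` would be passed through both networks and the fused scores would give one prediction per window. A learned or validation-tuned `weight_a` is one possible alternative to the equal weighting assumed here.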
Pages: 11