Learning Action Images Using Deep Convolutional Neural Networks For 3D Action Recognition

被引:0
|
作者
Thien Huynh-The [1 ]
Hua, Cam-Hao [2 ]
Kim, Dong-Seong [1 ]
机构
[1] Kumoh Natl Inst Technol, ICT Convergence Res Ctr, Gumi, South Korea
[2] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin, South Korea
基金
新加坡国家研究基金会;
关键词
Pose feature to image encoding technique; deep convolutional neural networks; human action recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, 3D action recognition has received more attention of research and industrial communities thanks to the popularity of depth sensors and the efficiency of skeleton estimation algorithms. Accordingly, a large number of methods have been studied by using either handcrafted features with traditional classifiers or recurrent neural networks. However, they cannot learn high-level spatial and temporal features of a whole skeleton sequence exhaustively. In this paper, we proposed a novel encoding technique to transform the pose features of joint-joint distance and joint-joint orientation to color pixels. By concatenating the features of all frames in a sequence, the spatial joint correlations and temporal pose dynamics of action appearance are depicted by a color image. For learning action models, we adopt the strategy of end-to-end fine-tuning a pretrained deep convolutional neural networks to completely capture multiple high-level features at multi-scale action representation. The proposed method achieves the state-of-the-art performance on NTU RGB+D, the largest and most challenging 3D action recognition dataset, for both the cross-subject and cross-view evaluation protocols.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Gesture recognition based on deep deformable 3D convolutional neural networks
    Zhang, Yifan
    Shi, Lei
    Wu, Yi
    Cheng, Ke
    Cheng, Jian
    Lu, Hanqing
    [J]. PATTERN RECOGNITION, 2020, 107
  • [42] Keys for Action: An Efficient Keyframe-Based Approach for 3D Action Recognition Using a Deep Neural Network
    Yasin, Hashim
    Hussain, Mazhar
    Weber, Andreas
    [J]. SENSORS, 2020, 20 (08)
  • [43] SIGN LANGUAGE RECOGNITION USING 3D CONVOLUTIONAL NEURAL NETWORKS
    Huang, Jie
    Zhou, Wengang
    Li, Houqiang
    Li, Weiping
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,
  • [44] Fixed-Point Quantization of 3D Convolutional Neural Networks for Energy-Efficient Action Recognition
    Lee, Hyunhoon
    Byun, Younghoon
    Hwang, Seokha
    Lee, Sunggu
    Lee, Youngjoo
    [J]. 2018 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2018, : 129 - 130
  • [45] Stratified pooling based deep convolutional neural networks for human action recognition
    Yu, Sheng
    Cheng, Yun
    Su, Songzhi
    Cai, Guorong
    Li, Shaozi
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (11) : 13367 - 13382
  • [46] Deep Residual Split Directed Graph Convolutional Neural Networks for Action Recognition
    Fu, Bo
    Fu, Shilin
    Wang, Liyan
    Dong, Yuhan
    Ren, Yonggong
    [J]. IEEE MULTIMEDIA, 2020, 27 (04) : 9 - 17
  • [47] Stratified pooling based deep convolutional neural networks for human action recognition
    Sheng Yu
    Yun Cheng
    Songzhi Su
    Guorong Cai
    Shaozi Li
    [J]. Multimedia Tools and Applications, 2017, 76 : 13367 - 13382
  • [48] An improved memristor-based 3D Convolutional Neural Network for action recognition
    Wang, Yining
    Li, Ke
    Shen, Siyuan
    Duan, Shukai
    [J]. Proceedings of SPIE - The International Society for Optical Engineering, 2023, 12707
  • [49] Pose prediction using 3D deep convolutional neural networks
    Wallach, Izhar
    Dzamba, Michael
    Schrodl, Stefan
    Rampasek, Ladislav
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 254
  • [50] 3D CONVOLUTIONAL NEURAL NETWORK WITH MULTI-MODEL FRAMEWORK FOR ACTION RECOGNITION
    Jing, Longlong
    Ye, Yuancheng
    Yang, Xiaodong
    Tian, Yingli
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1837 - 1841