Encoding Pose Features to Images With Data Augmentation for 3-D Action Recognition

被引:61
|
作者
Huynh-The, Thien [1 ,2 ]
Hua, Cam-Hao [3 ]
Kim, Dong-Seong [1 ,2 ]
机构
[1] Kumoh Natl Inst Technol, Dept IT Convergence Engn, Gumi 39177, South Korea
[2] Kumoh Natl Inst Technol, ICT Convergence Res Ctr, Gumi 39177, South Korea
[3] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin 446701, South Korea
基金
新加坡国家研究基金会;
关键词
Data augmentation; deep convolutional neural networks (DCNNs); human action recognition; pose feature to image (PoF2I) encoding technique;
D O I
10.1109/TII.2019.2910876
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, numerous methods have been introduced for three-dimensional (3-D) action recognition using handcrafted feature descriptors coupled traditional classifiers. However, they cannot learn high-level features of a whole skeleton sequence exhaustively. In this paper, a novel encoding technique-namely, pose feature to image (PoF2I), is introduced to transform the pose features of joint-joint distance and orientation to color pixels. By concatenating the features of all skeleton frames in a sequence, a color image is generated to depict spatial joint correlations and temporal pose dynamics of an action appearance. The strategy of end-to-end fine-tuning a pretrained deep convolutional neural network, which completely capture multiple high-level features at multiscale action representation, is implemented for learning recognition models. We further propose an efficient data augmentation mechanism for informative enrichment and overfitting prevention. The experimental results on six challenging 3-D action recognition datasets demonstrate that the proposed method outperforms state-of-the-art approaches.
引用
收藏
页码:3100 / 3111
页数:12
相关论文
共 50 条
  • [1] SkeletonNet: Mining Deep Part Features for 3-D Action Recognition
    Ke, Qiuhong
    An, Senjian
    Bennamoun, Mohammed
    Sohel, Ferdous
    Boussaid, Farid
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (06) : 731 - 735
  • [2] INVARIANT DESCRIPTORS FOR 3-D OBJECT RECOGNITION AND POSE
    FORSYTH, D
    MUNDY, JL
    ZISSERMAN, A
    COELHO, C
    HELLER, A
    ROTHWELL, C
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1991, 13 (10) : 971 - 991
  • [3] Robust recognition and pose determination of 3-D objects using range images in eigenspace approach
    Skocaj, D
    Leonardis, A
    [J]. THIRD INTERNATIONAL CONFERENCE ON 3-D DIGITAL IMAGING AND MODELING, PROCEEDINGS, 2001, : 171 - 178
  • [4] Robust Pose Features for Action Recognition
    Lee, Hyungtae
    Morariu, Vlad I.
    Davis, Larry S.
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 365 - 372
  • [5] LiDAR-Based 3-D Human Pose Estimation and Action Recognition for Medical Scenes
    Wu, Xuan
    Zhang, Haiyang
    Kong, Chunxiu
    Wang, Yuanze
    Ju, Yezhao
    Zhao, Changming
    [J]. IEEE SENSORS JOURNAL, 2024, 24 (09) : 15531 - 15539
  • [6] A Method of 3-D Object Recognition and Pose Estimation Japan
    [J]. Denki Gakkai Ronbunshi C Denshi Joho Shisutemu Bumonshi, 11 (1325):
  • [7] Face recognition using 3-D models: Pose and illumination
    Romdhani, Sami
    Ho, Jeffrey
    Vetter, Thomas
    Kriegman, David J.
    [J]. PROCEEDINGS OF THE IEEE, 2006, 94 (11) : 1977 - 1999
  • [8] Invariant features for 3-D gesture recognition
    Campbell, LW
    Becker, DA
    Azarbayejani, A
    Bobick, AF
    Pentland, A
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, 1996, : 157 - 162
  • [9] Pose-independent automatic target detection and recognition using 3-D ladar data
    Vasile, A
    Marino, RM
    [J]. AUTOMATIC TARGET RECOGNITION XIV, 2004, 5426 : 67 - 83
  • [10] Monocular 3D Pose Estimation via Pose Grammar and Data Augmentation
    Xu, Yuanlu
    Wang, Wenguan
    Liu, Tengyu
    Liu, Xiaobai
    Xie, Jianwen
    Zhu, Song-Chun
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6327 - 6344