Encoding Pose Features to Images With Data Augmentation for 3-D Action Recognition

Cited by: 61
Authors
Huynh-The, Thien [1 ,2 ]
Hua, Cam-Hao [3 ]
Kim, Dong-Seong [1 ,2 ]
Affiliations
[1] Kumoh Natl Inst Technol, Dept IT Convergence Engn, Gumi 39177, South Korea
[2] Kumoh Natl Inst Technol, ICT Convergence Res Ctr, Gumi 39177, South Korea
[3] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin 446701, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Data augmentation; deep convolutional neural networks (DCNNs); human action recognition; pose feature to image (PoF2I) encoding technique;
DOI
10.1109/TII.2019.2910876
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Recently, numerous methods have been introduced for three-dimensional (3-D) action recognition using handcrafted feature descriptors coupled with traditional classifiers. However, they cannot exhaustively learn high-level features of a whole skeleton sequence. In this paper, a novel encoding technique, namely pose feature to image (PoF2I), is introduced to transform the pose features of joint-joint distance and orientation into color pixels. By concatenating the features of all skeleton frames in a sequence, a color image is generated to depict the spatial joint correlations and temporal pose dynamics of an action appearance. The strategy of end-to-end fine-tuning of a pretrained deep convolutional neural network, which completely captures multiple high-level features at multiscale action representation, is implemented for learning recognition models. We further propose an efficient data augmentation mechanism for informative enrichment and overfitting prevention. The experimental results on six challenging 3-D action recognition datasets demonstrate that the proposed method outperforms state-of-the-art approaches.
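The encoding idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `pose_features_to_image`, the channel layout (distance, azimuth, elevation), and the min-max normalization are all assumptions chosen for illustration, since the abstract does not specify how distances and orientations are mapped to colors.

```python
import numpy as np


def pose_features_to_image(skeleton):
    """Encode a skeleton sequence as a color image (illustrative sketch only).

    skeleton: array of shape (T, J, 3) holding T frames of J 3-D joints.
    Returns a uint8 array of shape (P, T, 3), where P = J * (J - 1) / 2
    joint pairs form the rows and the T frames form the columns.
    """
    skeleton = np.asarray(skeleton, dtype=float)
    num_joints = skeleton.shape[1]

    # All unordered joint pairs (i < j).
    i, j = np.triu_indices(num_joints, k=1)
    diff = skeleton[:, j, :] - skeleton[:, i, :]      # (T, P, 3)

    # Joint-joint distance, min-max normalized over the whole sequence
    # (assumed normalization, not taken from the paper).
    dist = np.linalg.norm(diff, axis=-1)              # (T, P)
    span = max(dist.max() - dist.min(), 1e-8)
    dist_norm = (dist - dist.min()) / span

    # Joint-joint orientation as two angles, each rescaled to [0, 1]
    # (assumed parameterization of "orientation").
    azimuth = np.arctan2(diff[..., 1], diff[..., 0])  # [-pi, pi]
    ratio = np.clip(diff[..., 2] / np.maximum(dist, 1e-8), -1.0, 1.0)
    elevation = np.arcsin(ratio)                      # [-pi/2, pi/2]
    az_norm = (azimuth + np.pi) / (2.0 * np.pi)
    el_norm = (elevation + np.pi / 2.0) / np.pi

    # Stack the three features as color channels and put time on the columns.
    image = np.stack([dist_norm, az_norm, el_norm], axis=-1)  # (T, P, 3)
    image = np.transpose(image, (1, 0, 2))                    # (P, T, 3)
    return np.rint(image * 255.0).astype(np.uint8)
```

Under these assumptions, each column of the resulting image describes one pose and each row traces one joint pair over time, so a sequence of 8 frames with 5 joints yields a 10 x 8 color image that could be resized and fed to a pretrained convolutional network.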
Pages: 3100-3111
Number of pages: 12