Implicit Semantic Data Augmentation for Hand Pose Estimation

被引:4
|
作者
Seo, Kyeongeun [1 ]
Cho, Hyeonjoong [2 ]
Choi, Daewoong [2 ]
Park, Ju-Derk [3 ]
机构
[1] Korea Elect Technol Inst, Informat Media Res Ctr, Seoul 03924, South Korea
[2] Korea Univ, Dept Comp Convergence Software, Sejong 30019, South Korea
[3] Elect & Telecommun Res Inst, Daejeon 34129, South Korea
关键词
Pose estimation; Semantics; Training data; Task analysis; Neural networks; Interpolation; Data models; Hand pose estimation; data augmentation; semantic learning; feature learning; 3D HAND; NETWORK;
D O I
10.1109/ACCESS.2022.3197749
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data augmentation is a well-known technique used for improving the generalization performance of modern neural networks. After the success of several traditional random data augmentation for images (including flipping, translation, or rotation), a recent surge of interest in implicit data augmentation techniques occurs to complement random data augmentation techniques. Implicit data augmentation augments training samples in feature space, rather than in pixel space, resulting in the generation of semantically meaningful data. Several techniques on implicit data augmentation have been introduced for classification tasks. However, few approaches have been introduced for regression tasks with continuous/structured labels, such as object pose estimation. Hence, we are motivated to propose a method for implicit semantic data augmentation for hand pose estimation. By considering semantic distances of hand poses, the proposed method implicitly generates extra training samples in feature space. We propose two additional techniques to improve the performance of this augmentation: metric learning and hand-dependent augmentation. Metric learning aims to learn feature representations to reflect the semantic distance of data. For hand pose estimation, the distribution of augmented hand poses can be regulated by managing the distribution of feature representations. Meanwhile, hand-dependent augmentation is specifically designed for hand pose estimation to prevent semantically meaningless hand poses from being generated (e.g., hands generated by simple interpolation between both hands). Further, we demonstrate the effectiveness of the proposed technique using two well-known hand pose datasets: STB and RHD.
引用
收藏
页码:84680 / 84688
页数:9
相关论文
共 50 条
  • [1] A Survey on GAN-Based Data Augmentation for Hand Pose Estimation Problem
    Farahanipad, Farnaz
    Rezaei, Mohammad
    Nasr, Mohammad Sadegh
    Kamangar, Farhad
    Athitsos, Vassilis
    TECHNOLOGIES, 2022, 10 (02)
  • [2] Implicit Semantic Data Augmentation for Deep Networks
    Wang, Yulin
    Pan, Xuran
    Song, Shiji
    Zhang, Hong
    Wu, Cheng
    Huang, Gao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [3] Optimization of Swim Pose Estimation and Recognition with Data Augmentation
    Ouyang, Jonathan
    Trinh, Derrick
    Choo, Chang Charles
    2024 IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS AND INTERPRETATION, SSIAI, 2024, : 101 - +
  • [4] Adaptive data augmentation network for human pose estimation
    Wang, Dong
    Xie, Wenjun
    Cai, Youcheng
    Liu, Xiaoping
    DIGITAL SIGNAL PROCESSING, 2022, 129
  • [5] When Active Learning Meets Implicit Semantic Data Augmentation
    Chen, Zhuangzhuang
    Zhang, Jin
    Wang, Pan
    Chen, Jie
    Li, Jianqiang
    COMPUTER VISION, ECCV 2022, PT XXV, 2022, 13685 : 56 - 72
  • [6] Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation
    Peng, Xi
    Tang, Zhiqiang
    Yang, Fei
    Feris, Rogerio S.
    Metaxas, Dimitris
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2226 - 2234
  • [7] Monocular 3D Pose Estimation via Pose Grammar and Data Augmentation
    Xu, Yuanlu
    Wang, Wenguan
    Liu, Tengyu
    Liu, Xiaobai
    Xie, Jianwen
    Zhu, Song-Chun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6327 - 6344
  • [8] Implicit Gradient-Modulated Semantic Data Augmentation for Deep Crack Recognition
    Chen, Zhuangzhuang
    Lu, Ronghao
    Chen, Jie
    Song, Houbing Herbert
    Li, Jianqiang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 16084 - 16095
  • [9] Depth-Based Hand Pose Estimation: Methods, Data, and Challenges
    Supancic, James Steven, III
    Rogez, Gregory
    Yang, Yi
    Shotton, Jamie
    Ramanan, Deva
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (11) : 1180 - 1198
  • [10] Depth-based hand pose estimation: data, methods, and challenges
    Supancic, James Steven, III
    Rogez, Gregory
    Yang, Yi
    Shotton, Jamie
    Ramanan, Deva
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1868 - 1876