One-shot learning gesture recognition based on joint training of 3D ResNet and memory module

被引:10
|
作者
Li, Lianwei [1 ]
Qin, Shiyin [1 ,2 ]
Lu, Zhi [1 ]
Xu, Kuanhong [3 ]
Hu, Zhongying [3 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Dongguan Univ Technol, Sch Elect Engn & Intelligentizat, Dongguan 523808, Guangdong, Peoples R China
[3] Sony China Res Lab, Artificial Intelligence Res Dept, Beijing 100028, Peoples R China
基金
中国国家自然科学基金;
关键词
Gesture recognition; One-shot learning; Joint training; 3D ResNet; Memory module; RGB-D DATA; DATASET;
D O I
10.1007/s11042-019-08429-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As a research hotspot in the field of human-machine interaction, a great progress of hand gesture recognition has been achieved with the development of deep learning of neural networks. However, in the deep learning based recognition methods, it is necessary to rely heavily on large-scale labeled dataset which is very hard to build in practical applications. In order to achieve a well performance under some strict constraint of few sample data, one-shot learning gesture recognition is studied and a joint deep training method by combination of 3D ResNet with a memory module is presented in this paper. In our scheme a combinatorial optimization of feature extraction by 3D ResNet with memory capacity of rare event by memory module is carried out with an effective strategy of optimal decision and two relative performance indices. In order to implement one-shot learning gesture recognition, the memory module is employed to remember the features extracted by well-trained 3D ResNet and the classification decision is performed by the nearest neighbor algorithm with cosine similarity measure. In view of real-world applications about human-machine interaction technology, its ability to deal with negative samples plays a significant role thus a mechanism based on the threshold of cosine similarity is built to realize effective classification and rejection respectively. In order to validate and evaluate the performance of our proposed method, a special hand gesture dataset containing 3045 gesture videos is built and a series of experiment results on our collected dataset and public datasets demonstrate the feasibility and effectiveness of our method.
引用
收藏
页码:6727 / 6757
页数:31
相关论文
共 50 条
  • [21] Biomechanical-based Approach to Data Augmentation for One-Shot Gesture Recognition
    Cabrera, Maria E.
    Wachs, Juan P.
    PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 38 - 44
  • [22] Skeleton-Based Dynamic Hand Gesture Recognition Using an Enhanced Network with One-Shot Learning
    Ma, Chunyong
    Zhang, Shengsheng
    Wang, Anni
    Qi, Yongyang
    Chen, Ge
    APPLIED SCIENCES-BASEL, 2020, 10 (11):
  • [23] BoMW: Bag of Manifold Words for One-Shot Learning Gesture Recognition From Kinect
    Zhang, Lei
    Zhang, Shengping
    Jiang, Feng
    Qi, Yuankai
    Zhang, Jun
    Guo, Yuliang
    Zhou, Huiyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 2562 - 2573
  • [24] Adaptive Local Spatiotemporal Features from RGB-D Data for One-Shot Learning Gesture Recognition
    Lin, Jia
    Ruan, Xiaogang
    Yu, Naigong
    Yang, Yee-Hong
    SENSORS, 2016, 16 (12)
  • [25] Explore Efficient Local Features from RGB-D Data for One-Shot Learning Gesture Recognition
    Wan, Jun
    Guo, Guodong
    Li, Stan Z.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (08) : 1626 - 1639
  • [26] 3D Photography with One-shot Portrait Relighting
    Liu, Yunfei
    Wen, Sijia
    Lu, Feng
    2021 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY ADJUNCT PROCEEDINGS (ISMAR-ADJUNCT 2021), 2021, : 145 - 146
  • [27] One-shot active 3D image capture
    Proesmans, M
    VanGool, L
    THREE-DIMENSIONAL IMAGE CAPTURE, 1997, 3023 : 50 - 61
  • [28] One-shot 3D gradient field scanning
    Di Martino, J. Matias
    Fernandez, Alicia
    Ferrari, Jose A.
    OPTICS AND LASERS IN ENGINEERING, 2015, 72 : 26 - 38
  • [29] 3D Digital Model of Folk Dance Based on Few-Shot Learning and Gesture Recognition
    Zhang, Ning
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [30] HIERARCHICAL TEMPORAL MEMORY ENHANCED ONE-SHOT DISTANCE LEARNING FOR ACTION RECOGNITION
    Zou, Yixiong
    Shi, Yemin
    Wang, Yaowei
    Shu, Yu
    Yuan, Qingsheng
    Tian, Yonghong
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,