One-shot learning gesture recognition based on joint training of 3D ResNet and memory module

被引:10
|
作者
Li, Lianwei [1 ]
Qin, Shiyin [1 ,2 ]
Lu, Zhi [1 ]
Xu, Kuanhong [3 ]
Hu, Zhongying [3 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Dongguan Univ Technol, Sch Elect Engn & Intelligentizat, Dongguan 523808, Guangdong, Peoples R China
[3] Sony China Res Lab, Artificial Intelligence Res Dept, Beijing 100028, Peoples R China
基金
中国国家自然科学基金;
关键词
Gesture recognition; One-shot learning; Joint training; 3D ResNet; Memory module; RGB-D DATA; DATASET;
D O I
10.1007/s11042-019-08429-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As a research hotspot in the field of human-machine interaction, a great progress of hand gesture recognition has been achieved with the development of deep learning of neural networks. However, in the deep learning based recognition methods, it is necessary to rely heavily on large-scale labeled dataset which is very hard to build in practical applications. In order to achieve a well performance under some strict constraint of few sample data, one-shot learning gesture recognition is studied and a joint deep training method by combination of 3D ResNet with a memory module is presented in this paper. In our scheme a combinatorial optimization of feature extraction by 3D ResNet with memory capacity of rare event by memory module is carried out with an effective strategy of optimal decision and two relative performance indices. In order to implement one-shot learning gesture recognition, the memory module is employed to remember the features extracted by well-trained 3D ResNet and the classification decision is performed by the nearest neighbor algorithm with cosine similarity measure. In view of real-world applications about human-machine interaction technology, its ability to deal with negative samples plays a significant role thus a mechanism based on the threshold of cosine similarity is built to realize effective classification and rejection respectively. In order to validate and evaluate the performance of our proposed method, a special hand gesture dataset containing 3045 gesture videos is built and a series of experiment results on our collected dataset and public datasets demonstrate the feasibility and effectiveness of our method.
引用
收藏
页码:6727 / 6757
页数:31
相关论文
共 50 条
  • [1] One-shot learning gesture recognition based on joint training of 3D ResNet and memory module
    Lianwei Li
    Shiyin Qin
    Zhi Lu
    Kuanhong Xu
    Zhongying Hu
    Multimedia Tools and Applications, 2020, 79 : 6727 - 6757
  • [2] Real-time one-shot learning gesture recognition based on lightweight 3D Inception-ResNet with separable convolutions
    Lianwei Li
    Shiyin Qin
    Zhi Lu
    Dinghao Zhang
    Kuanhong Xu
    Zhongying Hu
    Pattern Analysis and Applications, 2021, 24 : 1173 - 1192
  • [3] Real-time one-shot learning gesture recognition based on lightweight 3D Inception-ResNet with separable convolutions
    Li, Lianwei
    Qin, Shiyin
    Lu, Zhi
    Zhang, Dinghao
    Xu, Kuanhong
    Hu, Zhongying
    PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (03) : 1173 - 1192
  • [4] One-shot learning hand gesture recognition based on modified 3d convolutional neural networks
    Lu, Zhi
    Qin, Shiyin
    Li, Xiaojie
    Li, Lianwei
    Zhang, Dinghao
    MACHINE VISION AND APPLICATIONS, 2019, 30 (7-8) : 1157 - 1180
  • [5] One-shot learning hand gesture recognition based on modified 3d convolutional neural networks
    Zhi Lu
    Shiyin Qin
    Xiaojie Li
    Lianwei Li
    Dinghao Zhang
    Machine Vision and Applications, 2019, 30 : 1157 - 1180
  • [6] One-shot Learning Gesture Recognition Based on Evolution of Discrimination with Successive Memory
    Li, Xiaojie
    Qin, Shiyin
    Xu, Kuanhong
    Hu, Zhongying
    2018 IEEE INTERNATIONAL CONFERENCE OF INTELLIGENT ROBOTICS AND CONTROL ENGINEERING (IRCE), 2018, : 263 - 269
  • [7] One-shot Learning Gesture Recognition based on Improved 3D SMoSIFT Feature Descriptor from RGB-D Videos
    Lin, Jia
    Ruan, Xiaogang
    Yu, Naigong
    Wei, Ruoyan
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 4947 - 4952
  • [8] One-shot learning goes 3D
    Zijian Zhao
    Shan Deng
    Zhouhang Jiang
    Kai Ni
    Nature Electronics, 2021, 4 : 866 - 867
  • [9] One-shot learning goes 3D
    Zhao, Zijian
    Deng, Shan
    Jiang, Zhouhang
    Ni, Kai
    NATURE ELECTRONICS, 2021, 4 (12) : 866 - 867
  • [10] One-Shot Gesture Recognition: One Step Towards Adaptive Learning
    Cabrera, Maria E.
    Sanchez-Tamayo, Natalia
    Voyles, Richard
    Wachs, Juan P.
    2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 784 - 789