One-shot learning gesture recognition based on joint training of 3D ResNet and memory module

被引:10
|
作者
Li, Lianwei [1 ]
Qin, Shiyin [1 ,2 ]
Lu, Zhi [1 ]
Xu, Kuanhong [3 ]
Hu, Zhongying [3 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Dongguan Univ Technol, Sch Elect Engn & Intelligentizat, Dongguan 523808, Guangdong, Peoples R China
[3] Sony China Res Lab, Artificial Intelligence Res Dept, Beijing 100028, Peoples R China
基金
中国国家自然科学基金;
关键词
Gesture recognition; One-shot learning; Joint training; 3D ResNet; Memory module; RGB-D DATA; DATASET;
D O I
10.1007/s11042-019-08429-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As a research hotspot in the field of human-machine interaction, a great progress of hand gesture recognition has been achieved with the development of deep learning of neural networks. However, in the deep learning based recognition methods, it is necessary to rely heavily on large-scale labeled dataset which is very hard to build in practical applications. In order to achieve a well performance under some strict constraint of few sample data, one-shot learning gesture recognition is studied and a joint deep training method by combination of 3D ResNet with a memory module is presented in this paper. In our scheme a combinatorial optimization of feature extraction by 3D ResNet with memory capacity of rare event by memory module is carried out with an effective strategy of optimal decision and two relative performance indices. In order to implement one-shot learning gesture recognition, the memory module is employed to remember the features extracted by well-trained 3D ResNet and the classification decision is performed by the nearest neighbor algorithm with cosine similarity measure. In view of real-world applications about human-machine interaction technology, its ability to deal with negative samples plays a significant role thus a mechanism based on the threshold of cosine similarity is built to realize effective classification and rejection respectively. In order to validate and evaluate the performance of our proposed method, a special hand gesture dataset containing 3045 gesture videos is built and a series of experiment results on our collected dataset and public datasets demonstrate the feasibility and effectiveness of our method.
引用
收藏
页码:6727 / 6757
页数:31
相关论文
共 50 条
  • [41] Complementary-View SAR Target Recognition Based on One-Shot Learning
    Chen, Benteng
    Zhou, Zhengkang
    Liu, Chunyu
    Zheng, Jia
    REMOTE SENSING, 2024, 16 (14)
  • [42] You Will Never Walk Alone: One-Shot 3D Action Recognition with Point Cloud Sequence
    Tong X.
    Xiao Y.
    Tan B.
    Yang J.
    Cao Z.
    Zhou J.T.
    Yuan J.
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (11) : 1 - 1
  • [43] Reducing Training Time in a One-Shot Machine Learning-Based Compiler
    Thomson, John
    O'Boyle, Michael
    Fursin, Grigori
    Franke, Bjoern
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2010, 5898 : 399 - +
  • [44] One-shot learning based pattern transition map for action early recognition
    Ji, Yanli
    Yang, Yang
    Xu, Xing
    Shen, Heng Tao
    SIGNAL PROCESSING, 2018, 143 : 364 - 370
  • [45] One-Shot 3D-Gradient Method Applied to Face Recognition
    Matias Di Martino, J.
    Fernandez, Alicia
    Ferrari, Jose
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2015, 2015, 9423 : 176 - 183
  • [46] ADVERSARY DISTILLATION FOR ONE-SHOT ATTACKS ON 3D TARGET TRACKING
    Wang, Zhengyi
    Wang, Xupeng
    Sohel, Ferdous
    Bennamoun, Mohammed
    Liao, Yong
    Yu, Jiali
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2749 - 2753
  • [47] Machine Learning in 3D Space Gesture Recognition
    Naosekpam, Veronica
    Sharma, Rupam Kumar
    JURNAL KEJURUTERAAN, 2019, 31 (02): : 243 - 248
  • [48] Using Anatomical Priors for Deep 3D One-shot Segmentation
    Duc Duy Pham
    Dovletov, Gurbandurdy
    Pauli, Josef
    BIOIMAGING: PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES - VOL. 2: BIOIMAGING, 2021, : 174 - 181
  • [49] Robust estimation of correspondences for one-shot 3D surface reconstruction
    Zheng, L.
    Shmarlouski, A.
    Huynh, T. H.
    Hesser, J.
    STRAHLENTHERAPIE UND ONKOLOGIE, 2018, 194 : S87 - S88
  • [50] Post-stroke hand gesture recognition via one-shot transfer learning using prototypical networks
    Sarwat, Hussein
    Alkhashab, Amr
    Song, Xinyu
    Jiang, Shuo
    Jia, Jie
    Shull, Peter B.
    JOURNAL OF NEUROENGINEERING AND REHABILITATION, 2024, 21 (01)