Gesture MNIST: A New Free-Hand Gesture Dataset

被引:2
|
作者
Schak, Monika [1 ]
Gepperth, Alexander [1 ]
机构
[1] Fulda Univ Appl Sci, Leizpiger Str 123, D-36037 Fulda, Germany
关键词
Hand gesture; Dataset; Sequence classification; LSTM; CNN; Outlier detection;
D O I
10.1007/978-3-031-15937-4_55
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a unimodal, comprehensive, and easy-to-use dataset for visual free-hand gesture recognition. We call it GestureMNIST because of the 28 x 28 grayscale format of its images, and because the number of samples is approximately 80,000, similar to MNIST. Each of the six gesture classes is composed of a sequence of 12 images taken by a 3D camera. As a peculiarity w.r.t. other datasets, all sequences are recorded by a single person, ensuring high sample uniformity and quality. A particular focus is to provide a vision-based dataset that can be used "out of the box" for sequence classification without any preprocessing, segmentation, and feature extraction steps. We present classification experiments on GestureMNIST with different types of DNNs, establishing a performance baseline for sequence classification algorithms. We place particular emphasis on ahead-of-time classification, i.e., the correct identification of a gestures class before the gesture is completed. It is shown that CNN and LSTM-based deep learning achieves nearperfect performance, whereas ahead-of-time classification performance offers ample scope for future research with GestureMNIST. GestureMNIST contains visual samples only, but other modalities, namely acceleration and sound data, are available upon request.
引用
收藏
页码:657 / 668
页数:12
相关论文
共 50 条
  • [1] The Gesture Disagreement Problem in Free-hand Gesture Interaction
    Wu, Huiyue
    Zhang, Shaoke
    Liu, Jiayi
    Qiu, Jiali
    Zhang, Xiaolong
    [J]. INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2019, 35 (12) : 1102 - 1114
  • [2] A NEW DATASET FOR HAND GESTURE ESTIMATION
    Shao, Biyao
    Xie, Yifeng
    Yang, Hongnan
    Jiang, Yatong
    Yan, Chenggang
    Xie, Hongtao
    Wang, Yangang
    [J]. 2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1388 - 1392
  • [3] Gesture Recognition on a New Multi-Modal Hand Gesture Dataset
    Schak, Monika
    Gepperth, Alexander
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 122 - 131
  • [4] Gesture Recognition and Multi-modal Fusion on a New Hand Gesture Dataset
    Schak, Monika
    Gepperth, Alexander
    [J]. PATTERN RECOGNITION APPLICATIONS AND METHODS, ICPRAM 2021, ICPRAM 2022, 2023, 13822 : 76 - 97
  • [5] Diverse hand gesture recognition dataset
    Zahra Mohammadi
    Alireza Akhavanpour
    Razieh Rastgoo
    Mohammad Sabokrou
    [J]. Multimedia Tools and Applications, 2024, 83 : 50245 - 50267
  • [6] SHAPE: a dataset for hand gesture recognition
    Dang, Tuan Linh
    Nguyen, Huu Thang
    Dao, Duc Manh
    Nguyen, Hoang Vu
    Luong, Duc Long
    Nguyen, Ba Tuan
    Kim, Suntae
    Monet, Nicolas
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (24): : 21849 - 21862
  • [7] SHAPE: a dataset for hand gesture recognition
    Tuan Linh Dang
    Huu Thang Nguyen
    Duc Manh Dao
    Hoang Vu Nguyen
    Duc Long Luong
    Ba Tuan Nguyen
    Suntae Kim
    Nicolas Monet
    [J]. Neural Computing and Applications, 2022, 34 : 21849 - 21862
  • [8] Diverse hand gesture recognition dataset
    Mohammadi, Zahra
    Akhavanpour, Alireza
    Rastgoo, Razieh
    Sabokrou, Mohammad
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 50245 - 50267
  • [9] EgoGesture: A New Dataset and Benchmark for Egocentric Hand Gesture Recognition
    Zhang, Yifan
    Cao, Congqi
    Cheng, Jian
    Lu, Hanqing
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (05) : 1038 - 1050
  • [10] Hand posture dataset creation for gesture recognition
    Anton-Canalis, Luis
    Sanchez-Nielsen, Elena
    [J]. VISAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2006, : 197 - +