A Unified Framework for Gesture Recognition and Spatiotemporal Gesture Segmentation

Cited by: 211
Authors
Alon, Jonathan [1]
Athitsos, Vassilis [2]
Yuan, Quan [1]
Sclaroff, Stan [1]
Affiliations
[1] Boston Univ, Dept Comp Sci, Boston, MA 02215 USA
[2] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
Funding
National Science Foundation (USA);
Keywords
Gesture recognition; gesture spotting; human motion analysis; dynamic time warping; continuous dynamic programming; HIDDEN MARKOV-MODELS; HAND GESTURES; TRACKING;
DOI
10.1109/TPAMI.2008.203
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Within the context of hand gesture recognition, spatiotemporal gesture segmentation is the task of determining, in a video sequence, where the gesturing hand is located and when the gesture starts and ends. Existing gesture recognition methods typically assume either known spatial segmentation or known temporal segmentation, or both. This paper introduces a unified framework for simultaneously performing spatial segmentation, temporal segmentation, and recognition. In the proposed framework, information flows both bottom-up and top-down. A gesture can be recognized even when the hand location is highly ambiguous and when information about when the gesture begins and ends is unavailable. Thus, the method can be applied to continuous image streams where gestures are performed in front of moving, cluttered backgrounds. The proposed method consists of three novel contributions: a spatiotemporal matching algorithm that can accommodate multiple candidate hand detections in every frame, a classifier-based pruning framework that enables accurate and early rejection of poor matches to gesture models, and a subgesture reasoning algorithm that learns which gesture models can falsely match parts of other longer gestures. The performance of the approach is evaluated on two challenging applications: recognition of hand-signed digits gestured by users wearing short-sleeved shirts, in front of a cluttered background, and retrieval of occurrences of signs of interest in a video database containing continuous, unsegmented signing in American Sign Language (ASL).
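The idea of matching a gesture model against several candidate hand detections per frame can be illustrated with a small dynamic time warping sketch. This is a simplified illustration, not the authors' algorithm: it assumes the gesture's start and end frames are already known, uses 2D hand positions as the only feature, and assumes every frame has at least one candidate. The function name `match_gesture` and the data layout are hypothetical.

```python
import math


def dist(p, q):
    """Euclidean distance between two 2D points."""
    return math.hypot(p[0] - q[0], p[1] - q[1])


def match_gesture(model, frames):
    """Align a model trajectory with a video whose frames each hold several
    candidate hand positions (assumed layout: frames[j] is a non-empty list
    of (x, y) tuples). Returns (total_cost, chosen), where chosen[j] is the
    index of the candidate selected in frame j."""
    m, n = len(model), len(frames)
    inf = math.inf
    # cost[i][j][k]: best cumulative cost of aligning model[:i+1] with
    # frames[:j+1] when the path ends on candidate k of frame j.
    cost = [[[inf] * len(frames[j]) for j in range(n)] for _ in range(m)]
    back = [[[None] * len(frames[j]) for j in range(n)] for _ in range(m)]

    for i in range(m):
        for j in range(n):
            for k, cand in enumerate(frames[j]):
                local = dist(model[i], cand)
                if i == 0 and j == 0:
                    cost[i][j][k] = local
                    continue
                best, arg = inf, None
                # Advance the model state within the same frame: the
                # candidate must stay the same, since it is the same frame.
                if i > 0 and cost[i - 1][j][k] < best:
                    best, arg = cost[i - 1][j][k], (i - 1, j, k)
                # Advance the frame: any candidate of the previous frame
                # may precede this one.
                if j > 0:
                    for kp in range(len(frames[j - 1])):
                        if cost[i][j - 1][kp] < best:
                            best, arg = cost[i][j - 1][kp], (i, j - 1, kp)
                        if i > 0 and cost[i - 1][j - 1][kp] < best:
                            best, arg = cost[i - 1][j - 1][kp], (i - 1, j - 1, kp)
                if arg is not None:
                    cost[i][j][k] = best + local
                    back[i][j][k] = arg

    # Pick the cheapest candidate in the last frame, then backtrack to
    # recover which candidate was used in every frame.
    last = n - 1
    k_end = min(range(len(frames[last])), key=lambda k: cost[m - 1][last][k])
    total = cost[m - 1][last][k_end]
    chosen = [None] * n
    state = (m - 1, last, k_end)
    while state is not None:
        i, j, k = state
        chosen[j] = k
        state = back[i][j][k]
    return total, chosen


# Three-point model; each frame holds one plausible and one spurious detection.
model = [(0, 0), (1, 0), (2, 0)]
frames = [[(5, 5), (0, 0)], [(1, 0), (9, 9)], [(7, 7), (2, 0)]]
total, chosen = match_gesture(model, frames)
print(total, chosen)
```

In this sketch the choice of hand candidate is deferred until the whole alignment is scored, so spatial segmentation falls out of the temporal matching instead of preceding it. Spotting gestures in an unsegmented stream, pruning, and subgesture reasoning are not covered here.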
Pages: 1685-1699
Number of pages: 15
Related papers
50 in total
  • [31] Deep Dynamic Neural Networks for Gesture Segmentation and Recognition
    Wu, Di
    Shao, Ling
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 552 - 571
  • [32] Comparison of Hand Segmentation Methodologies for Hand Gesture Recognition
    Howe, Lim Wei
    Wong, Farrah
    Chekima, Ali
    INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008, : 914 - 920
  • [33] What makes a gesture a gesture? Neural signatures involved in gesture recognition
    Cabrera, Maria E.
    Novak, Keisha
    Foti, Daniel
    Voyles, Richard
    Wachs, Juan P.
    2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 748 - 753
  • [34] Gesture Recognition based on Spatiotemporal Histogram of Oriented Gradient Variation
    Kojima, Seiji
    Ohyama, Wataru
    Wakabayashi, Tetsushi
    2017 6TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS AND VISION & 2017 7TH INTERNATIONAL SYMPOSIUM IN COMPUTATIONAL MEDICAL AND HEALTH TECHNOLOGY (ICIEV-ISCMHT), 2017,
  • [35] Spatiotemporal spectral histogramming analysis in hand gesture signature recognition
    Khoh, Wee How
    Pang, Ying Han
    Ooi, Shih Yin
    Yap, Hui Yen
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 4275 - 4286
  • [36] Touch Gesture and Emotion Recognition Using Decomposed Spatiotemporal Convolutions
    Li, Yun-Kai
    Meng, Qing-Hao
    Yang, Tian-Hao
    Wang, Ya-Xin
    Hou, Hui-Rang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [37] Continuous gesture recognition by using gesture spotting
    Lee, Daeha
    Yoon, Hosub
    Kim, Jaehong
    2016 16TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2016, : 1496 - 1498
  • [38] Continuous Gesture Recognition with Hand-oriented Spatiotemporal Feature
    Liu, Zhipeng
    Chai, Xiujuan
    Liu, Zhuang
    Chen, Xilin
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 3056 - 3064
  • [39] Spatiotemporal recursive hyperspheric classification with an application to dynamic gesture recognition
    Reed, Salyer B.
    Reed, Tyson R. C.
    Dascalu, Sergiu M.
    ARTIFICIAL INTELLIGENCE, 2019, 270 : 41 - 66
  • [40] Gesture Feature Extraction for Static Gesture Recognition
    Haitham Sabah Hasan
    Sameem Binti Abdul Kareem
    Arabian Journal for Science and Engineering, 2013, 38 : 3349 - 3366