A Unified Framework for Gesture Recognition and Spatiotemporal Gesture Segmentation

Cited by: 211
Authors
Alon, Jonathan [1]
Athitsos, Vassilis [2]
Yuan, Quan [1]
Sclaroff, Stan [1]
Affiliations
[1] Boston Univ, Dept Comp Sci, Boston, MA 02215 USA
[2] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
Funding
US National Science Foundation
Keywords
Gesture recognition; gesture spotting; human motion analysis; dynamic time warping; continuous dynamic programming; hidden Markov models; hand gestures; tracking
DOI
10.1109/TPAMI.2008.203
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Within the context of hand gesture recognition, spatiotemporal gesture segmentation is the task of determining, in a video sequence, where the gesturing hand is located and when the gesture starts and ends. Existing gesture recognition methods typically assume either known spatial segmentation or known temporal segmentation, or both. This paper introduces a unified framework for simultaneously performing spatial segmentation, temporal segmentation, and recognition. In the proposed framework, information flows both bottom-up and top-down. A gesture can be recognized even when the hand location is highly ambiguous and when information about when the gesture begins and ends is unavailable. Thus, the method can be applied to continuous image streams where gestures are performed in front of moving, cluttered backgrounds. The proposed method consists of three novel contributions: a spatiotemporal matching algorithm that can accommodate multiple candidate hand detections in every frame, a classifier-based pruning framework that enables accurate and early rejection of poor matches to gesture models, and a subgesture reasoning algorithm that learns which gesture models can falsely match parts of other longer gestures. The performance of the approach is evaluated on two challenging applications: recognition of hand-signed digits gestured by users wearing short-sleeved shirts, in front of a cluttered background, and retrieval of occurrences of signs of interest in a video database containing continuous, unsegmented signing in American Sign Language (ASL).
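The first contribution named in the abstract, matching a gesture model against frames that each carry several candidate hand detections, can be illustrated with a small dynamic time warping sketch. This is not the authors' implementation. It assumes each model state and each candidate is a plain feature tuple (for example, a 2D position), that the local cost is Euclidean distance, and that every frame has at least one candidate. The name `dtw_multi_candidate` is hypothetical. The sketch also assumes the gesture spans the whole frame sequence, so it does not perform temporal segmentation, pruning, or subgesture reasoning.

```python
import math


def dist(a, b):
    """Euclidean distance between two feature tuples (assumed local cost)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_multi_candidate(model, frames):
    """Align a gesture model to frames with several hand candidates each.

    model:  list of m feature tuples (one per model state).
    frames: list of n lists; frames[j] holds the candidate detections
            (feature tuples) for video frame j. Each list must be non-empty.
    Returns (cost, path), where path is a list of
    (model_index, frame_index, candidate_index) triples.
    """
    m, n = len(model), len(frames)
    inf = float("inf")
    # D[i][j][k]: best accumulated cost ending at model state i,
    # frame j, candidate k. back stores the predecessor triple.
    D = [[[inf] * len(frames[j]) for j in range(n)] for _ in range(m)]
    back = [[[None] * len(frames[j]) for j in range(n)] for _ in range(m)]

    for i in range(m):
        for j in range(n):
            for k, cand in enumerate(frames[j]):
                local = dist(model[i], cand)
                if i == 0 and j == 0:
                    D[i][j][k] = local
                    continue
                options = []
                if j > 0:
                    # Moving to a new frame: any candidate of the
                    # previous frame may precede candidate k.
                    for kp in range(len(frames[j - 1])):
                        options.append((D[i][j - 1][kp], (i, j - 1, kp)))
                        if i > 0:
                            options.append(
                                (D[i - 1][j - 1][kp], (i - 1, j - 1, kp)))
                if i > 0:
                    # Staying in the same frame: keep the same candidate.
                    options.append((D[i - 1][j][k], (i - 1, j, k)))
                best_cost, best_prev = min(options, key=lambda o: o[0])
                D[i][j][k] = local + best_cost
                back[i][j][k] = best_prev

    # Pick the best candidate in the last frame, then trace back.
    k_end = min(range(len(frames[n - 1])), key=lambda k: D[m - 1][n - 1][k])
    cost = D[m - 1][n - 1][k_end]
    path, state = [], (m - 1, n - 1, k_end)
    while state is not None:
        path.append(state)
        i, j, k = state
        state = back[i][j][k]
    path.reverse()
    return cost, path
```

The extra candidate index is what lets the alignment pick the hand location per frame while it computes the time warp, so no single detection has to be chosen in advance.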
Pages: 1685-1699
Page count: 15
Related papers
50 in total
  • [21] Multimodal Spatiotemporal Feature Map for Dynamic Gesture Recognition
    Zhang X.
    Zeng X.
    Sun W.
    Ren Y.
    Xu T.
    Computer Systems Science and Engineering, 2023, 46 (01) : 671 - 686
  • [22] A database-based framework for gesture recognition
    Vassilis Athitsos
    Haijing Wang
    Alexandra Stefan
    Personal and Ubiquitous Computing, 2010, 14 : 511 - 526
  • [23] A Reservoir Computing Framework for Continuous Gesture Recognition
    Tietz, Stephan
    Jirak, Doreen
    Wermter, Stefan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 7 - 18
  • [24] A Framework for Recognition of Hand Gesture in Static Postures
    Vishwakarma, D. K.
    Priyadarshani
    Singh, Kuldeep
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2016, : 294 - 298
  • [25] A database-based framework for gesture recognition
    Athitsos, Vassilis
    Wang, Haijing
    Stefan, Alexandra
    PERSONAL AND UBIQUITOUS COMPUTING, 2010, 14 (06) : 511 - 526
  • [26] A Gesture Recognition Framework for Exploring Museum Exhibitions
    Agate, Vincenzo
    Gaglio, Salvatore
    AVI'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON ADVANCED VISUAL INTERFACES, 2018,
  • [27] An Interactive Image Segmentation Method in Hand Gesture Recognition
    Chen, Disi
    Li, Gongfa
    Sun, Ying
    Kong, Jianyi
    Jiang, Guozhang
    Tang, Heng
    Ju, Zhaojie
    Yu, Hui
    Liu, Honghai
    SENSORS, 2017, 17 (02)
  • [28] Statistical Segmentation and Recognition of Fingertip Trajectories for a Gesture Interface
    Morimoto, Kazuhiro
    Miyajima, Chiyomi
    Kitaoka, Norihide
    Itou, Katunobu
    Takeda, Kazuya
    ICMI'07: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, 2007, : 54 - +
  • [29] An HMM-based approach for gesture segmentation and recognition
    Deng, JW
    Tsui, HT
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 679 - 682
  • [30] Segmentation and recognition of continuous gesture based on chaotic theory
    Feng, Guangyu
    Hou, Wenjun
    BEHAVIOUR & INFORMATION TECHNOLOGY, 2020, 39 (11) : 1246 - 1256