Recognition of visual activities and interactions by stochastic parsing

Cited by: 370
Authors
Ivanov, YA
Bobick, AF
Affiliations
[1] MIT, Media Lab, Vis & Modeling Grp, Cambridge, MA 02139 USA
[2] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
Keywords
syntactic pattern recognition; action recognition; high level vision; video surveillance; gesture recognition; video monitoring;
DOI
10.1109/34.868686
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper describes a probabilistic syntactic approach to the detection and recognition of temporally extended activities and interactions between multiple agents. The fundamental idea is to divide the recognition problem into two levels. The lower level detections are performed using standard independent probabilistic event detectors to propose candidate detections of low-level features. The outputs of these detectors provide the input stream for a stochastic context-free grammar parsing mechanism. The grammar and parser provide longer range temporal constraints, disambiguate uncertain low-level detections, and allow the inclusion of a priori knowledge about the structure of temporal events in a given domain. To achieve such a system we: 1) provide techniques for generating a discrete symbol stream from continuous low-level detectors; 2) extend stochastic context-free parsing to handle uncertainty in the input symbol stream; 3) augment a run-time parsing algorithm to enforce intersymbol constraints such as requiring temporal consistency between primitives; and 4) extend the consistency filtering to maintain consistent multiobject interactions. We develop a real-time system and demonstrate the approach in several experiments on gesture recognition and in video surveillance. In the surveillance application, we show how the system correctly interprets activities of multiple, interacting objects.
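The two-level idea in the abstract can be illustrated with a small sketch. It is not the parser used in the paper: for brevity it assumes a grammar in Chomsky normal form and uses a Viterbi-style CYK chart, and every name in it (`viterbi_cyk`, the `up`/`down` terminals, the toy grammar) is hypothetical. Each time step carries a set of candidate terminals with detector likelihoods, and the grammar picks the most probable consistent reading.

```python
from collections import defaultdict


def viterbi_cyk(stream, lexical, binary, start="S"):
    """Probability of the best parse of an uncertain symbol stream.

    stream  : list of dicts, one per time step, terminal -> detector likelihood
    lexical : dict (nonterminal, terminal) -> rule probability
    binary  : dict (parent, left, right) -> rule probability
    All three structures are illustrative assumptions, not the paper's API.
    """
    n = len(stream)
    # best[i][j][A] = best probability that nonterminal A spans stream[i:j]
    best = [[defaultdict(float) for _ in range(n + 1)] for _ in range(n + 1)]

    # Lower level: weight each lexical rule by the detector's likelihood.
    for i, candidates in enumerate(stream):
        for (nt, term), p_rule in lexical.items():
            p = p_rule * candidates.get(term, 0.0)
            if p > best[i][i + 1][nt]:
                best[i][i + 1][nt] = p

    # Upper level: combine adjacent spans with the binary rules.
    for width in range(2, n + 1):
        for i in range(0, n - width + 1):
            j = i + width
            for k in range(i + 1, j):
                for (parent, left, right), p_rule in binary.items():
                    p = p_rule * best[i][k].get(left, 0.0) * best[k][j].get(right, 0.0)
                    if p > best[i][j][parent]:
                        best[i][j][parent] = p

    return best[0][n].get(start, 0.0)


# Toy grammar (hypothetical): a gesture is an "up" stroke followed by a "down" stroke.
lexical = {("UP", "up"): 1.0, ("DOWN", "down"): 1.0}
binary = {("S", "UP", "DOWN"): 1.0}

# The second detection is ambiguous; the grammar only admits the "down" reading.
stream = [{"up": 0.75, "down": 0.25}, {"up": 0.5, "down": 0.5}]
score = viterbi_cyk(stream, lexical, binary)
```

Here the only reading the grammar admits is `up` followed by `down`, so the ambiguous second detection is resolved by the longer-range structure instead of by the detector alone. A stream whose strokes arrive in the opposite order receives a score of zero under this toy grammar.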
Pages: 852-872
Page count: 21
Related papers
50 in total
  • [31] Recognition of Long-Term Behaviors by Parsing Sequences of Short-Term Actions with a Stochastic Regular Grammar
    Sanroma, Gerard
    Burghouts, Gertjan
    Schutte, Klamer
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 : 225 - 233
  • [32] Time reduction of stochastic parsing with stochastic context-free grammars
    Sánchez, JA
    Benedí, JM
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2005, 3523 : 163 - 171
  • [33] BAYESIAN BELIEF NETWORKS AS A TOOL FOR STOCHASTIC PARSING
    LUCKE, H
    SPEECH COMMUNICATION, 1995, 16 (01) : 89 - 118
  • [34] Proposed Framework for Stochastic Parsing of Myanmar Language
    Aung, Myintzu Phyo
    Aung, Ohnmar
    Hlaing, Nan Yu
    BIG DATA ANALYSIS AND DEEP LEARNING APPLICATIONS, 2019, 744 : 179 - 187
  • [35] Stochastic Representation and Recognition of High-level Group Activities: Describing Structural Uncertainties in Human Activities
    Ryoo, M. S.
    Aggarwal, J. K.
    2009 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPR WORKSHOPS 2009), VOLS 1 AND 2, 2009, : 538 - 538
  • [36] VISUAL RECOGNITION OF EVENTS AND ACTIVITIES BASED ON MOMENTUM OF MOTION ENERGY MASS
    Hu, Jinhui
    Boulgouris, Nikolaos V.
    APPLIED ARTIFICIAL INTELLIGENCE, 2012, 26 (1-2) : 81 - 96
  • [37] Visual signals or displacement activities? The function of visual displays in agonistic interactions in nocturnal tree frogs
    Raíssa Furtado
    Fausto Nomura
    acta ethologica, 2014, 17 : 9 - 14
  • [38] Visual signals or displacement activities? The function of visual displays in agonistic interactions in nocturnal tree frogs
    Furtado, Raissa
    Nomura, Fausto
    ACTA ETHOLOGICA, 2014, 17 (01) : 9 - 14
  • [39] Parsing and Predicting Increased Noise in Visual Cortex
    Fisher, Tucker G.
    JOURNAL OF NEUROSCIENCE, 2015, 35 (20) : 7657 - 7659
  • [40] Fast stochastic context-free parsing: A stochastic version of the valiant algorithm
    Benedi, Jose-Miguel
    Sanchez, Joan-Andreu
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 1, PROCEEDINGS, 2007, 4477 : 80 - +