Recognition of visual activities and interactions by stochastic parsing

被引:370
|
作者
Ivanov, YA
Bobick, AF
机构
[1] MIT, Media Lab, Vis & Modeling Grp, Cambridge, MA 02139 USA
[2] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
关键词
syntactic pattern recognition; action recognition; high level vision; video surveillance; gesture recognition; video monitoring;
D O I
10.1109/34.868686
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a probabilistic syntactic approach to the detection and recognition of temporally extended activities and interactions between multiple agents. The fundamental idea is to divide the recognition problem into two levels. The lower level detections are performed using standard independent probabilistic event detectors to propose candidate detections of low-level features. The outputs of these detectors provide the input stream for a stochastic context-free grammar parsing mechanism. The grammar and parser provide longer range temporal constraints, disambiguate uncertain low-fever detections, and allow the inclusion of a priori knowledge about the structure of temporal events in a given domain. To achieve such a system we: 1) provide techniques for generating a discrete symbol stream from continuous low-level detectors; 2) extend stochastic context-free parsing to handle uncertainty in the input symbol stream; 3) augment a run-time parsing algorithm to enforce intersymbol constraints such as requiring temporal consistency between primitives; and 4) extend the consistency filtering to maintain consistent multiobject interactions. We develop a real-time system and demonstrate the approach in several experiments on gesture recognition and in video surveillance. In the surveillance application, we show how the system correctly interprets activities of multiple, interacting objects.
引用
收藏
页码:852 / 872
页数:21
相关论文
共 50 条
  • [21] Action recognition using probabilistic parsing
    Bobick, AF
    Ivanov, YA
    1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, : 196 - 202
  • [22] The necessity of parsing for predicate argument recognition
    Gildea, D
    Palmer, M
    40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, : 239 - 246
  • [23] On the Use of Parsing for Named Entity Recognition
    Alonso, Miguel A.
    Gomez-Rodriguez, Carlos
    Vilares, Jesus
    APPLIED SCIENCES-BASEL, 2021, 11 (03): : 1 - 24
  • [24] Lexical Parsing Expression Recognition Schemata
    Lumpe, Markus
    2015 24TH AUSTRALASIAN SOFTWARE ENGINEERING CONFERENCE (ASWEC 2015), 2015, : 165 - 174
  • [25] A parsing technique for sketch recognition systems
    Costagliola, G
    Deufemia, V
    Polese, G
    Risi, M
    2004 IEEE SYMPOSIUM ON VISUAL LANGUAGES AND HUMAN CENTRIC COMPUTING: PROCEEDINGS, 2004, : 19 - 26
  • [26] Familiarity with words modulates interhemispheric interactions in visual word recognition
    Kim, Sangyub
    Kim, Joonwoo
    Nam, Kichun
    FRONTIERS IN PSYCHOLOGY, 2022, 13
  • [27] Stochastic Decorrelation Constraint Regularized Auto-Encoder for Visual Recognition
    Mao, Fengling
    Xiong, Wei
    Du, Bo
    Zhang, Lefei
    MULTIMEDIA MODELING, MMM 2017, PT II, 2017, 10133 : 368 - 380
  • [28] Motor-visual neurons and action recognition in social interactions
    de la Rosa, Stephan
    Buelthoff, Heinrich H.
    BEHAVIORAL AND BRAIN SCIENCES, 2014, 37 (02) : 197 - 198
  • [29] Extending bidirectional chart parsing with a stochastic model
    Ageno, A
    Rodriguez, H
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 21 - 26
  • [30] Speed and accuracy in shallow and deep stochastic parsing
    Kaplan, RM
    Riezler, S
    King, FH
    Maxwell, JT
    Vasserman, A
    Crouch, R
    HLT-NAACL 2004: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2004, : 97 - 104