Recognition of visual activities and interactions by stochastic parsing

Cited by: 370

Authors:

Ivanov, YA

Bobick, AF

Affiliations:

[1] MIT, Media Lab, Vis & Modeling Grp, Cambridge, MA 02139 USA

[2] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2000 / Vol. 22 / No. 08

Keywords:

syntactic pattern recognition; action recognition; high level vision; video surveillance; gesture recognition; video monitoring;

DOI:

10.1109/34.868686

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper describes a probabilistic syntactic approach to the detection and recognition of temporally extended activities and interactions between multiple agents. The fundamental idea is to divide the recognition problem into two levels. The lower level detections are performed using standard independent probabilistic event detectors to propose candidate detections of low-level features. The outputs of these detectors provide the input stream for a stochastic context-free grammar parsing mechanism. The grammar and parser provide longer range temporal constraints, disambiguate uncertain low-level detections, and allow the inclusion of a priori knowledge about the structure of temporal events in a given domain. To achieve such a system we: 1) provide techniques for generating a discrete symbol stream from continuous low-level detectors; 2) extend stochastic context-free parsing to handle uncertainty in the input symbol stream; 3) augment a run-time parsing algorithm to enforce intersymbol constraints such as requiring temporal consistency between primitives; and 4) extend the consistency filtering to maintain consistent multiobject interactions. We develop a real-time system and demonstrate the approach in several experiments on gesture recognition and in video surveillance. In the surveillance application, we show how the system correctly interprets activities of multiple, interacting objects.
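
The two-level idea in the abstract can be illustrated with a minimal sketch. It is not the parser described in the paper: it assumes a toy grammar in Chomsky normal form and uses a probabilistic CYK (Viterbi) chart as a simple stand-in for stochastic context-free parsing. The grammar, the symbol names `a` and `b`, and the function `viterbi_parse_prob` are all hypothetical. Each input position carries a dictionary of detector likelihoods rather than a single symbol, which is one simple way to model uncertainty in the input symbol stream.

```python
from collections import defaultdict

# Hypothetical toy grammar in Chomsky normal form (not taken from the paper).
# Binary rules: (parent, left child, right child) -> rule probability.
BINARY = {
    ("S", "A", "B"): 1.0,
}
# Lexical rules: (parent, terminal symbol) -> rule probability.
LEXICAL = {
    ("A", "a"): 1.0,
    ("B", "b"): 1.0,
}


def viterbi_parse_prob(stream, start="S"):
    """Return the probability of the best parse of an uncertain symbol stream.

    `stream` is a list with one dict per time step; each dict maps a
    terminal symbol to the likelihood reported by a low-level detector.
    """
    n = len(stream)
    # best[(i, j, X)] = best probability that nonterminal X spans stream[i:j].
    best = defaultdict(float)

    # Length-1 spans: weight each lexical rule by the detector likelihood.
    for i, likelihoods in enumerate(stream):
        for (parent, terminal), rule_p in LEXICAL.items():
            score = rule_p * likelihoods.get(terminal, 0.0)
            if score > best[(i, i + 1, parent)]:
                best[(i, i + 1, parent)] = score

    # Longer spans: combine two adjacent sub-spans with a binary rule.
    for length in range(2, n + 1):
        for i in range(0, n - length + 1):
            j = i + length
            for k in range(i + 1, j):
                for (parent, left, right), rule_p in BINARY.items():
                    score = rule_p * best[(i, k, left)] * best[(k, j, right)]
                    if score > best[(i, j, parent)]:
                        best[(i, j, parent)] = score

    return best[(0, n, start)]


# The second detector slightly prefers "a", but the grammar only admits
# the sequence "a b", so the parse commits to the "b" reading.
uncertain = [{"a": 0.9, "b": 0.1}, {"a": 0.6, "b": 0.4}]
consistent_prob = viterbi_parse_prob(uncertain)

# A stream the grammar cannot generate gets probability zero.
impossible_prob = viterbi_parse_prob([{"b": 1.0}, {"a": 1.0}])
```

In this sketch, choosing the most likely symbol at each step independently would yield the sequence `a a`, which the toy grammar rejects. Scoring whole parses instead lets the grammar's structure override a locally uncertain detection, which is the disambiguation effect the abstract attributes to the grammar and parser.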


Pages: 852-872

Page count: 21
