Video-understanding framework for automatic behavior recognition

被引:43
|
作者
Bremond, Francois [1 ]
Thonnat, Monique [1 ]
Zuniga, Marcos [1 ]
机构
[1] INRIA Sophia Antipolis, ORION Grp, F-06902 Sophia Antipolis, France
关键词
Bayesian Network; IEEE Computer Society; Temporal Constraint; Basic Scenario; Metro Station;
D O I
10.3758/BF03192795
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
We propose an activity-monitoring framework based on a platform called VSIP, enabling behavior recognition in different environments. To allow end-users to actively participate in the development of a new application, VSIP separates algorithms from a priori knowledge. To describe how VSIP works, we present a full description of a system developed with this platform for recognizing behaviors, involving either isolated individuals, groups of people, or crowds, in the context of visual monitoring of metro scenes, using multiple cameras. In this work, we also illustrate the capability of the framework to easily combine and tune various recognition methods dedicated to the visual analysis of specific situations (e.g., mono-/multiactors' activities, numerical/symbolic actions, or temporal scenarios). We also present other applications, using this framework, in the context of behavior recognition. VSIP has shown a good performance on human behavior recognition for different problems and configurations, being suitable to fulfill a large variety of requirements.
引用
收藏
页码:416 / 426
页数:11
相关论文
共 50 条
  • [31] Multimodal understanding for person recognition in video broadcasts
    Bechat, Frederic
    Bendris, Meriem
    Charlet, Delphine
    Damnati, Geraldine
    Favre, Benoit
    Rouvier, Mickael
    Auguste, Remi
    Bigot, Benjamin
    Dufour, Richard
    Fredouille, Corinne
    Linares, Georges
    Martinet, Jean
    Senay, Gregory
    Tirilly, Pierre
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 607 - 611
  • [32] Image understanding research for automatic target recognition
    Bhanu, Bir
    Jones, Terry L.
    IEEE Aerospace and Electronic Systems Magazine, 1993, 8 (10) : 15 - 23
  • [33] Prosody modeling for automatic speech recognition and understanding
    Shriberg, E
    Stolcke, A
    MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 105 - 114
  • [34] Video Analysis for Human Behavior Understanding
    Hwang, Jenq-Neng
    Kim, Changick
    Cheng, Hsu-Yung
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
  • [35] Video Analysis for Human Behavior Understanding
    Jenq-Neng Hwang
    Changick Kim
    Hsu-Yung Cheng
    EURASIP Journal on Advances in Signal Processing, 2010
  • [36] OmniViD: A Generative Framework for Universal Video Understanding
    Wang, Junke
    Chen, Dongdong
    Luo, Chong
    He, Bo
    Yuan, Lu
    Wu, Zuxuan
    Jiang, Yu-Gang
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 18209 - 18220
  • [37] Social behavior recognition in continuous video
    Burgos-Artizzu, Xavier P.
    Dollar, Piotr
    Lin, Dayu
    Anderson, David J.
    Perona, Pietro
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 1322 - 1329
  • [38] Crowd Behavior Recognition for Video Surveillance
    Saxena, Shobhit
    Bremond, Francois
    Thonnat, Monnique
    Ma, Ruihua
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2008, 5259 : 970 - +
  • [39] A framework for improved video text detection and recognition
    Haojin Yang
    Bernhard Quehl
    Harald Sack
    Multimedia Tools and Applications, 2014, 69 : 217 - 245
  • [40] Video Analytics Framework for Human Action Recognition
    Khan, Muhammad Attique
    Alhaisoni, Majed
    Armghan, Ammar
    Alenezi, Fayadh
    Tariq, Usman
    Nam, Yunyoung
    Akram, Tallha
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (03): : 3841 - 3859