Unsupervised Activity Perception in Crowded and Complicated Scenes Using Hierarchical Bayesian Models

被引:402
|
作者
Wang, Xiaogang [1 ]
Ma, Xiaoxu [1 ]
Grimson, W. Eric L. [1 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
关键词
Hierarchical Bayesian model; visual surveillance; activity analysis; abnormality detection; video segmentation; motion segmentation; clustering; Dirichlet process; Gibbs sampling; variational inference; VIDEO; TRACKING;
D O I
10.1109/TPAMI.2008.87
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel unsupervised learning framework to model activities and interactions in crowded and complicated scenes. Under our framework, hierarchical Bayesian models are used to connect three elements in visual surveillance: low-level visual features, simple "atomic" activities, and interactions. Atomic activities are modeled as distributions over low-level visual features, and multiagent interactions are modeled as distributions over atomic activities. These models are learned in an unsupervised way. Given a long video sequence, moving pixels are clustered into different atomic activities and short video clips are clustered into different interactions. In this paper, we propose three hierarchical Bayesian models: the Latent Dirichlet Allocation (LDA) mixture model, the Hierarchical Dirichlet Processes (HDP) mixture model, and the Dual Hierarchical Dirichlet Processes (Dual-HDP) model. They advance existing topic models, such as LDA [1] and HDP [2]. Directly using existing LDA and HDP models under our framework, only moving pixels can be clustered into atomic activities. Our models can cluster both moving pixels and video clips into atomic activities and into interactions. The LDA mixture model assumes that it is already known how many different types of atomic activities and interactions occur in the scene. The HDP mixture model automatically decides the number of categories of atomic activities. The Dual-HDP automatically decides the numbers of categories of both atomic activities and interactions. Our data sets are challenging video sequences from crowded traffic scenes and train station scenes with many kinds of activities co-occurring. Without tracking and human labeling effort, our framework completes many challenging visual surveillance tasks of broad interest such as: 1) discovering and providing a summary of typical atomic activities and interactions occurring in the scene, 2) segmenting long video sequences into different interactions, 3) segmenting motions into different activities, 4) detecting abnormality, and 5) supporting high-level queries on activities and interactions. In our work, these surveillance problems are formulated in a transparent, clean, and probabilistic way compared with the ad hoc nature of many existing approaches.
引用
收藏
页码:539 / 555
页数:17
相关论文
共 50 条
  • [1] Unsupervised activity perception by hierarchical Bayesian models
    Wang, Xiaogang
    Ma, Xiaoxu
    Grimson, Eric
    [J]. 2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 785 - +
  • [2] People detection in crowded scenes using hierarchical features
    Zeng, Qiang
    Yuan, Yule
    Fu, Canmiao
    Zhao, Yong
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS AND TECHNIQUES (IST), 2017, : 122 - 126
  • [3] Hierarchical Bayesian models for unsupervised scene understanding
    Steinberg, Daniel M.
    Pizarro, Oscar
    Williams, Stefan B.
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 131 : 128 - 144
  • [4] Nonparametric Bayesian Models for Unsupervised Activity Recognition and Tracking
    Dhir, Neil
    Perov, Yura
    Wood, Frank
    [J]. 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 4040 - 4045
  • [5] Hierarchical representation of scenes using activity information
    Landabaso, JL
    Pardis, M
    Xu, LQ
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 677 - 680
  • [6] Bayesian Hierarchical Models with Applications Using R
    Stock, Christian
    [J]. BIOMETRICAL JOURNAL, 2021, 63 (01) : 213 - 214
  • [7] Intelligent smoothing using hierarchical Bayesian models
    Graham, Patrick
    [J]. EPIDEMIOLOGY, 2008, 19 (03) : 493 - 495
  • [8] Modeling and Tracking of Crowded Traffic Scenes by using Policy Trees, Occupancy Grid Blocks and Bayesian Filters
    Vatavu, Andrei
    Danescu, Radu
    Nedevschi, Sergiu
    [J]. 2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2014, : 1948 - 1955
  • [9] Calibrating Subjective Probabilities Using Hierarchical Bayesian Models
    Merkle, Edgar C.
    [J]. ADVANCES IN SOCIAL COMPUTING, PROCEEDINGS, 2010, 6007 : 13 - 22
  • [10] Inversion of hierarchical Bayesian models using Gaussian processes
    Lomakina, Ekaterina I.
    Paliwal, Saee
    Diaconescu, Andreea O.
    Brodersen, Kay H.
    Aponte, Eduardo A.
    Buhmann, Joachim M.
    Stephan, Klaas E.
    [J]. NEUROIMAGE, 2015, 118 : 133 - 145