A Hierarchical Spatio-Temporal Model for Human Activity Recognition

Cited by: 33
Authors
Xu, Wanru [1]
Miao, Zhenjiang [1]
Zhang, Xiao-Ping [2]
Tian, Yi [1]
Affiliations
[1] Beijing Jiaotong Univ, Beijing 100044, Peoples R China
[2] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON M5B 2K3, Canada
Keywords
Activity recognition; hidden conditional random field (HCRF); hierarchical structure; spatio-temporal dependencies; HIDDEN MARKOV MODEL; FRAMEWORK
DOI
10.1109/TMM.2017.2674622
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
There are two key issues in human activity recognition: spatial dependencies and temporal dependencies. Most recent methods focus on only one of them, and thus do not have sufficient descriptive power to recognize complex activities. In this paper, we propose a hierarchical spatio-temporal model (HSTM) to solve the problem by modeling spatial and temporal constraints simultaneously. The new HSTM is a two-layer hidden conditional random field (HCRF), where the bottom-layer HCRF aims at describing spatial relations in each frame and learning more discriminative representations, and the top-layer HCRF utilizes these high-level features to characterize temporal relations in the whole video sequence. The new HSTM uses the bottom layer as the building block for the top layer and aggregates evidence from the local to the global level. A novel learning algorithm is derived to train all model parameters efficiently, and its effectiveness is validated theoretically. Experimental results show that the HSTM classifies single-person actions (UCF) with higher accuracy than other existing methods. More importantly, the HSTM also achieves superior performance on more practical interactions, including human-human interactional activities (UT-Interaction, BIT-Interaction, and CASIA) and human-object interactional activities (Gupta video dataset).
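As a rough illustration of the HCRF building block named in the abstract, the sketch below computes the class posterior of a generic chain-structured HCRF by summing over all hidden-state sequences with a forward recursion. It is not the authors' HSTM or their learning algorithm: the chain structure, the function names (`chain_log_score`, `hcrf_posterior`), the parameter layout, and the toy numbers are all assumptions made for illustration, and the sketch covers only a single layer over a sequence of per-frame feature vectors.

```python
import math


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def chain_log_score(unary, pairwise):
    """Log of the summed exponentiated scores of all hidden-state paths.

    unary[t][h]    : score of hidden state h at position t
    pairwise[h][k] : score of moving from hidden state h to hidden state k
    Uses the forward recursion, so the cost is linear in sequence length.
    """
    alpha = list(unary[0])
    for t in range(1, len(unary)):
        alpha = [
            unary[t][k]
            + log_sum_exp([alpha[h] + pairwise[h][k] for h in range(len(alpha))])
            for k in range(len(unary[t]))
        ]
    return log_sum_exp(alpha)


def hcrf_posterior(features, weights, transitions):
    """Class posterior P(y | x) of a chain HCRF (hypothetical parameter layout).

    features[t]          : feature vector observed at position t
    weights[y][h]        : weight vector tying hidden state h to label y
    transitions[y][h][k] : label-specific hidden-state transition score
    """
    log_scores = []
    for y in range(len(weights)):
        unary = [
            [sum(w_i * x_i for w_i, x_i in zip(w, x)) for w in weights[y]]
            for x in features
        ]
        log_scores.append(chain_log_score(unary, transitions[y]))
    log_z = log_sum_exp(log_scores)  # normalizer over all labels
    return [math.exp(s - log_z) for s in log_scores]


# Toy input: 3 frames, 2-D features, 2 labels, 2 hidden states (made-up values).
features = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]
weights = [
    [[1.0, -1.0], [0.0, 0.5]],
    [[-0.5, 0.2], [0.3, 0.3]],
]
transitions = [
    [[0.1, -0.2], [0.0, 0.4]],
    [[0.0, 0.0], [0.2, -0.1]],
]
posterior = hcrf_posterior(features, weights, transitions)
print(posterior)
```

In a two-layer arrangement like the one the abstract describes, a model of this kind would presumably be applied once within each frame and once across frames, with the lower layer's output serving as the upper layer's features. How the paper actually couples the layers is not stated in this record.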
Pages: 1494-1509
Number of pages: 16