Learning Dynamic Spatio-Temporal Relations for Human Activity Recognition

被引:2
|
作者
Liu, Zhenyu [1 ]
Yao, Yaqiang [2 ]
Liu, Yan [2 ]
Zhu, Yuening [3 ]
Tao, Zhenchao [4 ]
Wang, Lei [5 ]
Feng, Yuhong [6 ]
机构
[1] China Univ Polit Sci & Law, Dept Sci & Technol Teaching, Beijing 100088, Peoples R China
[2] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
[3] Shanghai Univ, Sch Mat & Engn, Shanghai 200444, Peoples R China
[4] Univ Sci & Technol China, Affliated Hosp 1, Hefei 230001, Peoples R China
[5] Third Peoples Hosp Hefei, Hefei 230041, Peoples R China
[6] Shenzhen Univ, Coll Comp Sci & Software Engn, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Guangdong, Peoples R China
关键词
Hidden Markov models; Activity recognition; Hip; Head; Histograms; Recurrent neural networks; Image segmentation; Human activity recognition; qualitative spatio-temporal graph; vector quantization; discrete HMMs; CLASSIFICATION; MODEL; FEATURES;
D O I
10.1109/ACCESS.2020.3009136
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human activity, which usually consists of several actions (sub-activities), generally covers interactions among persons and/or objects. In particular, human actions involve certain spatial and temporal relationships, are the components of more complicated activity, and evolve dynamically over time. Therefore, the description of a single human action and the modeling of the evolution of successive human actions are two major issues in human activity recognition. In this paper, we develop a method for human activity recognition that tackles these two issues. In the proposed method, an activity is divided into several successive actions represented by spatio-temporal patterns, and the evolution of these actions are captured by a sequential model. A refined comprehensive spatio-temporal graph is utilized to represent a single action, which is a qualitative representation of a human action incorporating both the spatial and temporal relations of the participant objects. Next, a discrete hidden Markov model is applied to model the evolution of action sequences. Moreover, a fully automatic partition method is proposed to divide a long-term human activity video into several human actions based on variational objects and qualitative spatial relations. Finally, a hierarchical decomposition of the human body is introduced to obtain a discriminative representation for a single action. Experimental results on the Cornell Activity Dataset demonstrate the efficiency and effectiveness of the proposed approach, which will enable long videos of human activity to be better recognized.
引用
收藏
页码:130340 / 130352
页数:13
相关论文
共 50 条
  • [1] LEARNING A HIERARCHICAL SPATIO-TEMPORAL MODEL FOR HUMAN ACTIVITY RECOGNITION
    Xu, Wanru
    Miao, Zhenjiang
    Zhang, Xiao-Ping
    Tian, Yi
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1607 - 1611
  • [2] Human Activity Recognition Based on Transfer Learning with Spatio-Temporal Representations
    Zebhi, Saeedeh
    Almodarresi, S. M. T.
    Abootalebi, Vahid
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2021, 18 (06) : 839 - 845
  • [3] Sensor-Based Human Activity Recognition with Spatio-Temporal Deep Learning
    Nafea, Ohoud
    Abdul, Wadood
    Muhammad, Ghulam
    Alsulaiman, Mansour
    [J]. SENSORS, 2021, 21 (06) : 1 - 20
  • [4] A Hierarchical Spatio-Temporal Model for Human Activity Recognition
    Xu, Wanru
    Miao, Zhenjiang
    Zhang, Xiao-Ping
    Tian, Yi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (07) : 1494 - 1509
  • [5] A Spatio-temporal Feature Learning Approach for Dynamic Scene Recognition
    Ullah, Ihsan
    Petrosino, Alfredo
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 591 - 598
  • [6] Spatio-Temporal Dynamic Inference Network for Group Activity Recognition
    Yuan, Hangjie
    Ni, Dong
    Wang, Mang
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7456 - 7465
  • [7] Qualitative and Quantitative Spatio-temporal Relations in Daily Living Activity Recognition
    Tayyub, Jawad
    Tavanai, Aryana
    Gatsoulis, Yiannis
    Cohn, Anthony G.
    Hogg, David C.
    [J]. COMPUTER VISION - ACCV 2014, PT V, 2015, 9007 : 115 - 130
  • [8] Spatio-Temporal Phrases for Activity Recognition
    Zhang, Yimeng
    Liu, Xiaoming
    Chang, Ming-Ching
    Ge, Weina
    Chen, Tsuhan
    [J]. COMPUTER VISION - ECCV 2012, PT III, 2012, 7574 : 707 - 721
  • [9] Learning Bag of Spatio-Temporal Features for Human Interaction Recognition
    Slimani, Khadidja Nour El Houda
    Benezeth, Yannick
    Souami, Feryel
    [J]. TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [10] Leveraging Transfer Learning for Spatio-Temporal Human Activity Recognition from Video Sequences
    Butt, Umair Muneer
    Ullah, Hadiqa Aman
    Letchmunan, Sukumar
    Tariq, Iqra
    Hassan, Fadratul Hafinaz
    Koh, Tieng Wei
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (03): : 5017 - 5033