Extracting hierarchical spatial and temporal features for human action recognition

被引:9
|
作者
Zhang, Keting [1 ]
Zhang, Liqing [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Key Lab Shanghai Educ Commiss Intelligent Interac, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Hierarchical feature extraction; Dual-channel model; Subspace network; Spatial and temporal representation; Action recognition; PARALLEL FRAMEWORK; HEVC;
D O I
10.1007/s11042-017-5179-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human action recognition is a challenging computer vision task and many efforts have been made to improve the performance. Most previous work has concentrated on the hand-crafted features or spatial-temporal features learned from multiple contiguous frames. In this paper, we present a dual-channel model to decouple the spatial and temporal feature extraction. More specifically, we propose to capture the complementary static form information from single frame and dynamic motion information from multi-frame differences in two separate channels. In both channels we use two stacked classical subspace networks to learn hierarchical representations, which are subsequently fused for action recognition. Our model is trained and evaluated on three typical benchmarks: KTH, UCF and Hollywood2 datasets. The experimental results illustrate that our approach achieves comparable performances to the state-of-the-art methods. In addition, both feature analysis and control experiments are also carried out to demonstrate the effectiveness of the proposed approach for feature extraction and thereby action recognition.
引用
收藏
页码:16053 / 16068
页数:16
相关论文
共 50 条
  • [1] Extracting hierarchical spatial and temporal features for human action recognition
    Keting Zhang
    Liqing Zhang
    Multimedia Tools and Applications, 2018, 77 : 16053 - 16068
  • [2] Extracting Temporal Features by Key Points Transfer for Effective Action Recognition
    Liao, Chenxi
    Xu, Yuecong
    16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 741 - 746
  • [3] CASCADED TEMPORAL SPATIAL FEATURES FOR VIDEO ACTION RECOGNITION
    Yu, Tingzhao
    Gu, Huxiang
    Wang, Lingfeng
    Xiang, Shiming
    Pan, Chunhong
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1552 - 1556
  • [4] Human Action Recognition by Extracting Features from Negative Space
    Rahman, Shah Atiqur
    Leung, M. K. H.
    Cho, Siu-Yeung
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2011, PT II, 2011, 6979 (II): : 29 - +
  • [5] Hierarchical Spatial-Temporal Masked Contrast for Skeleton Action Recognition
    Cao, Wenming
    Zhang, Aoyu
    He, Zhihai
    Zhang, Yicha
    Yin, Xinpeng
    IEEE Transactions on Artificial Intelligence, 2024, 5 (11): : 5801 - 5814
  • [6] MSAHTA: Mixed Spatial Attention and Hierarchical Temporal Aggregation for Action Recognition
    Feng, Jinyuan
    Yang, Dan
    Ge, Yongxin
    Qin, Xiaolei
    Chen, Yida
    Wang, Yuangan
    2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 775 - 782
  • [7] HUMAN ACTION RECOGNITION VIA SPATIAL AND TEMPORAL METHODS
    Eroglu, Hulusi
    Gokce, C. Onur
    Ilk, H. Gokhan
    2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 104 - 107
  • [8] Online human action recognition with spatial and temporal skeleton features using a distributed camera network
    Liu, Guoliang
    Zhang, Qinghui
    Cao, Yichao
    Tian, Guohui
    Ji, Ze
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (12) : 7389 - 7411
  • [9] Spatio-temporal Semantic Features for Human Action Recognition
    Liu, Jia
    Wang, Xiaonian
    Li, Tianyu
    Yang, Jie
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2012, 6 (10): : 2632 - 2649
  • [10] Human Action Recognition Based on Spatio-temporal Features
    Sawant, Nikhil
    Biswas, K. K.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 357 - 362