Action Recognition Based on Spatial-Temporal Pyramid Sparse Coding

被引:0
|
作者
Zhang, Xiaojing [1 ]
Zhang, Hua [1 ]
Cao, Xiaochun [1 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a novel video presentation term spatial-temporal pyramid sparse coding (STPSC) which characterizes both the spatial and temporal aspects of the video. Specifically, the co-occurrences of visual words are computed with respect to the spatial layout and the sequencing of the features in the video. The representation captures both the spatial arrangement and the temporal relationship of the words. Our representation is motivated by the technology spatial pyramid matching (SPM) which is used to recognize scenes in the image. We extend SPM to video analysis combining with sparse coding. Firstly, dense feature points are extracted and represented by displacement information from a dense optical flow field. Then sparse coding is used to quantize the feature descriptors, and the spatial-temporal pyramid is introduced to represent an action. Finally, we use SVM to classify the videos. Experimental results showed improvements over the state-of-the-art techniques on the public action dataset.
引用
收藏
页码:1455 / 1458
页数:4
相关论文
共 50 条
  • [1] Sparse Coding on Local Spatial-Temporal Volumes for Human Action Recognition
    Zhu, Yan
    Zhao, Xu
    Fu, Yun
    Liu, Yuncai
    [J]. COMPUTER VISION - ACCV 2010, PT II, 2011, 6493 : 660 - +
  • [2] Spatial-Temporal Pyramid Graph Reasoning for Action Recognition
    Geng, Tiantian
    Zheng, Feng
    Hou, Xiaorong
    Lu, Ke
    Qi, Guo-Jun
    Shao, Ling
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5484 - 5497
  • [3] Spatial-temporal pyramid based Convolutional Neural Network for action recognition
    Zheng, Zhenxing
    An, Gaoyun
    Wu, Dapeng
    Ruan, Qiuqi
    [J]. NEUROCOMPUTING, 2019, 358 : 446 - 455
  • [4] A Novel Action Recognition Scheme Based on Spatial-Temporal Pyramid Model
    Zhao, Hengying
    Xiang, Xinguang
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 212 - 221
  • [5] Pyramid Spatial-Temporal Graph Transformer for Skeleton-Based Action Recognition
    Chen, Shuo
    Xu, Ke
    Jiang, Xinghao
    Sun, Tanfeng
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (18):
  • [6] Spatial-Temporal Attention for Action Recognition
    Sun, Dengdi
    Wu, Hanqing
    Ding, Zhuanlian
    Luo, Bin
    Tang, Jin
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 854 - 864
  • [7] A SPATIAL-TEMPORAL CONSTRAINT-BASED ACTION RECOGNITION METHOD
    Han, Tingting
    Yao, Hongxun
    Zhang, Yanhao
    Xu, Pengfei
    [J]. 2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2767 - 2771
  • [8] MULTI-DIRECTIONAL CONVOLUTION NETWORKS WITH SPATIAL-TEMPORAL FEATURE PYRAMID MODULE FOR ACTION RECOGNITION
    Yang, Bohong
    Wang, Zijian
    Ran, Wu
    Lu, Hong
    Chen, Yi-Ping Phoebe
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2440 - 2444
  • [9] Joint spatial-temporal attention for action recognition
    Yu, Tingzhao
    Guo, Chaoxu
    Wang, Lingfeng
    Gu, Huxiang
    Xiang, Shiming
    Pan, Chunhong
    [J]. PATTERN RECOGNITION LETTERS, 2018, 112 : 226 - 233
  • [10] Spatial-temporal pooling for action recognition in videos
    Wang, Jiaming
    Shao, Zhenfeng
    Huang, Xiao
    Lu, Tao
    Zhang, Ruiqian
    Lv, Xianwei
    [J]. NEUROCOMPUTING, 2021, 451 : 265 - 278