Action Recognition Based on Spatial-Temporal Pyramid Sparse Coding

被引:0
|
作者
Zhang, Xiaojing [1 ]
Zhang, Hua [1 ]
Cao, Xiaochun [1 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a novel video presentation term spatial-temporal pyramid sparse coding (STPSC) which characterizes both the spatial and temporal aspects of the video. Specifically, the co-occurrences of visual words are computed with respect to the spatial layout and the sequencing of the features in the video. The representation captures both the spatial arrangement and the temporal relationship of the words. Our representation is motivated by the technology spatial pyramid matching (SPM) which is used to recognize scenes in the image. We extend SPM to video analysis combining with sparse coding. Firstly, dense feature points are extracted and represented by displacement information from a dense optical flow field. Then sparse coding is used to quantize the feature descriptors, and the spatial-temporal pyramid is introduced to represent an action. Finally, we use SVM to classify the videos. Experimental results showed improvements over the state-of-the-art techniques on the public action dataset.
引用
收藏
页码:1455 / 1458
页数:4
相关论文
共 50 条
  • [41] Action Recognition Using a Spatial-Temporal Network for Wild Felines
    Feng, Liqi
    Zhao, Yaqin
    Sun, Yichao
    Zhao, Wenxuan
    Tang, Jiaxi
    [J]. ANIMALS, 2021, 11 (02): : 1 - 18
  • [42] Recurrent Spatial-Temporal Attention Network for Action Recognition in Videos
    Du, Wenbin
    Wang, Yali
    Qiao, Yu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1347 - 1360
  • [43] Spatial-Temporal Network Coding Based on BATS Code
    Xu, Xiaoli
    Guan, Yong Liang
    Zeng, Yong
    Chui, Chee-Cheon
    [J]. IEEE COMMUNICATIONS LETTERS, 2017, 21 (03) : 620 - 623
  • [44] A local descriptor based on Laplacian pyramid coding for action recognition
    Zhen, Xiantong
    Shao, Ling
    [J]. PATTERN RECOGNITION LETTERS, 2013, 34 (15) : 1899 - 1905
  • [45] Palmprint Recognition via Sparse Coding Spatial Pyramid Matching Representation of SIFT Feature
    Liu, Ligang
    Zhang, Jianxin
    Yang, Aoqi
    [J]. BIOMETRIC RECOGNITION, 2016, 9967 : 235 - 243
  • [46] Actionmamba: Action Spatial-Temporal Aggregation Network Based on Mamba and Gcn for Skeleton-Based Action Recognition
    North University of China, School of Electrical and Control Engineering, Shanxi, Taiyuan
    030051, China
    [J].
  • [47] Human action recognition based on multi-mode spatial-temporal feature fusion
    Wang, Dongli
    Yang, Jun
    Zhou, Yan
    [J]. 2019 22ND INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2019), 2019,
  • [48] Spatial-temporal graph neural ODE networks for skeleton-based action recognition
    Pan, Longji
    Lu, Jianguang
    Tang, Xianghong
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)
  • [49] A Separable Spatial-Temporal Graph Learning Approach for Skeleton-Based Action Recognition
    Zheng, Hui
    Zhao, Ye-Sheng
    Zhang, Bo
    Shang, Guo-Qiang
    [J]. IEEE Sensors Letters, 2024, 8 (11):
  • [50] Multilevel Spatial-Temporal Excited Graph Network for Skeleton-Based Action Recognition
    Zhu, Yisheng
    Shuai, Hui
    Liu, Guangcan
    Liu, Qingshan
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 496 - 508