SPATIO-TEMPORAL MULTI-SCALE SOFT QUANTIZATION LEARNING FOR SKELETON-BASED HUMAN ACTION RECOGNITION

被引:3
|
作者
Yang, Jianyu [1 ]
Zhu, Chen [1 ]
Yuan, Junsong [2 ]
机构
[1] Soochow Univ, Sch Rail Transportat, Suzhou, Peoples R China
[2] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY USA
基金
中国国家自然科学基金;
关键词
Soft quantization; multi-scale; action recognition; Bag-of-Features;
D O I
10.1109/ICME.2019.00189
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Effective feature representation is important for action recognition. In this paper, a novel soft quantization learning method is proposed to represent visual features for action recognition. Specifically, we propose a dual multi-scale soft-quantization network, which is a trainable quantizer using RBF neurons. The RBF layer includes dual multi-scale structure, namely a three-level hierarchical skeleton structure in space, and a temporal-pyramid based multi-scale time structure. Different spatial levels in the RBF layer have respective RBF neurons for hierarchical spatial information, while the temporal scales share them to reduce the number of parameters in the network. An accumulation layer following the RBF layer summarizes the RBF output as a histogram representation for classification task. The proposed method is end-to-end differentiable that can be trained using regular back-propagation. The conducted experiments on benchmark datasets verify that the proposed method outperforms state-of-the-art methods.
引用
收藏
页码:1078 / 1083
页数:6
相关论文
共 50 条
  • [1] Multi-scale spatio-temporal network for skeleton-based gait recognition
    He, Dongzhi
    Xue, Yongle
    Li, Yunyu
    Sun, Zhijie
    Xiao, Xingmei
    Wang, Jin
    [J]. AI COMMUNICATIONS, 2023, 36 (04) : 297 - 310
  • [2] Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition
    Chen, Tailin
    Zhou, Desen
    Wang, Jian
    Wang, Shidong
    Guan, Yu
    He, Xuming
    Ding, Errui
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4334 - 4342
  • [3] Learning Representations by Contrastive Spatio-Temporal Clustering for Skeleton-Based Action Recognition
    Wang, Mingdao
    Li, Xueming
    Chen, Siqi
    Zhang, Xianlin
    Ma, Lei
    Zhang, Yue
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3207 - 3220
  • [4] MTT: Multi-Scale Temporal Transformer for Skeleton-Based Action Recognition
    Kong, Jun
    Bian, Yuhang
    Jiang, Min
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 528 - 532
  • [5] Joint Learning in the Spatio-Temporal and Frequency Domains for Skeleton-Based Action Recognition
    Hu, Guyue
    Cui, Bo
    Yu, Shan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (09) : 2207 - 2220
  • [6] Global spatio-temporal synergistic topology learning for skeleton-based action recognition
    Dai, Meng
    Sun, Zhonghua
    Wang, Tianyi
    Feng, Jinchao
    Jia, Kebin
    [J]. PATTERN RECOGNITION, 2023, 140
  • [7] Spatio-temporal stacking model for skeleton-based action recognition
    Yufeng Zhong
    Qiuyan Yan
    [J]. Applied Intelligence, 2022, 52 : 12116 - 12130
  • [8] Leveraging Spatio-Temporal Dependency for Skeleton-Based Action Recognition
    Lee, Jungho
    Lee, Minhyeok
    Cho, Suhwan
    Woo, Sungmin
    Jang, Sungjun
    Lee, Sangyoun
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10221 - 10230
  • [9] Spatio-temporal segments attention for skeleton-based action recognition
    Qiu, Helei
    Hou, Biao
    Ren, Bo
    Zhang, Xiaohua
    [J]. NEUROCOMPUTING, 2023, 518 : 30 - 38
  • [10] Spatio-Temporal Difference Descriptor for Skeleton-Based Action Recognition
    Ding, Chongyang
    Liu, Kai
    Korhonen, Jari
    Belyaev, Evgeny
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1227 - 1235