A Context Based Deep Temporal Embedding Network in Action Recognition

Cited by: 2
Authors
Koohzadi, Maryam [1]
Charkari, Nasrollah Moghadam [1]
Affiliations
[1] Tarbiat Modares Univ, Dept Elect & Comp Engn, Tehran, Iran
Keywords
Deep temporal embedding; Self-supervision; Residual technique; Two-step deep method; Long-term temporal representation; Attention
DOI
10.1007/s11063-020-10248-1
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Long-term temporal representation methods incur high computational cost, which restricts their practical use in real-world applications. We propose a two-step deep residual method that efficiently learns long-term discriminative temporal representations while significantly reducing computational cost. In the first step, a novel self-supervised deep temporal embedding method embeds repetitive short-term motions in a cluster-friendly feature space. In the second step, an efficient temporal representation is built from the differences between the original data and its associated repetitive motion clusters, which serves as a novel deep residual method. Experimental results demonstrate that the proposed method achieves competitive results on challenging human action recognition datasets such as UCF101, HMDB51, THUMOS14, and Kinetics-400.
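The two-step idea in the abstract (cluster short-term motion embeddings, then represent a video by its differences from the assigned clusters) can be illustrated with a small sketch. This is not the authors' network: the learned deep temporal embedding is replaced here by random vectors, the clustering by plain k-means, and the residual step by a VLAD-style sum of residuals. All function names, sizes, and parameters below are hypothetical and chosen only for illustration.

```python
import numpy as np


def kmeans(x, k, iters=20, seed=0):
    """Plain Lloyd's k-means over rows of x; returns the (k, dim) centers."""
    rng = np.random.default_rng(seed)
    centers = x[rng.choice(len(x), size=k, replace=False)].copy()
    for _ in range(iters):
        labels = assign(x, centers)
        for j in range(k):
            members = x[labels == j]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return centers


def assign(x, centers):
    """Index of the nearest center (squared Euclidean distance) for each row."""
    d = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)


def residual_descriptor(segments, centers):
    """Sum per-cluster residuals of one video's segments, then L2-normalize."""
    labels = assign(segments, centers)
    out = np.zeros_like(centers, dtype=float)
    # Unbuffered scatter-add: out[labels[i]] += segments[i] - centers[labels[i]]
    np.add.at(out, labels, segments - centers[labels])
    flat = out.ravel()
    norm = np.linalg.norm(flat)
    return flat / norm if norm > 0 else flat


# Stand-in data (assumption): 200 pooled short-term embeddings of dimension 8
# used to fit the clusters, and one video made of 12 short-term segments.
rng = np.random.default_rng(1)
pooled = rng.normal(size=(200, 8))
video = rng.normal(size=(12, 8))

centers = kmeans(pooled, k=4)
descriptor = residual_descriptor(video, centers)
print(descriptor.shape)
```

The clusters are fitted on embeddings pooled across videos and the residuals are accumulated per video, so the descriptor has a fixed length (number of clusters times embedding dimension) regardless of how many segments the video contains.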
Pages: 187-220
Page count: 34
Related papers
50 in total
  • [21] Temporal Context Aggregation Network for Temporal Action Proposal Refinement
    Qing, Zhiwu
    Su, Haisheng
    Gan, Weihao
    Wang, Dongliang
    Wu, Wei
    Wang, Xiang
    Qiao, Yu
    Yan, Junjie
    Gao, Changxin
    Sang, Nong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021 : 485 - 494
  • [22] Global Temporal Difference Network for Action Recognition
    Xie, Zhao
    Chen, Jiansong
    Wu, Kewei
    Guo, Dan
    Hong, Richang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7594 - 7606
  • [23] Temporal Segment Connection Network for Action Recognition
    Li, Qian
    Yang, Wenzhu
    Chen, Xiangyang
    Yuan, Tongtong
    Wang, Yuxia
    IEEE ACCESS, 2020, 8 : 179118 - 179127
  • [24] Silhouette analysis for human action recognition based on maximum spatio-temporal dissimilarity embedding
    Jian Cheng
    Haijun Liu
    Hongsheng Li
    Machine Vision and Applications, 2014, 25 : 1007 - 1018
  • [25] Silhouette analysis for human action recognition based on maximum spatio-temporal dissimilarity embedding
    Cheng, Jian
    Liu, Haijun
    Li, Hongsheng
    MACHINE VISION AND APPLICATIONS, 2014, 25 (04) : 1007 - 1018
  • [26] Student Action Recognition Based on Deep Convolutional Generative Adversarial Network
    Cheng, Yanyan
    Dai, Zhongjian
    Ji, Ye
    Li, Simin
    Jia, Zhiyang
    Hirota, Kaoru
    Dai, Yaping
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020 : 128 - 133
  • [27] Long Jump Action Recognition Based on Deep Convolutional Neural Network
    Wang, Zhiteng
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [28] Human Action Recognition based on Simple Deep Convolution Network PCANet
    Abdelbaky, Amany
    Aly, Saleh
    PROCEEDINGS OF 2020 INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN COMMUNICATION AND COMPUTER ENGINEERING (ITCE), 2020 : 257 - 262
  • [29] Deep autoencoder architecture with outliers for temporal attributed network embedding
    Mo, Xian
    Pang, Jun
    Liu, Zhiming
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 240
  • [30] Transforming spatio-temporal self-attention using action embedding for skeleton-based action recognition
    Ahmad, Tasweer
    Rizvi, Syed Tahir Hussain
    Kanwal, Neel
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95