Motion Enhanced Model Based on High-Level Spatial Features

被引:0
|
作者
Wu, Yang [1 ]
Guo, Lei [1 ]
Dai, Xiaodong [1 ]
Zhang, Bin [1 ]
Park, Dong-Won [2 ]
Ma, Ming [1 ]
机构
[1] College of Computer Science and Engineering, Inner Mongolia University, Hohhot,010021, China
[2] Department of Information and Communications, PaiChai University, Daejeon,35345, Korea, Republic of
来源
Computers, Materials and Continua | 2022年 / 73卷 / 03期
关键词
Deep learning - Extraction - Optical flows;
D O I
暂无
中图分类号
学科分类号
摘要
Action recognition has become a current research hotspot in computer vision. Compared to other deep learning methods, Two-stream convolutional network structure achieves better performance in action recognition, which divides the network into spatial and temporal streams, using video frame images as well as dense optical streams in the network, respectively, to obtain the category labels. However, the two-stream network has some drawbacks, i.e., using dense optical flow as the input of the temporal stream, which is computationally expensive and extremely time-consuming for the current extraction algorithm and cannot meet the requirements of real-time tasks. In this paper, instead of the dense optical flow, the Motion Vectors (MVs) are used and extracted from the compressed domain as temporal features, which greatly reduces the extraction time. However, the motion pattern that MVs contain is coarser, which leads to low accuracy. In this paper, we propose two strategies to improve the accuracy: firstly, an accumulated strategy is used to enhance the motion information and continuity of MVs; secondly, knowledge distillation is used to fuse the spatial information into the temporal stream so that more information (e.g., motion details, colors, etc.) is obtainable. Experimental results show that the accuracy of MV can be greatly improved by the strategies proposed in this paper and the final recognition for human actions accuracy is guaranteed without using optical flow. © 2022 Tech Science Press. All rights reserved.
引用
收藏
页码:5911 / 5924
相关论文
共 50 条
  • [1] Motion Enhanced Model Based on High-Level Spatial Features
    Wu, Yang
    Guo, Lei
    Dai, Xiaodong
    Zhang, Bin
    Park, Dong-Won
    Ma, Ming
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (03): : 5911 - 5924
  • [2] High-level motion processing
    Verstraten, FAJ
    TRENDS IN COGNITIVE SCIENCES, 1999, 3 (08) : 318 - 318
  • [3] High-level motion processing
    Trends in Cognitive Sciences, 3 (08):
  • [4] Visual Place Recognition by spatial matching of high-level CNN features
    Camara, Luis G.
    Preucil, Libor
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2020, 133
  • [5] Learning spatial hierarchies of high-level features in deep neural network
    Razzaghi, Parvin
    Abbasi, Karim
    Bayat, Pegah
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 70
  • [6] A neural model of high-level motion processing: Line motion and formotion dynamics
    Baloch, AA
    Grossberg, S
    VISION RESEARCH, 1997, 37 (21) : 3037 - 3059
  • [7] An Image Caption Model Incorporating High-level Semantic Features
    Luo, Zhiwang
    Hu, Jiwei
    Liu, Quan
    Deng, Jiamei
    ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
  • [8] A Novel Hand Gesture Recognition Based on High-Level Features
    Li, Jing
    Wang, Jianxin
    Ju, Zhaojie
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2018, 15 (02)
  • [9] Scanpath Prediction Based on High-Level Features and Memory Bias
    Shao, Xuan
    Luo, Ye
    Zhu, Dandan
    Li, Shuqin
    Itti, Laurent
    Lu, Jianwei
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636 : 3 - 13
  • [10] FEATURES OF HIGH-LEVEL LANGUAGES FOR MICROPROCESSORS
    DAVIES, AC
    MICROPROCESSORS AND MICROSYSTEMS, 1987, 11 (02) : 77 - 87