Motion Enhanced Model Based on High-Level Spatial Features

Cited by: 0
Authors
Wu, Yang [1]
Guo, Lei [1]
Dai, Xiaodong [1]
Zhang, Bin [1]
Park, Dong-Won [2]
Ma, Ming [1]
Affiliations
[1] College of Computer Science and Engineering, Inner Mongolia University, Hohhot, 010021, China
[2] Department of Information and Communications, PaiChai University, Daejeon, 35345, Korea, Republic of
Source
Computers, Materials and Continua | 2022, Vol. 73, No. 03
Keywords
Deep learning; Extraction; Optical flows
DOI
Not available
Abstract
Action recognition is an active research topic in computer vision. Compared to other deep learning methods, the two-stream convolutional network achieves better performance in action recognition. It divides the network into a spatial stream and a temporal stream, which take video frames and dense optical flow as input, respectively, to obtain the category labels. However, the two-stream network has a drawback: the temporal stream takes dense optical flow as input, and extracting it with current algorithms is computationally expensive and too slow to meet the requirements of real-time tasks. In this paper, Motion Vectors (MVs) extracted from the compressed domain replace dense optical flow as the temporal features, which greatly reduces the extraction time. However, the motion patterns that MVs contain are coarser, which leads to low accuracy. We propose two strategies to improve the accuracy: first, an accumulation strategy is used to enhance the motion information and continuity of the MVs; second, knowledge distillation is used to fuse spatial information into the temporal stream so that more information (e.g., motion details and colors) is available. Experimental results show that the proposed strategies greatly improve the accuracy of the MV-based stream, and that the final accuracy of human action recognition is maintained without using optical flow. © 2022 Tech Science Press. All rights reserved.
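The two strategies named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how MVs are accumulated or which distillation loss is used. The sketch assumes that accumulation is a plain running sum of per-frame MV fields, and that distillation uses a standard temperature-softened soft-target loss with the spatial stream as teacher and the MV temporal stream as student. The function names, array shapes, and the `temperature` and `alpha` parameters are all hypothetical.

```python
import numpy as np


def accumulate_motion_vectors(mvs):
    """Running sum of per-frame motion vector fields (an assumed scheme).

    mvs: array of shape (T, H, W, 2) holding (dx, dy) per block and frame.
    Returns an array of the same shape where frame t holds the sum of
    frames 0..t, so small per-frame motions add up to a stronger signal.
    """
    return np.cumsum(mvs, axis=0)


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax over the last axis."""
    z = logits / temperature
    z = z - z.max(axis=-1, keepdims=True)
    return z - np.log(np.exp(z).sum(axis=-1, keepdims=True))


def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    """Soft-target distillation loss (assumed form, not taken from the paper).

    student_logits: (N, C) outputs of the MV temporal stream (student).
    teacher_logits: (N, C) outputs of the spatial stream (teacher).
    labels:         (N,) integer class labels.
    """
    # Soft term: cross-entropy between softened teacher and student outputs.
    p_teacher = np.exp(log_softmax(teacher_logits, temperature))
    log_p_student_soft = log_softmax(student_logits, temperature)
    soft = -(p_teacher * log_p_student_soft).sum(axis=-1).mean()
    soft = soft * temperature ** 2  # keep gradient scale comparable

    # Hard term: ordinary cross-entropy against the ground-truth labels.
    log_p_student = log_softmax(student_logits)
    hard = -log_p_student[np.arange(len(labels)), labels].mean()

    return alpha * soft + (1.0 - alpha) * hard


if __name__ == "__main__":
    mvs = np.ones((3, 2, 2, 2))           # three frames of unit motion
    acc = accumulate_motion_vectors(mvs)
    print(acc[-1, 0, 0])                   # accumulated motion at one block

    teacher = np.array([[2.0, 0.0, 0.0]])
    student = np.array([[1.5, 0.2, 0.1]])
    print(distillation_loss(student, teacher, np.array([0])))
```

In this sketch the student is pulled toward the teacher's softened class distribution as well as toward the ground-truth label, which is one common way to transfer information from one stream to another; the weighting between the two terms is a free choice here.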
Pages: 5911-5924
Related papers
50 in total
  • [31] Brain Tumour Segmentation based on Extremely Randomized Forest with High-Level Features
    Pinto, Adriano
    Pereira, Sergio
    Correia, Higino
    Oliveira, J.
    Rasteiro, Deolinda M. L. D.
    Silva, Carlos A.
    2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 3037 - 3040
  • [32] High-Level features for automatic skin lesions neural network based classification
    Abbes, Wiem
    Sellami, Dorra
    2016 SECOND INTERNATIONAL IMAGE PROCESSING, APPLICATIONS AND SYSTEMS (IPAS), 2016,
  • [33] Interpretable Music Categorisation Based on Fuzzy Rules and High-Level Audio Features
    Vatolkin, Igor
    Rudolph, Guenter
    DATA SCIENCE, LEARNING BY LATENT STRUCTURES, AND KNOWLEDGE DISCOVERY, 2015, : 423 - 432
  • [34] High-level clothes description based on colour-texture and structural features
    Borràs, A
    Tous, F
    Lladós, J
    Vanrell, M
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PROCEEDINGS, 2003, 2652 : 108 - 116
  • [35] Radial bias alters high-level motion perception
    Menceloglu, Melisa
    Nakayama, Ken
    Song, Joo-Hyun
    VISION RESEARCH, 2023, 209
  • [36] HIGH-LEVEL MOTION CONTROL PROGRAMMING USING DSPS
    MESHKAT, S
    CONTROL ENGINEERING, 1988, 35 (02) : 50 - 51
  • [37] AN ASYNCHRONOUS MODEL FOR HIGH-LEVEL SYNTHESIS
    BRAGE, JP
    MICROELECTRONICS JOURNAL, 1994, 25 (03) : 199 - 213
  • [38] Psilocybin impairs high-level but not low-level motion perception
    Carter, OL
    Pettigrew, JD
    Burr, DC
    Alais, D
    Hasler, F
    Vollenweider, FX
    NEUROREPORT, 2004, 15 (12) : 1947 - 1951
  • [39] ENHANCING MODEL-BASED SKIN COLOR DETECTION: FROM LOW-LEVEL RGB FEATURES TO HIGH-LEVEL DISCRIMINATIVE BINARY-CLASS FEATURES
    Cheng, You-Chi
    Feng, Zhe
    Weng, Fuliang
    Lee, Chin-Hui
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 1401 - 1404
  • [40] THE NEED FOR IMPULSIVITY & SMOOTHNESS Improving HCI by Qualitatively Measuring New High-Level Human Motion Features
    Mazzarino, Barbara
    Mancini, Maurizio
    SIGMAP 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2009, : 62 - 67