Motion Enhanced Model Based on High-Level Spatial Features

被引:0
|
作者
Wu, Yang [1 ]
Guo, Lei [1 ]
Dai, Xiaodong [1 ]
Zhang, Bin [1 ]
Park, Dong-Won [2 ]
Ma, Ming [1 ]
机构
[1] College of Computer Science and Engineering, Inner Mongolia University, Hohhot,010021, China
[2] Department of Information and Communications, PaiChai University, Daejeon,35345, Korea, Republic of
来源
Computers, Materials and Continua | 2022年 / 73卷 / 03期
关键词
Deep learning - Extraction - Optical flows;
D O I
暂无
中图分类号
学科分类号
摘要
Action recognition has become a current research hotspot in computer vision. Compared to other deep learning methods, Two-stream convolutional network structure achieves better performance in action recognition, which divides the network into spatial and temporal streams, using video frame images as well as dense optical streams in the network, respectively, to obtain the category labels. However, the two-stream network has some drawbacks, i.e., using dense optical flow as the input of the temporal stream, which is computationally expensive and extremely time-consuming for the current extraction algorithm and cannot meet the requirements of real-time tasks. In this paper, instead of the dense optical flow, the Motion Vectors (MVs) are used and extracted from the compressed domain as temporal features, which greatly reduces the extraction time. However, the motion pattern that MVs contain is coarser, which leads to low accuracy. In this paper, we propose two strategies to improve the accuracy: firstly, an accumulated strategy is used to enhance the motion information and continuity of MVs; secondly, knowledge distillation is used to fuse the spatial information into the temporal stream so that more information (e.g., motion details, colors, etc.) is obtainable. Experimental results show that the accuracy of MV can be greatly improved by the strategies proposed in this paper and the final recognition for human actions accuracy is guaranteed without using optical flow. © 2022 Tech Science Press. All rights reserved.
引用
收藏
页码:5911 / 5924
相关论文
共 50 条
  • [21] Combination of high-level features with low-level features for detection of pedestrian
    Takarli, Fariba
    Aghagolzadeh, Ali
    Seyedarabi, Hadi
    SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (01) : 93 - 101
  • [22] Exploring high-level features for detecting cyberpedophilia
    Bogdanova, Dasha
    Rosso, Paolo
    Solorio, Thamar
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01): : 108 - 120
  • [23] THE HIGH-LEVEL LANGUAGE AND OPERATING SYSTEM SUPPORT FEATURES OF ADVANCED MICROPROCESSORS .1. HIGH-LEVEL LANGUAGE SUPPORT FEATURES
    NG, KW
    MOK, KY
    MICROPROCESSING AND MICROPROGRAMMING, 1987, 19 (03): : 203 - 218
  • [24] High-level synthesis of an enhanced connex memory
    Hascsi, Z
    Mitu, B
    Petre, M
    Stefan, G
    CAS '96 PROCEEDINGS - 1996 INTERNATIONAL SEMICONDUCTOR CONFERENCE, 19TH EDITION, VOLS 1 AND 2, 1996, : 163 - 166
  • [25] Search-based Detection of High-level Model Changes
    ben Fadhel, Ameni
    Kessentini, Marouane
    Langer, Philip
    Wimmer, Manuel
    2012 28TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE (ICSM), 2012, : 212 - 221
  • [26] MODEL-BASED STRATEGIES FOR HIGH-LEVEL ROBOT VISION
    SHNEIER, MO
    LUMIA, R
    KENT, EW
    COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1986, 33 (03): : 293 - 306
  • [27] Change Detection Based on Low-Level to High-Level Features Integration With Limited Samples
    Wang, Xin
    Du, Peijun
    Chen, Dongmei
    Liu, Sicong
    Zhang, Wei
    Li, Erzhu
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 6260 - 6276
  • [28] Sternum image retrieval based on high-level semantic information and low-level features
    Chen, Qin
    Tai, Xiaoying
    BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 362 - 366
  • [29] OPTIMIZATION OF MOTION PRIMITIVES FOR HIGH-LEVEL MOTION PLANNING OF MODULAR ROBOTS
    Vonasek, Vojtech
    Penc, Ondrej
    Kosnar, Karel
    Preucil, Libor
    MOBILE SERVICE ROBOTICS, 2014, : 109 - +
  • [30] High-Level Geometry-based Features of Video Modality for Emotion Prediction
    Weber, Raphael
    Barrielle, Vincent
    Soladie, Catherine
    Seguier, Renaud
    PROCEEDINGS OF THE 6TH INTERNATIONAL WORKSHOP ON AUDIO/VISUAL EMOTION CHALLENGE (AVEC'16), 2016, : 51 - 58