Motion Enhanced Model Based on High-Level Spatial Features

被引：0

作者：

Wu, Yang ^{[1
]}

Guo, Lei ^{[1
]}

Dai, Xiaodong ^{[1
]}

Zhang, Bin ^{[1
]}

Park, Dong-Won ^{[2
]}

Ma, Ming ^{[1
]}

机构：

[1] College of Computer Science and Engineering, Inner Mongolia University, Hohhot,010021, China

[2] Department of Information and Communications, PaiChai University, Daejeon,35345, Korea, Republic of

来源：

Computers, Materials and Continua | 2022年 / 73卷 / 03期

关键词：

Deep learning - Extraction - Optical flows;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Action recognition has become a current research hotspot in computer vision. Compared to other deep learning methods, Two-stream convolutional network structure achieves better performance in action recognition, which divides the network into spatial and temporal streams, using video frame images as well as dense optical streams in the network, respectively, to obtain the category labels. However, the two-stream network has some drawbacks, i.e., using dense optical flow as the input of the temporal stream, which is computationally expensive and extremely time-consuming for the current extraction algorithm and cannot meet the requirements of real-time tasks. In this paper, instead of the dense optical flow, the Motion Vectors (MVs) are used and extracted from the compressed domain as temporal features, which greatly reduces the extraction time. However, the motion pattern that MVs contain is coarser, which leads to low accuracy. In this paper, we propose two strategies to improve the accuracy: firstly, an accumulated strategy is used to enhance the motion information and continuity of MVs; secondly, knowledge distillation is used to fuse the spatial information into the temporal stream so that more information (e.g., motion details, colors, etc.) is obtainable. Experimental results show that the accuracy of MV can be greatly improved by the strategies proposed in this paper and the final recognition for human actions accuracy is guaranteed without using optical flow. © 2022 Tech Science Press. All rights reserved.

引用

页码：5911 / 5924

共 50 条

[21] Combination of high-level features with low-level features for detection of pedestrian
Takarli, Fariba
Aghagolzadeh, Ali
Seyedarabi, Hadi
SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (01) : 93 - 101
[22] Exploring high-level features for detecting cyberpedophilia
Bogdanova, Dasha
Rosso, Paolo
Solorio, Thamar
COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01): : 108 - 120
[23] THE HIGH-LEVEL LANGUAGE AND OPERATING SYSTEM SUPPORT FEATURES OF ADVANCED MICROPROCESSORS .1. HIGH-LEVEL LANGUAGE SUPPORT FEATURES
NG, KW
MOK, KY
MICROPROCESSING AND MICROPROGRAMMING, 1987, 19 (03): : 203 - 218
[24] High-level synthesis of an enhanced connex memory
Hascsi, Z
Mitu, B
Petre, M
Stefan, G
CAS '96 PROCEEDINGS - 1996 INTERNATIONAL SEMICONDUCTOR CONFERENCE, 19TH EDITION, VOLS 1 AND 2, 1996, : 163 - 166
[25] Search-based Detection of High-level Model Changes
ben Fadhel, Ameni
Kessentini, Marouane
Langer, Philip
Wimmer, Manuel
2012 28TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE (ICSM), 2012, : 212 - 221
[26] MODEL-BASED STRATEGIES FOR HIGH-LEVEL ROBOT VISION
SHNEIER, MO
LUMIA, R
KENT, EW
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1986, 33 (03): : 293 - 306
[27] Change Detection Based on Low-Level to High-Level Features Integration With Limited Samples
Wang, Xin
Du, Peijun
Chen, Dongmei
Liu, Sicong
Zhang, Wei
Li, Erzhu
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 6260 - 6276
[28] Sternum image retrieval based on high-level semantic information and low-level features
Chen, Qin
Tai, Xiaoying
BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 362 - 366
[29] OPTIMIZATION OF MOTION PRIMITIVES FOR HIGH-LEVEL MOTION PLANNING OF MODULAR ROBOTS
Vonasek, Vojtech
Penc, Ondrej
Kosnar, Karel
Preucil, Libor
MOBILE SERVICE ROBOTICS, 2014, : 109 - +
[30] High-Level Geometry-based Features of Video Modality for Emotion Prediction
Weber, Raphael
Barrielle, Vincent
Soladie, Catherine
Seguier, Renaud
PROCEEDINGS OF THE 6TH INTERNATIONAL WORKSHOP ON AUDIO/VISUAL EMOTION CHALLENGE (AVEC'16), 2016, : 51 - 58

← 1 2 3 4 5 →