Motion Enhanced Model Based on High-Level Spatial Features

被引:0
|
作者
Wu, Yang [1 ]
Guo, Lei [1 ]
Dai, Xiaodong [1 ]
Zhang, Bin [1 ]
Park, Dong-Won [2 ]
Ma, Ming [1 ]
机构
[1] College of Computer Science and Engineering, Inner Mongolia University, Hohhot,010021, China
[2] Department of Information and Communications, PaiChai University, Daejeon,35345, Korea, Republic of
来源
Computers, Materials and Continua | 2022年 / 73卷 / 03期
关键词
Deep learning - Extraction - Optical flows;
D O I
暂无
中图分类号
学科分类号
摘要
Action recognition has become a current research hotspot in computer vision. Compared to other deep learning methods, Two-stream convolutional network structure achieves better performance in action recognition, which divides the network into spatial and temporal streams, using video frame images as well as dense optical streams in the network, respectively, to obtain the category labels. However, the two-stream network has some drawbacks, i.e., using dense optical flow as the input of the temporal stream, which is computationally expensive and extremely time-consuming for the current extraction algorithm and cannot meet the requirements of real-time tasks. In this paper, instead of the dense optical flow, the Motion Vectors (MVs) are used and extracted from the compressed domain as temporal features, which greatly reduces the extraction time. However, the motion pattern that MVs contain is coarser, which leads to low accuracy. In this paper, we propose two strategies to improve the accuracy: firstly, an accumulated strategy is used to enhance the motion information and continuity of MVs; secondly, knowledge distillation is used to fuse the spatial information into the temporal stream so that more information (e.g., motion details, colors, etc.) is obtainable. Experimental results show that the accuracy of MV can be greatly improved by the strategies proposed in this paper and the final recognition for human actions accuracy is guaranteed without using optical flow. © 2022 Tech Science Press. All rights reserved.
引用
收藏
页码:5911 / 5924
相关论文
共 50 条
  • [41] A fast MILP solver for high-level synthesis based on heuristic model reduction and enhanced branch and bound algorithm
    Mina Mirhosseini
    Mahmood Fazlali
    Mohammad K Fallah
    Jeong-A Lee
    The Journal of Supercomputing, 2023, 79 : 12042 - 12073
  • [42] A fast MILP solver for high-level synthesis based on heuristic model reduction and enhanced branch and bound algorithm
    Mirhosseini, Mina
    Fazlali, Mahmood
    Fallah, Mohammad K.
    Lee, Jeong-A
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (11): : 12042 - 12073
  • [43] A Late Fusion Approach for Harnessing Multi-CNN Model High-level Features
    Akilan, T.
    Wu, Q. M. Jonathan
    Safaei, A.
    Jiang, Wei
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 566 - 571
  • [44] Indoor Image Representation by High-Level Semantic Features
    Sitaula, Chiranjibi
    Xiang, Yong
    Zhang, Yushu
    Lu, Xuequan
    Aryal, Sunil
    IEEE ACCESS, 2019, 7 : 84967 - 84979
  • [45] Object Detection by Estimating and Combining High-Level Features
    Levine, Geoffrey
    DeJong, Gerald
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2009, PROCEEDINGS, 2009, 5716 : 161 - 169
  • [46] Using high-level semantic features in video retrieval
    Zheng, Wujie
    Li, Jianmin
    Si, Zhangzhang
    Lin, Fuzong
    Zhang, Bo
    IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2006, 4071 : 370 - 379
  • [47] IDENTIFYING HIGH-LEVEL FEATURES OF TEXTURE-PERCEPTION
    RAO, AR
    LOHSE, GL
    CVGIP-GRAPHICAL MODELS AND IMAGE PROCESSING, 1993, 55 (03): : 218 - 233
  • [48] Learning to pool high-level features for face representation
    Huang, Renjie
    Ye, Mao
    Xu, Pei
    Li, Tao
    Dou, Yumin
    VISUAL COMPUTER, 2015, 31 (12): : 1683 - 1695
  • [49] Analysis of High-level Features for Vocal Emotion Recognition
    Atassi, Hicham
    Esposito, Anna
    Smekal, Zdenek
    2011 34TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2011, : 361 - 366
  • [50] Utilising High-Level Features in Summarisation of Academic Presentations
    Curtis, Keith
    Jones, Gareth J. F.
    Campbell, Nick
    PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 320 - 326