Action recognition on continuous video

被引:0
|
作者
Y. L. Chang
C. S. Chan
P. Remagnino
机构
[1] University of Malaya,
[2] Kingston upon Thames,undefined
来源
关键词
Deep learning; Action recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Video action recognition has been a challenging task over the years. The challenge herein is not only due to the complication in increasing information in videos but also the requirement of an efficient method to retain information over a longer-term where human action would take to perform. This paper proposes a novel framework, named as long-term video action recognition (LVAR) to perform generic action classification in the continuous video. The idea of LVAR is introducing a partial recurrence connection to propagate information within every layer of a spatial-temporal network, such as the well-known C3D. Empirically, we show that this addition allows the C3D network to access long-term information, and subsequently improves action recognition performance with videos of different length selected from both UCF101 and miniKinetics datasets. Further confirmation of our approach is strengthened with experiments on untrimmed video from the Thumos14 dataset.
引用
收藏
页码:1233 / 1243
页数:10
相关论文
共 50 条
  • [21] Video Analysis for Continuous Sign Language Recognition
    Piater, Justus
    Hoyoux, Thomas
    Du, Wei
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : A192 - A195
  • [22] Learning hierarchical video representation for action recognition
    Li Q.
    Qiu Z.
    Yao T.
    Mei T.
    Rui Y.
    Luo J.
    International Journal of Multimedia Information Retrieval, 2017, 6 (1) : 85 - 98
  • [23] Leveraging Temporal Contextualization for Video Action Recognition
    Kim, Minji
    Han, Dongyoon
    Kim, Taekyung
    Han, Bohyung
    COMPUTER VISION - ECCV 2024, PT XXI, 2025, 15079 : 74 - 91
  • [24] Averaging Video Sequences to Improve Action Recognition
    Gao, Zhen
    Lu, Guoliang
    Yan, Peng
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 89 - 93
  • [25] Spatiotemporal Fusion Networks for Video Action Recognition
    Liu, Zheng
    Hu, Haifeng
    Zhang, Junxuan
    NEURAL PROCESSING LETTERS, 2019, 50 (02) : 1877 - 1890
  • [26] A Robust and Efficient Video Representation for Action Recognition
    Wang, Heng
    Oneata, Dan
    Verbeek, Jakob
    Schmid, Cordelia
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 119 (03) : 219 - 238
  • [27] Deep Local Video Feature for Action Recognition
    Lan, Zhenzhong
    Zhu, Yi
    Hauptmann, Alexander G.
    Newsam, Shawn
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1219 - 1225
  • [28] Action Keypoint Network for Efficient Video Recognition
    Chen, Xu
    Han, Yahong
    Wang, Xiaohan
    Sun, Yifan
    Yang, Yi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4980 - 4993
  • [29] Binary Neural Network for Video Action Recognition
    Han, Hongfeng
    Lu, Zhiwu
    Wen, Ji-Rong
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 95 - 106
  • [30] Comparison of action recognition from video and IMUs
    Podoprosvetov, A., V
    Alisejchik, A. P.
    Orlov, I. A.
    14TH INTERNATIONAL SYMPOSIUM INTELLIGENT SYSTEMS, 2021, 186 : 242 - 249