Action recognition on continuous video

被引:0
|
作者
Y. L. Chang
C. S. Chan
P. Remagnino
机构
[1] University of Malaya,
[2] Kingston upon Thames,undefined
来源
关键词
Deep learning; Action recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Video action recognition has been a challenging task over the years. The challenge herein is not only due to the complication in increasing information in videos but also the requirement of an efficient method to retain information over a longer-term where human action would take to perform. This paper proposes a novel framework, named as long-term video action recognition (LVAR) to perform generic action classification in the continuous video. The idea of LVAR is introducing a partial recurrence connection to propagate information within every layer of a spatial-temporal network, such as the well-known C3D. Empirically, we show that this addition allows the C3D network to access long-term information, and subsequently improves action recognition performance with videos of different length selected from both UCF101 and miniKinetics datasets. Further confirmation of our approach is strengthened with experiments on untrimmed video from the Thumos14 dataset.
引用
收藏
页码:1233 / 1243
页数:10
相关论文
共 50 条
  • [31] Video Analytics Framework for Human Action Recognition
    Khan, Muhammad Attique
    Alhaisoni, Majed
    Armghan, Ammar
    Alenezi, Fayadh
    Tariq, Usman
    Nam, Yunyoung
    Akram, Tallha
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (03): : 3841 - 3859
  • [32] Spatiotemporal Pyramid Network for Video Action Recognition
    Wang, Yunbo
    Long, Mingsheng
    Wang, Jianmin
    Yu, Philip S.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2097 - 2106
  • [33] Dense Dilated Network for Video Action Recognition
    Xu, Baohan
    Ye, Hao
    Zheng, Yingbin
    Wang, Heng
    Luwang, Tianyu
    Jiang, Yu-Gang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (10) : 4941 - 4953
  • [34] Dynamic Normalization and Relay for Video Action Recognition
    Cai, Dongqi
    Yao, Anbang
    Chen, Yurong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [35] Spatiotemporal Residual Networks for Video Action Recognition
    Feichtenhofer, Christoph
    Pinz, Axel
    Wildes, Richard P.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [36] Curvature: A signature for Action Recognition in Video Sequences
    Chen, He
    Chirikjian, Gregory S.
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 3743 - 3750
  • [37] Temporal Contrastive Pretraining for Video Action Recognition
    Lorre, Guillaume
    Rabarisoa, Jaonary
    Orcesi, Astrid
    Ainouz, Samia
    Canu, Stephane
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 651 - 659
  • [38] Temporal Bilinear Networks for Video Action Recognition
    Li, Yanghao
    Song, Sijie
    Li, Yuqi
    Liu, Jiaying
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8674 - 8681
  • [39] Submodular Attribute Selection for Action Recognition in Video
    Zheng, Jinging
    Jiang, Zhuolin
    Chellappa, Rama
    Phillips, P. Jonathon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [40] Video Action Recognition with Attentive Semantic Units
    Chen, Yifei
    Chen, Dapeng
    Liu, Ruijin
    Li, Hao
    Peng, Wei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10136 - 10146