Action recognition on continuous video

被引:0
|
作者
Y. L. Chang
C. S. Chan
P. Remagnino
机构
[1] University of Malaya,
[2] Kingston upon Thames,undefined
来源
关键词
Deep learning; Action recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Video action recognition has been a challenging task over the years. The challenge herein is not only due to the complication in increasing information in videos but also the requirement of an efficient method to retain information over a longer-term where human action would take to perform. This paper proposes a novel framework, named as long-term video action recognition (LVAR) to perform generic action classification in the continuous video. The idea of LVAR is introducing a partial recurrence connection to propagate information within every layer of a spatial-temporal network, such as the well-known C3D. Empirically, we show that this addition allows the C3D network to access long-term information, and subsequently improves action recognition performance with videos of different length selected from both UCF101 and miniKinetics datasets. Further confirmation of our approach is strengthened with experiments on untrimmed video from the Thumos14 dataset.
引用
收藏
页码:1233 / 1243
页数:10
相关论文
共 50 条
  • [41] A Robust and Efficient Video Representation for Action Recognition
    Heng Wang
    Dan Oneata
    Jakob Verbeek
    Cordelia Schmid
    International Journal of Computer Vision, 2016, 119 : 219 - 238
  • [42] Video and Image Complexity in Human Action Recognition
    Burgos-Madrigal, Andrea
    Altamirano-Robles, Leopoldo
    PROGRESS IN ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, 2021, 13055 : 349 - 359
  • [43] Exploring Action Recognition in Endoscopy Video Datasets
    Tian, Yuchen
    Paheding, Sidike
    Azimi, Ehsan
    Lee, Eung-Joo
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2024, 2024, 13034
  • [44] Combining Video Subsequences for Human Action Recognition
    Onofri, Leonardo
    Soda, Paolo
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 597 - 600
  • [45] YogaTube: A Video Benchmark for Yoga Action Recognition
    Yadav, Santosh Kumar
    Singh, Guntaas
    Verma, Manisha
    Tiwari, Kamlesh
    Pandey, Hari Mohan
    Akbar, Shaik Ali
    Corcoran, Peter
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [46] Temporal Difference Networks for Video Action Recognition
    Ng, Joe Yue-Hei
    Davis, Larry S.
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1577 - 1586
  • [47] Stereoscopic Video Description for Human Action Recognition
    Mademlis, Ioannis
    Iosifidis, Alexandros
    Tefas, Anastasios
    Nikolaidis, Nikos
    Pitas, Ioannis
    2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR MULTIMEDIA, SIGNAL AND VISION PROCESSING (CIMSIVP), 2014, : 1 - 6
  • [48] Spatiotemporal Fusion Networks for Video Action Recognition
    Zheng Liu
    Haifeng Hu
    Junxuan Zhang
    Neural Processing Letters, 2019, 50 : 1877 - 1890
  • [49] Less is More: Video Trimming for Action Recognition
    Antic, Borislav
    Milbich, Timo
    Ommer, Bjoern
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 515 - 521
  • [50] Spatiotemporal Relation Networks for Video Action Recognition
    Liu, Zheng
    Hu, Haifeng
    IEEE ACCESS, 2019, 7 : 14969 - 14976