Live Video Action Recognition from Unsupervised Action Proposals

Authors
Lopez-Sastre, Roberto J. [1]
Baptista-Rios, Marcos [2]
Acevedo-Rodriguez, Francisco J. [1]
Martin-Martin, Pilar [1]
Maldonado-Bascon, Saturnino [1]
Affiliations
[1] Univ Alcala, Dept Signal Theory, GRAM, Alcala De Henares, Spain
[2] Gradiant, Multimodal Informat Grp, Vigo, Spain
Keywords
LOCALIZATION
DOI
10.23919/MVA51890.2021.9511355
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The problem of action detection in untrimmed videos consists of localizing those parts of a video that may contain an action. Typically, state-of-the-art approaches to this problem use a temporal action proposals (TAPs) generator followed by an action classifier module. Moreover, TAPs solutions are learned in a supervised setting and need the entire video to be processed to produce effective proposals. These properties become a limitation for certain real applications in which a system needs to know the content of the video in an online fashion. To address this, in this work we introduce a live video action detection application that integrates the action classifier step with an unsupervised and online TAPs generator. We evaluate, for the first time, the precision of this novel pipeline for the problem of action detection in untrimmed videos. We offer a thorough experimental evaluation on the ActivityNet dataset, where our unsupervised model can compete with state-of-the-art supervised solutions.
Pages: 6