Live Video Action Recognition from Unsupervised Action Proposals

被引：0

作者：

Lopcz-Sastrc, Roberto J. ^{[1
]}

Baptista-Rios, Marcos ^{[2
]}

Acevedo-Rodriguez, Francisco J. ^{[1
]}

Martin-Martin, Pilar ^{[1
]}

Maldonado-Bascon, Saturnino ^{[1
]}

机构：

[1] Univ Alcala, Dept Signal Theory, GRAM, Alcala De Henares, Spain

[2] Gradiant, Multimodal Informat Grp, Vigo, Spain

来源：

PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021) | 2021年

关键词：

LOCALIZATION;

D O I：

10.23919/MVA51890.2021.9511355

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The problem of action detection in untrimmed videos consists in localizing those parts of a certain video that can contain an action. Typically, state-of-the-art approaches to this problem use a temporal action proposals (TAPs) generator followed by an action classifier module. Moreover, TAPs solutions are learned from a supervised setting, and need the entire video to be processed to produce effective proposals. These properties become a limitation for certain real applications in which a system requires to know the content of the video in an online fashion. To do so, in this work we introduce a live video action detection application which integrates the action classifier step with an unsupervised and online TAPs generator. We evaluate, for the first time, the precision of this novel pipeline for the problem of action detection in untrimmed videos. We offer a thorough experimental evaluation in ActivityNet dataset, where our unsupervised model can compete with the state-of-the-art supervised solutions.

引用

页数：6

共 50 条

[31] Meta-action descriptor for action recognition in RGBD video
Huang, Min
Su, Song-Zhi
Cai, Guo-Rong
Zhang, Hong-Bo
Cao, Donglin
Li, Shao-Zi
IET COMPUTER VISION, 2017, 11 (04) : 301 - 308
[32] Coupling Video Segmentation and Action Recognition
Ghodrati, Amir
Pedersoli, Marco
Tuytelaars, Tinne
2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 618 - 625
[33] Breaking video into pieces for action recognition
Ying Zheng
Hongxun Yao
Xiaoshuai Sun
Xuesong Jiang
Fatih Porikli
Multimedia Tools and Applications, 2017, 76 : 22195 - 22212
[34] Action recognition in broadcast tennis video
Zhu, Guangyu
Xu, Changsheng
Huang, Qingming
Gao, Wen
18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2006, : 251 - +
[35] Modeling Video Evolution For Action Recognition
Fernando, Basura
Gavves, Efstratios
Oramas, Jose M.
Ghodrati, Amir
Tuytelaars, Tinne
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 5378 - 5387
[36] Recurring the Transformer for Video Action Recognition
Yang, Jiewen
Dong, Xingbo
Liu, Liujun
Zhang, Chao
Shen, Jiajun
Yu, Dahai
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14043 - 14053
[37] Breaking video into pieces for action recognition
Zheng, Ying
Yao, Hongxun
Sun, Xiaoshuai
Jiang, Xuesong
Porikli, Fatih
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (21) : 22195 - 22212
[38] Live Action: Can Young Children Learn Verbs From Video?
Roseberry, Sarah
Hirsh-Pasek, Kathy
Parish-Morris, Julia
Golinkoff, Roberta M.
CHILD DEVELOPMENT, 2009, 80 (05) : 1360 - 1375
[39] A Recursive Constrained Framework for Unsupervised Video Action Clustering
Peng, Bo
Lei, Jianjun
Fu, Huazhu
Shao, Ling
Huang, Qingming
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (01) : 555 - 565
[40] A Comparison Study on Human Action Recognition from Video Streams
Lin, S. C. F.
Wong, C. Y.
Ren, T. R.
Kwok, N. M.
2012 5TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), 2012, : 1162 - 1166

← 1 2 3 4 5 →