A DISTRIBUTION BASED VIDEO REPRESENTATION FOR HUMAN ACTION RECOGNITION

被引：3

作者：

Song, Yan ^{[1
,2
]}

Tang, Sheng ^{[1
]}

Zheng, Yan-Tao ^{[3
]}

Chua, Tat-Seng ^{[4
]}

Zhang, Yongdong ^{[1
]}

Lin, Shouxun ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Comp Technol, Lab Adv Comp Res, Beijing, Peoples R China

[2] Chinese Acad Sci, Grad Sch, Beijing, Peoples R China

[3] Inst Infocomm Res, A STAR, Singapore, Singapore

[4] Natl Univ Singapore, Sch Comp, Singapore 117548, Singapore

来源：

2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010) | 2010年

关键词：

human action recognition; probabilistic video representation; information-theoretic video matching;

D O I：

10.1109/ICME.2010.5582550

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Most current research on human action recognition in videos uses the bag-of-words (BoW) representations based on vector quantization on local spatial temporal features, due to the simplicity and good performance of such representations. In contrast to the BoW schemes, this paper explores a localized, continuous and probabilistic video representation. Specifically, the proposed representation encodes the visual and motion information of an ensemble of local spatial temporal (ST) features of a video into a distribution estimated by a generative probabilistic model such as the Gaussian Mixture Model. Furthermore, this probabilistic video representation naturally gives rise to an information-theoretic distance metric of videos. This makes the representation readily applicable as input to most discriminative classifiers, such as the nearest neighbor schemes and the kernel methods. The experiments on two datasets, KTH and UCF sports, show that the proposed approach could deliver promising results.

引用

页码：772 / 777

页数：6

共 50 条

[31] Primitive Based Action Representation and Recognition
Sanmohan
Kruger, Volker
[J]. IMAGE ANALYSIS, PROCEEDINGS, 2009, 5575 : 31 - 40
[32] Space-Time Robust Video Representation for Action Recognition
Ballas, Nicolas
Yang, Yi
Lan, Zhen-zhong
Delezoide, Betrand
Preteux, Francoise
Hauptmann, Alex
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2704 - 2711
[33] Video sketch: A middle-level representation for action recognition
Xing-Yuan Zhang
Ya-Ping Huang
Yang Mi
Yan-Ting Pei
Qi Zou
Song Wang
[J]. Applied Intelligence, 2021, 51 : 2589 - 2608
[34] Video sketch: A middle-level representation for action recognition
Zhang, Xing-Yuan
Huang, Ya-Ping
Mi, Yang
Pei, Yan-Ting
Zou, Qi
Wang, Song
[J]. APPLIED INTELLIGENCE, 2021, 51 (04) : 2589 - 2608
[35] A survey of video-based human action recognition in team sports
Yin, Hongwei
Sinnott, Richard O.
Jayaputera, Glenn T.
[J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (11)
[36] Research on Video-Based Human Action Behavior Recognition Algorithms
Si, Haifei
Hu, Xingliu
Wang, Yizhi
[J]. 2019 5TH INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND MATERIAL APPLICATION, 2020, 440
[37] Human Action Recognition Based on a Spatio-Temporal Video Autoencoder
Sousa e Santos, Anderson Carlos
Pedrini, Helio
[J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (11)
[38] Dynamic Motion Representation for Human Action Recognition
Asghari-Esfeden, Sadjad
Sznaier, Mario
Camps, Octavia
[J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 546 - 555
[39] Using SAX representation for human action recognition
Junejo, Imran N.
Al Aghbari, Zaher
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2012, 23 (06) : 853 - 861
[40] Human Action Recognition using Sparse Representation
Liu, Changhong
Yang, Yang
Chen, Yong
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 4, 2009, : 184 - +

← 1 2 3 4 5 →