A DISTRIBUTION BASED VIDEO REPRESENTATION FOR HUMAN ACTION RECOGNITION

被引:3
|
作者
Song, Yan [1 ,2 ]
Tang, Sheng [1 ]
Zheng, Yan-Tao [3 ]
Chua, Tat-Seng [4 ]
Zhang, Yongdong [1 ]
Lin, Shouxun [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Lab Adv Comp Res, Beijing, Peoples R China
[2] Chinese Acad Sci, Grad Sch, Beijing, Peoples R China
[3] Inst Infocomm Res, A STAR, Singapore, Singapore
[4] Natl Univ Singapore, Sch Comp, Singapore 117548, Singapore
关键词
human action recognition; probabilistic video representation; information-theoretic video matching;
D O I
10.1109/ICME.2010.5582550
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Most current research on human action recognition in videos uses the bag-of-words (BoW) representations based on vector quantization on local spatial temporal features, due to the simplicity and good performance of such representations. In contrast to the BoW schemes, this paper explores a localized, continuous and probabilistic video representation. Specifically, the proposed representation encodes the visual and motion information of an ensemble of local spatial temporal (ST) features of a video into a distribution estimated by a generative probabilistic model such as the Gaussian Mixture Model. Furthermore, this probabilistic video representation naturally gives rise to an information-theoretic distance metric of videos. This makes the representation readily applicable as input to most discriminative classifiers, such as the nearest neighbor schemes and the kernel methods. The experiments on two datasets, KTH and UCF sports, show that the proposed approach could deliver promising results.
引用
收藏
页码:772 / 777
页数:6
相关论文
共 50 条
  • [31] Primitive Based Action Representation and Recognition
    Sanmohan
    Kruger, Volker
    [J]. IMAGE ANALYSIS, PROCEEDINGS, 2009, 5575 : 31 - 40
  • [32] Space-Time Robust Video Representation for Action Recognition
    Ballas, Nicolas
    Yang, Yi
    Lan, Zhen-zhong
    Delezoide, Betrand
    Preteux, Francoise
    Hauptmann, Alex
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2704 - 2711
  • [33] Video sketch: A middle-level representation for action recognition
    Xing-Yuan Zhang
    Ya-Ping Huang
    Yang Mi
    Yan-Ting Pei
    Qi Zou
    Song Wang
    [J]. Applied Intelligence, 2021, 51 : 2589 - 2608
  • [34] Video sketch: A middle-level representation for action recognition
    Zhang, Xing-Yuan
    Huang, Ya-Ping
    Mi, Yang
    Pei, Yan-Ting
    Zou, Qi
    Wang, Song
    [J]. APPLIED INTELLIGENCE, 2021, 51 (04) : 2589 - 2608
  • [35] A survey of video-based human action recognition in team sports
    Yin, Hongwei
    Sinnott, Richard O.
    Jayaputera, Glenn T.
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (11)
  • [36] Research on Video-Based Human Action Behavior Recognition Algorithms
    Si, Haifei
    Hu, Xingliu
    Wang, Yizhi
    [J]. 2019 5TH INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND MATERIAL APPLICATION, 2020, 440
  • [37] Human Action Recognition Based on a Spatio-Temporal Video Autoencoder
    Sousa e Santos, Anderson Carlos
    Pedrini, Helio
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (11)
  • [38] Dynamic Motion Representation for Human Action Recognition
    Asghari-Esfeden, Sadjad
    Sznaier, Mario
    Camps, Octavia
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 546 - 555
  • [39] Using SAX representation for human action recognition
    Junejo, Imran N.
    Al Aghbari, Zaher
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2012, 23 (06) : 853 - 861
  • [40] Human Action Recognition using Sparse Representation
    Liu, Changhong
    Yang, Yang
    Chen, Yong
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 4, 2009, : 184 - +