A DISTRIBUTION BASED VIDEO REPRESENTATION FOR HUMAN ACTION RECOGNITION

被引:3
|
作者
Song, Yan [1 ,2 ]
Tang, Sheng [1 ]
Zheng, Yan-Tao [3 ]
Chua, Tat-Seng [4 ]
Zhang, Yongdong [1 ]
Lin, Shouxun [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Lab Adv Comp Res, Beijing, Peoples R China
[2] Chinese Acad Sci, Grad Sch, Beijing, Peoples R China
[3] Inst Infocomm Res, A STAR, Singapore, Singapore
[4] Natl Univ Singapore, Sch Comp, Singapore 117548, Singapore
关键词
human action recognition; probabilistic video representation; information-theoretic video matching;
D O I
10.1109/ICME.2010.5582550
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Most current research on human action recognition in videos uses the bag-of-words (BoW) representations based on vector quantization on local spatial temporal features, due to the simplicity and good performance of such representations. In contrast to the BoW schemes, this paper explores a localized, continuous and probabilistic video representation. Specifically, the proposed representation encodes the visual and motion information of an ensemble of local spatial temporal (ST) features of a video into a distribution estimated by a generative probabilistic model such as the Gaussian Mixture Model. Furthermore, this probabilistic video representation naturally gives rise to an information-theoretic distance metric of videos. This makes the representation readily applicable as input to most discriminative classifiers, such as the nearest neighbor schemes and the kernel methods. The experiments on two datasets, KTH and UCF sports, show that the proposed approach could deliver promising results.
引用
收藏
页码:772 / 777
页数:6
相关论文
共 50 条
  • [1] An overview of sparse representation based action recognition in video
    Ushapreethi, P.
    Lakshmipriya, G. G.
    [J]. 2018 2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, AND SIGNAL PROCESSING (ICCCSP): SPECIAL FOCUS ON TECHNOLOGY AND INNOVATION FOR SMART ENVIRONMENT, 2018, : 63 - 67
  • [2] ACTION RECOGNITION BASED ON KINEMATIC REPRESENTATION OF VIDEO DATA
    Sun, Xin
    Huang, Di
    Wang, Yunhong
    Qin, Jie
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1530 - 1534
  • [3] Video action recognition based on visual rhythm representation
    Moreira, Thierry Pinheiro
    Menotti, David
    Pedrini, Helio
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 71
  • [4] ACTLETS: A NOVEL LOCAL REPRESENTATION FOR HUMAN ACTION RECOGNITION IN VIDEO
    Ullah, Muhammad Muneeb
    Laptev, Ivan
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 777 - 780
  • [5] Exploring probabilistic localized video representation for human action recognition
    Yan Song
    Sheng Tang
    Yan-Tao Zheng
    Tat-Seng Chua
    Yongdong Zhang
    Shouxun Lin
    [J]. Multimedia Tools and Applications, 2012, 58 : 663 - 685
  • [6] Exploring probabilistic localized video representation for human action recognition
    Song, Yan
    Tang, Sheng
    Zheng, Yan-Tao
    Chua, Tat-Seng
    Zhang, Yongdong
    Lin, Shouxun
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 58 (03) : 663 - 685
  • [7] A component-based video content representation for action recognition
    Adeli, Vida
    Fazl-Ersi, Ehsan
    Harati, Ahad
    [J]. IMAGE AND VISION COMPUTING, 2019, 90
  • [8] A line based pose representation for human action recognition
    Baysal, Sermetcan
    Duygulu, Pinar
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (05) : 458 - 471
  • [9] A Grid-based Representation for Human Action Recognition
    Lamghari, Soufiane
    Bilodeau, Guillaume-Alexandre
    Saunier, Nicolas
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10500 - 10507
  • [10] Learning hierarchical video representation for action recognition
    Li Q.
    Qiu Z.
    Yao T.
    Mei T.
    Rui Y.
    Luo J.
    [J]. International Journal of Multimedia Information Retrieval, 2017, 6 (1) : 85 - 98