A DISTRIBUTION BASED VIDEO REPRESENTATION FOR HUMAN ACTION RECOGNITION

被引:3
|
作者
Song, Yan [1 ,2 ]
Tang, Sheng [1 ]
Zheng, Yan-Tao [3 ]
Chua, Tat-Seng [4 ]
Zhang, Yongdong [1 ]
Lin, Shouxun [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Lab Adv Comp Res, Beijing, Peoples R China
[2] Chinese Acad Sci, Grad Sch, Beijing, Peoples R China
[3] Inst Infocomm Res, A STAR, Singapore, Singapore
[4] Natl Univ Singapore, Sch Comp, Singapore 117548, Singapore
关键词
human action recognition; probabilistic video representation; information-theoretic video matching;
D O I
10.1109/ICME.2010.5582550
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Most current research on human action recognition in videos uses the bag-of-words (BoW) representations based on vector quantization on local spatial temporal features, due to the simplicity and good performance of such representations. In contrast to the BoW schemes, this paper explores a localized, continuous and probabilistic video representation. Specifically, the proposed representation encodes the visual and motion information of an ensemble of local spatial temporal (ST) features of a video into a distribution estimated by a generative probabilistic model such as the Gaussian Mixture Model. Furthermore, this probabilistic video representation naturally gives rise to an information-theoretic distance metric of videos. This makes the representation readily applicable as input to most discriminative classifiers, such as the nearest neighbor schemes and the kernel methods. The experiments on two datasets, KTH and UCF sports, show that the proposed approach could deliver promising results.
引用
收藏
页码:772 / 777
页数:6
相关论文
共 50 条
  • [11] A Robust and Efficient Video Representation for Action Recognition
    Heng Wang
    Dan Oneata
    Jakob Verbeek
    Cordelia Schmid
    [J]. International Journal of Computer Vision, 2016, 119 : 219 - 238
  • [12] A Robust and Efficient Video Representation for Action Recognition
    Wang, Heng
    Oneata, Dan
    Verbeek, Jakob
    Schmid, Cordelia
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 119 (03) : 219 - 238
  • [13] Exploring Multimodal Video Representation for Action Recognition
    Wang, Cheng
    Yang, Haojin
    Meinel, Christoph
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 1924 - 1931
  • [14] Human Action Recognition in Video
    Singh, Dushyant Kumar
    [J]. ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2018, PT I, 2019, 955 : 54 - 66
  • [15] Human Behavior Feature Representation and Recognition Based on Depth Video
    He, Miao
    Song, Guangming
    Wei, Zhong
    [J]. JOURNAL OF WEB ENGINEERING, 2020, 19 (5-6): : 883 - 902
  • [16] Multimodal human action recognition based on spatio-temporal action representation recognition model
    Wu, Qianhan
    Huang, Qian
    Li, Xing
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (11) : 16409 - 16430
  • [17] Multimodal human action recognition based on spatio-temporal action representation recognition model
    Qianhan Wu
    Qian Huang
    Xing Li
    [J]. Multimedia Tools and Applications, 2023, 82 : 16409 - 16430
  • [18] Human Action Recognition Based on Discriminative Collaborative Representation Classifier
    Zan Baofeng
    Kong Jun
    Jiang Min
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (01)
  • [19] Human action recognition using trajectory-based representation
    Abdul-Azim, Haiam A.
    Hemayed, Elsayed E.
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2015, 16 (02) : 187 - 198
  • [20] Scale-based human motion representation for action recognition
    Pers, J
    Vuckovic, G
    Dezman, B
    Kovacic, S
    [J]. ISPA 2003: PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, PTS 1 AND 2, 2003, : 668 - 673