Exploring probabilistic localized video representation for human action recognition

Cited by: 0
Authors
Yan Song
Sheng Tang
Yan-Tao Zheng
Tat-Seng Chua
Yongdong Zhang
Shouxun Lin
Affiliations
[1] Chinese Academy of Sciences, Laboratory of Advanced Computing Research, Institute of Computing Technology
[2] Graduate University of the Chinese Academy of Sciences, School of Computing
[3] Institute for Infocomm Research
[4] A*STAR
[5] National University of Singapore
Keywords
Human action recognition; Probabilistic video representation; Information-theoretic video matching
DOI
Not available
Abstract
In recent years, bag-of-words (BoW) video representations have achieved promising results in human action recognition in videos. By vector-quantizing local spatio-temporal (ST) features, the BoW video representation brings simplicity and efficiency, but also limitations. First, the discretization of the feature space in BoW inevitably results in ambiguity and information loss in the video representation. Second, there is no universal codebook for the BoW representation: the codebook must be rebuilt whenever the video corpus changes. To tackle these issues, this paper explores a localized, continuous, and probabilistic video representation. Specifically, the proposed representation encodes the visual and motion information of an ensemble of local ST features of a video into a distribution estimated by a generative probabilistic model. Furthermore, the probabilistic video representation naturally gives rise to an information-theoretic distance metric between videos. This makes the representation readily applicable to most discriminative classifiers, such as nearest-neighbor schemes and kernel-based classifiers. Experiments on two datasets, KTH and UCF Sports, show that the proposed approach can deliver promising results.
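As a rough illustration of the idea in the abstract, the sketch below models each video's local features with a single diagonal-covariance Gaussian and compares videos with a symmetrized Kullback-Leibler divergence, followed by a nearest-neighbor decision. The abstract does not state which generative model or divergence the paper uses, so the Gaussian, the symmetrized KL, and all function names here (`fit_gaussian`, `kl_diag`, `symmetric_kl`, `nearest_neighbor_label`) are assumptions made for illustration, not the authors' method.

```python
import numpy as np


def fit_gaussian(features, eps=1e-6):
    """Fit a diagonal-covariance Gaussian to one video's local features.

    `features` is an (n_features, dim) array of local descriptors.
    The single-Gaussian model is an assumption made for this sketch.
    """
    features = np.asarray(features, dtype=float)
    mean = features.mean(axis=0)
    var = features.var(axis=0) + eps  # eps keeps every variance strictly positive
    return mean, var


def kl_diag(p, q):
    """Closed-form KL(p || q) for two diagonal Gaussians given as (mean, var)."""
    (m0, v0), (m1, v1) = p, q
    return 0.5 * np.sum(np.log(v1 / v0) + (v0 + (m0 - m1) ** 2) / v1 - 1.0)


def symmetric_kl(p, q):
    """Symmetrized KL divergence, used here as a video-to-video dissimilarity."""
    return kl_diag(p, q) + kl_diag(q, p)


def nearest_neighbor_label(query, models, labels):
    """Return the label of the training video closest to `query`."""
    distances = [symmetric_kl(query, m) for m in models]
    return labels[int(np.argmin(distances))]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic stand-ins for local descriptors of two training videos.
    walk = fit_gaussian(rng.normal(0.0, 1.0, size=(500, 4)))
    run = fit_gaussian(rng.normal(5.0, 1.0, size=(500, 4)))
    query = fit_gaussian(rng.normal(0.1, 1.0, size=(500, 4)))
    print(nearest_neighbor_label(query, [walk, run], ["walk", "run"]))
```

Under the same assumptions, a dissimilarity `d` of this kind could also be passed to a kernel-based classifier through a transformation such as `exp(-gamma * d)`, where `gamma` is a hypothetical tuning parameter.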
Pages: 663-685
Page count: 22