Augmenting bag-of-words: a robust contextual representation of spatiotemporal interest points for action recognition

被引:16
|
作者
Li, Yang [1 ]
Ye, Junyong [1 ]
Wang, Tongqing [1 ]
Huang, Shijian [1 ]
机构
[1] Chongqing Univ, Minist Educ, Key Lab Optoelect Technol & Syst, Chongqing 630044, Peoples R China
来源
VISUAL COMPUTER | 2015年 / 31卷 / 10期
关键词
Action recognition; Contextual features; Cumulative probability histogram; Sparse coding; APPEARANCE; FEATURES;
D O I
10.1007/s00371-014-1020-8
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Although traditional bag-of-words model, together with local spatiotemporal features, has shown promising results for human action recognition, it ignores all structural information of features, which carries important information of motion structures in videos. Recent methods usually characterize the relationship of quantized spatiotemporal features to overcome this drawback. However, the propagation of quantization error leads to an unreliable representation. To alleviate the propagation of quantization error, we present a coding method, which considers not only the spatial similarity but also the reconstruction ability of visual words after giving a probabilistic interpretation of coding coefficients. Based on our coding method, a new type of feature called cumulative probability histogram is proposed to robustly characterize contextual structural information around interest points, which are extracted from multi-layered contexts and assumed to be complementary to local spatiotemporal features. The proposed method is verified on four benchmark datasets. Experiment results show that our method can achieve better performance than previous methods in action recognition.
引用
收藏
页码:1383 / 1394
页数:12
相关论文
共 50 条
  • [1] Augmenting bag-of-words: a robust contextual representation of spatiotemporal interest points for action recognition
    Yang Li
    Junyong Ye
    Tongqing Wang
    Shijian Huang
    The Visual Computer, 2015, 31 : 1383 - 1394
  • [2] Contextual Bag-of-Words for Robust Visual Tracking
    Zeng, Fanxiang
    Ji, Yuefeng
    Levine, Martin D.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1433 - 1447
  • [3] Scale Coding Bag-of-Words for Action Recognition
    Khan, Fahad Shahbaz
    van de Weijer, Joost
    Bagdanov, Andrew D.
    Felsberg, Michael
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1514 - 1519
  • [4] Contextual Bag-of-Words for Visual Categorization
    Li, Teng
    Mei, Tao
    Kweon, In-So
    Hua, Xian-Sheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (04) : 381 - 392
  • [5] A novel hierarchical Bag-of-Words model for compact action representation
    Sun, Qianru
    Liu, Hong
    Ma, Liqian
    Zhang, Tianwei
    NEUROCOMPUTING, 2016, 174 : 722 - 732
  • [6] A bag-of-words equivalent recurrent neural network for action recognition
    Richard, Alexander
    Gall, Juergen
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 156 : 79 - 91
  • [7] Bag-of-words Modelling for Speech Recognition
    Ziolko, Bartosz
    Manandhar, Suresh
    Wilson, Richard C.
    INTERNATIONAL CONFERENCE ON FUTURE COMPUTER AND COMMUNICATIONS, PROCEEDINGS, 2009, : 646 - +
  • [8] Fuzzy Bag-of-Words Model for Document Representation
    Zhao, Rui
    Mao, Kezhi
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (02) : 794 - 804
  • [9] Augmenting Bag-of-Words: Data-Driven Discovery of Temporal and Structural Information for Activity Recognition
    Bettadapura, Vinay
    Schindler, Grant
    Ploetz, Thomas
    Essa, Irfan
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 2619 - 2626
  • [10] Vehicle Logo Recognition Based on Bag-of-Words
    Yu, Shuyuan
    Zheng, Shibao
    Yang, Hua
    Liang, Longfei
    2013 10TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2013), 2013, : 353 - 358