Augmenting bag-of-words: a robust contextual representation of spatiotemporal interest points for action recognition

被引：16

作者：

Li, Yang ^{[1
]}

Ye, Junyong ^{[1
]}

Wang, Tongqing ^{[1
]}

Huang, Shijian ^{[1
]}

机构：

[1] Chongqing Univ, Minist Educ, Key Lab Optoelect Technol & Syst, Chongqing 630044, Peoples R China

来源：

VISUAL COMPUTER | 2015年 / 31卷 / 10期

关键词：

Action recognition; Contextual features; Cumulative probability histogram; Sparse coding; APPEARANCE; FEATURES;

D O I：

10.1007/s00371-014-1020-8

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Although traditional bag-of-words model, together with local spatiotemporal features, has shown promising results for human action recognition, it ignores all structural information of features, which carries important information of motion structures in videos. Recent methods usually characterize the relationship of quantized spatiotemporal features to overcome this drawback. However, the propagation of quantization error leads to an unreliable representation. To alleviate the propagation of quantization error, we present a coding method, which considers not only the spatial similarity but also the reconstruction ability of visual words after giving a probabilistic interpretation of coding coefficients. Based on our coding method, a new type of feature called cumulative probability histogram is proposed to robustly characterize contextual structural information around interest points, which are extracted from multi-layered contexts and assumed to be complementary to local spatiotemporal features. The proposed method is verified on four benchmark datasets. Experiment results show that our method can achieve better performance than previous methods in action recognition.

引用

页码：1383 / 1394

页数：12

共 50 条

[1] Augmenting bag-of-words: a robust contextual representation of spatiotemporal interest points for action recognition
Yang Li
Junyong Ye
Tongqing Wang
Shijian Huang
The Visual Computer, 2015, 31 : 1383 - 1394
[2] Contextual Bag-of-Words for Robust Visual Tracking
Zeng, Fanxiang
Ji, Yuefeng
Levine, Martin D.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1433 - 1447
[3] Scale Coding Bag-of-Words for Action Recognition
Khan, Fahad Shahbaz
van de Weijer, Joost
Bagdanov, Andrew D.
Felsberg, Michael
2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1514 - 1519
[4] Contextual Bag-of-Words for Visual Categorization
Li, Teng
Mei, Tao
Kweon, In-So
Hua, Xian-Sheng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (04) : 381 - 392
[5] A novel hierarchical Bag-of-Words model for compact action representation
Sun, Qianru
Liu, Hong
Ma, Liqian
Zhang, Tianwei
NEUROCOMPUTING, 2016, 174 : 722 - 732
[6] A bag-of-words equivalent recurrent neural network for action recognition
Richard, Alexander
Gall, Juergen
COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 156 : 79 - 91
[7] Bag-of-words Modelling for Speech Recognition
Ziolko, Bartosz
Manandhar, Suresh
Wilson, Richard C.
INTERNATIONAL CONFERENCE ON FUTURE COMPUTER AND COMMUNICATIONS, PROCEEDINGS, 2009, : 646 - +
[8] Fuzzy Bag-of-Words Model for Document Representation
Zhao, Rui
Mao, Kezhi
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (02) : 794 - 804
[9] Augmenting Bag-of-Words: Data-Driven Discovery of Temporal and Structural Information for Activity Recognition
Bettadapura, Vinay
Schindler, Grant
Ploetz, Thomas
Essa, Irfan
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 2619 - 2626
[10] Vehicle Logo Recognition Based on Bag-of-Words
Yu, Shuyuan
Zheng, Shibao
Yang, Hua
Liang, Longfei
2013 10TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2013), 2013, : 353 - 358

← 1 2 3 4 5 →