A bag-of-words equivalent recurrent neural network for action recognition

被引:33
|
作者
Richard, Alexander [1 ]
Gall, Juergen [1 ]
机构
[1] Univ Bonn, Romerstrasse 164, D-53177 Bonn, Germany
基金
欧洲研究理事会;
关键词
Action recognition; Bag-of-words; Neural networks; DESCRIPTORS; CATEGORIES;
D O I
10.1016/j.cviu.2016.10.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The traditional bag-of-words approach has found a wide range of applications in computer vision. The standard pipeline consists of a generation of a visual vocabulary, a quantization of the features into histograms of visual words, and a classification step for which usually a support vector machine in combination with a non-linear kernel is used. Given large amounts of data, however, the model suffers from a lack of discriminative power. This applies particularly for action recognition, where the vast amount of video features needs to be subsampled for unsupervised visual vocabulary generation. Moreover, the kernel computation can be very expensive on large datasets. In this work, we propose a recurrent neural network that is equivalent to the traditional bag-of-words approach but enables for the application of discriminative training. The model further allows to incorporate the kernel computation into the neural network directly, solving the complexity issue and allowing to represent the complete classification system within a single network. We evaluate our method on four recent action recognition benchmarks and show that the conventional model as well as sparse coding methods are outperformed. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:79 / 91
页数:13
相关论文
共 50 条
  • [41] Understanding bag-of-words model: a statistical framework
    Yin Zhang
    Rong Jin
    Zhi-Hua Zhou
    International Journal of Machine Learning and Cybernetics, 2010, 1 : 43 - 52
  • [42] ECG Biometrics Using Bag-of-Words Models
    Ciocoiu, Iulian B.
    2015 INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS), 2015,
  • [43] Understanding bag-of-words model: a statistical framework
    Zhang, Yin
    Jin, Rong
    Zhou, Zhi-Hua
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2010, 1 (1-4) : 43 - 52
  • [44] A Bag-of-Words Speedometer for Single Camera SLAM
    Botterill, Tom
    Green, Richard
    Mills, Steven
    2009 24TH INTERNATIONAL CONFERENCE IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ 2009), 2009, : 91 - +
  • [45] Improving bag-of-words scheme for scene categorization
    Li, Qun
    Zhang, Hong-Gang
    Guo, Jun
    Bhanu, Bir
    An, Le
    Li, Q. (liqun@bupt.edu.cn), 1600, Beijing University of Posts and Telecommunications (19): : 166 - 171
  • [46] Graph-based bag-of-words for classification
    Silva, Fernanda B.
    Werneck, Rafael de O.
    Goldenstein, Siome
    Tabbone, Salvatore
    Torres, Ricardo da S.
    PATTERN RECOGNITION, 2018, 74 : 266 - 285
  • [47] Incorporating Temporal Context in Bag-of-Words Models
    Glaser, Tamar
    Zelnik-Manor, Lihi
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [48] Persistence Bag-of-Words for Topological Data Analysis
    Zielinski, Bartosz
    Lipinski, Michal
    Juda, Mateusz
    Zeppelzauer, Matthias
    Dlotko, Pawel
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4489 - 4495
  • [49] Fuzzy Bag-of-Words Model for Document Representation
    Zhao, Rui
    Mao, Kezhi
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (02) : 794 - 804
  • [50] A Bag-of-Words Model for Cellular Image Segmentation
    Cheng, Li
    Ye, Ning
    Yu, Weimiao
    Cheah, Andre
    ADVANCES IN BIO-IMAGING: FROM PHYSICS TO SIGNAL UNDERSTANDING ISSUES: STATE OF THE ART AND CHALLEGES, 2012, 120 : 209 - +