A bag-of-words equivalent recurrent neural network for action recognition

被引:33
|
作者
Richard, Alexander [1 ]
Gall, Juergen [1 ]
机构
[1] Univ Bonn, Romerstrasse 164, D-53177 Bonn, Germany
基金
欧洲研究理事会;
关键词
Action recognition; Bag-of-words; Neural networks; DESCRIPTORS; CATEGORIES;
D O I
10.1016/j.cviu.2016.10.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The traditional bag-of-words approach has found a wide range of applications in computer vision. The standard pipeline consists of a generation of a visual vocabulary, a quantization of the features into histograms of visual words, and a classification step for which usually a support vector machine in combination with a non-linear kernel is used. Given large amounts of data, however, the model suffers from a lack of discriminative power. This applies particularly for action recognition, where the vast amount of video features needs to be subsampled for unsupervised visual vocabulary generation. Moreover, the kernel computation can be very expensive on large datasets. In this work, we propose a recurrent neural network that is equivalent to the traditional bag-of-words approach but enables for the application of discriminative training. The model further allows to incorporate the kernel computation into the neural network directly, solving the complexity issue and allowing to represent the complete classification system within a single network. We evaluate our method on four recent action recognition benchmarks and show that the conventional model as well as sparse coding methods are outperformed. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:79 / 91
页数:13
相关论文
共 50 条
  • [21] A novel hierarchical Bag-of-Words model for compact action representation
    Sun, Qianru
    Liu, Hong
    Ma, Liqian
    Zhang, Tianwei
    NEUROCOMPUTING, 2016, 174 : 722 - 732
  • [22] Object Classification and Recognition using Bag-of-Words (BoW) Model
    Ali, Nursabillilah Mohd
    Jun, Soon Wei
    Karis, Mohd Safirin
    Ghazaly, Mariam Md
    Arai, Mohd Shahrieel Mohd
    2016 IEEE 12TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA), 2016, : 216 - 220
  • [23] Machine Learning for Hand Gesture Recognition Using Bag-of-words
    Benmoussa, Marouane
    Mahmoudi, Abdelhak
    2018 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV2018), 2018,
  • [24] Fusing Color and Shape for Bag-of-Words Based Object Recognition
    van de Weijer, Joost
    Khan, Fahad Shahbaz
    COMPUTATIONAL COLOR IMAGING, CCIW 2013, 2013, 7786 : 25 - 34
  • [25] Accelerating Bag-of-Words with SOM
    Chen, Jian-Hui
    Wang, Zuo-Ren
    Liu, Cheng-Lin
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT III, 2019, 11955 : 573 - 584
  • [26] From Universal Bag-of-Words to Adaptive Bag-of-Phrases for Mobile Scene Recognition
    Chen, Tao
    Yap, Kim-Hui
    Chau, Lap-Pui
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 825 - 828
  • [27] Network-Based Bag-of-Words Model for Text Classification
    Yan, Dongyang
    Li, Keping
    Gu, Shuang
    Yang, Liu
    IEEE ACCESS, 2020, 8 : 82641 - 82652
  • [28] A Hierarchical Bag-of-Words Model Based on Local Space-Time Features for Human Action Recognition
    Wu, Jiangwei
    Zhou, Daobing
    Xiao, Guoqiang
    2013 INTERNATIONAL CONFERENCE ON IT CONVERGENCE AND SECURITY (ICITCS), 2013,
  • [29] Scene Character Recognition via Bag-of-Words Model: A Comprehensive Study
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2018, 423 : 819 - 826
  • [30] Food Recognition: Can Deep Learning or Bag-of-Words Match Humans?
    Furtado, Pedro
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 2: BIOIMAGING, 2020, : 102 - 108