A bag-of-words equivalent recurrent neural network for action recognition

被引:33
|
作者
Richard, Alexander [1 ]
Gall, Juergen [1 ]
机构
[1] Univ Bonn, Romerstrasse 164, D-53177 Bonn, Germany
基金
欧洲研究理事会;
关键词
Action recognition; Bag-of-words; Neural networks; DESCRIPTORS; CATEGORIES;
D O I
10.1016/j.cviu.2016.10.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The traditional bag-of-words approach has found a wide range of applications in computer vision. The standard pipeline consists of a generation of a visual vocabulary, a quantization of the features into histograms of visual words, and a classification step for which usually a support vector machine in combination with a non-linear kernel is used. Given large amounts of data, however, the model suffers from a lack of discriminative power. This applies particularly for action recognition, where the vast amount of video features needs to be subsampled for unsupervised visual vocabulary generation. Moreover, the kernel computation can be very expensive on large datasets. In this work, we propose a recurrent neural network that is equivalent to the traditional bag-of-words approach but enables for the application of discriminative training. The model further allows to incorporate the kernel computation into the neural network directly, solving the complexity issue and allowing to represent the complete classification system within a single network. We evaluate our method on four recent action recognition benchmarks and show that the conventional model as well as sparse coding methods are outperformed. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:79 / 91
页数:13
相关论文
共 50 条
  • [31] Distinguish Polarity in Bag-of-Words Visualization
    Xie, Yusheng
    Chen, Zhengzhang
    Agrawal, Ankit
    Choudhary, Alok
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3344 - 3350
  • [32] Contextual Bag-of-Words for Visual Categorization
    Li, Teng
    Mei, Tao
    Kweon, In-So
    Hua, Xian-Sheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (04) : 381 - 392
  • [33] Bag-of-words with aggregated temporal pair-wise word co-occurrence for human action recognition
    Agusti, Pau
    Javier Traver, V.
    Pla, Filiberto
    PATTERN RECOGNITION LETTERS, 2014, 49 : 224 - 230
  • [34] Bag-of-Words Similarity in eXplainable AI
    Narteni, Sara
    Ferretti, Melissa
    Rampa, Vittorio
    Mongelli, Maurizio
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2023, 543 : 835 - 851
  • [35] Hand Posture Recognition and Tracking Based on Bag-of-Words for Human Robot Interaction
    Chuang, Yuelong
    Chen, Ling
    Zhao, Gangqiang
    Chen, Gencai
    2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011,
  • [36] Classification of Non-Conventional Ships Using a Neural Bag-Of-Words Mechanism
    Polap, Dawid
    Wlodarczyk-Sielicka, Marta
    SENSORS, 2020, 20 (06)
  • [37] A Deep Learning Model Based on Neural Bag-of-Words Attention for Sentiment Analysis
    Liao, Jing
    Yi, Zhixiang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 467 - 478
  • [38] Exponentially Decaying Bag-of-Words Input Features for Feed-Forward Neural Network in Statistical Machine Translation
    Peter, Jan-Thorsten
    Wang, Weiyue
    Ney, Hermann
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 293 - 298
  • [39] Temporal Spiking Recurrent Neural Network for Action Recognition
    Wang, Wei
    Hao, Siyuan
    Wei, Yunchao
    Xia, Shengtao
    Feng, Jiashi
    Sebe, Nicu
    IEEE ACCESS, 2019, 7 : 117165 - 117175
  • [40] Contextual Bag-of-Words for Robust Visual Tracking
    Zeng, Fanxiang
    Ji, Yuefeng
    Levine, Martin D.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1433 - 1447