Attention-Based Temporal Weighted Convolutional Neural Network for Action Recognition

被引:51
|
作者
Zang, Jinliang [1 ]
Wang, Le [1 ]
Liu, Ziyi [1 ]
Zhang, Qilin [2 ]
Niu, Zhenxing
Hua, Gang [3 ]
Zheng, Nanning [1 ]
机构
[1] Xi An Jiao Tong Univ, Xian 710049, Shaanxi, Peoples R China
[2] HERE Technol, Chicago, IL 60606 USA
[3] Microsoft Res, Redmond, WA 98052 USA
基金
中国博士后科学基金;
关键词
Action recognition; Attention model; Convolutional neural networks; Video-level prediction; Temporal weighting;
D O I
10.1007/978-3-319-92007-8_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs). However, effective and efficient methods for incorporation of temporal information into CNNs are still being actively explored in the recent literature. Motivated by the popular recurrent attention models in the research area of natural language processing, we propose the Attention-based Temporal Weighted CNN (ATW), which embeds a visual attention model into a temporal weighted multi-stream CNN. This attention model is simply implemented as temporal weighting yet it effectively boosts the recognition performance of video representations. Besides, each stream in the proposed ATW frame- work is capable of end-to-end training, with both network parameters and temporal weights optimized by stochastic gradient descent (SGD) with back-propagation. Our experiments show that the proposed attention mechanism contributes substantially to the performance gains with the more discriminative snippets by focusing on more relevant video segments.
引用
下载
收藏
页码:97 / 108
页数:12
相关论文
共 50 条
  • [21] Attention-based Convolutional Neural Network for ASV Spoofing Detection
    Ling, Hefei
    Huang, Leichao
    Huang, Junrui
    Zhang, Baiyan
    Li, Ping
    INTERSPEECH 2021, 2021, : 4289 - 4293
  • [22] Attention-based convolutional neural network for Bangla sentiment analysis
    Sharmin, Sadia
    Chakma, Danial
    AI and Society, 2021, 36 (01): : 381 - 396
  • [23] Recurrent Temporal Sparse Autoencoder for Attention-based Action Recognition
    Xin, Miao
    Zhang, Hong
    Sun, Mingui
    Yuan, Ding
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 456 - 463
  • [24] Attention-Based Convolutional Neural Network and Bidirectional Gated Recurrent Unit for Human Activity Recognition
    Tao, Shuai
    Zhao, Zhiqiang
    Qin, Jing
    Ji, Changqing
    Wang, Zumin
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1128 - 1134
  • [25] Attention-Based Convolutional Neural Network for Weakly Labeled Human Activities' Recognition With Wearable Sensors
    Wang, Kun
    He, Jun
    Zhang, Lei
    IEEE SENSORS JOURNAL, 2019, 19 (17) : 7598 - 7604
  • [26] Discriminative Attention-based Convolutional Neural Network for 3D Facial Expression Recognition
    Zhu, Kangkang
    Du, Zhengyin
    Li, Weixin
    Huang, Di
    Wang, Yunhong
    Chen, Liming
    2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019, : 590 - 597
  • [27] Spatial-temporal pyramid based Convolutional Neural Network for action recognition
    Zheng, Zhenxing
    An, Gaoyun
    Wu, Dapeng
    Ruan, Qiuqi
    NEUROCOMPUTING, 2019, 358 : 446 - 455
  • [28] Temporal Pyramid Pooling-Based Convolutional Neural Network for Action Recognition
    Wang, Peng
    Cao, Yuanzhouhan
    Shen, Chunhua
    Liu, Lingqiao
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (12) : 2613 - 2622
  • [29] Spatio-Temporal Self-Attention Weighted VLAD Neural Network for Action Recognition
    Cheng, Shilei
    Xie, Mei
    Ma, Zheng
    Li, Siqi
    Gu, Song
    Yang, Feng
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (01) : 220 - 224
  • [30] An Attention-Based Convolutional Recurrent Neural Networks for Scene Text Recognition
    Alshawi, Adil Abdullah Abdulhussein
    Tanha, Jafar
    Balafar, Mohammad Ali
    IEEE ACCESS, 2024, 12 : 8123 - 8134