Attention-Based Temporal Weighted Convolutional Neural Network for Action Recognition

被引:51
|
作者
Zang, Jinliang [1 ]
Wang, Le [1 ]
Liu, Ziyi [1 ]
Zhang, Qilin [2 ]
Niu, Zhenxing
Hua, Gang [3 ]
Zheng, Nanning [1 ]
机构
[1] Xi An Jiao Tong Univ, Xian 710049, Shaanxi, Peoples R China
[2] HERE Technol, Chicago, IL 60606 USA
[3] Microsoft Res, Redmond, WA 98052 USA
基金
中国博士后科学基金;
关键词
Action recognition; Attention model; Convolutional neural networks; Video-level prediction; Temporal weighting;
D O I
10.1007/978-3-319-92007-8_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs). However, effective and efficient methods for incorporation of temporal information into CNNs are still being actively explored in the recent literature. Motivated by the popular recurrent attention models in the research area of natural language processing, we propose the Attention-based Temporal Weighted CNN (ATW), which embeds a visual attention model into a temporal weighted multi-stream CNN. This attention model is simply implemented as temporal weighting yet it effectively boosts the recognition performance of video representations. Besides, each stream in the proposed ATW frame- work is capable of end-to-end training, with both network parameters and temporal weights optimized by stochastic gradient descent (SGD) with back-propagation. Our experiments show that the proposed attention mechanism contributes substantially to the performance gains with the more discriminative snippets by focusing on more relevant video segments.
引用
收藏
页码:97 / 108
页数:12
相关论文
共 50 条
  • [1] Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network
    Wang, Le
    Zang, Jinliang
    Zhang, Qilin
    Niu, Zhenxing
    Hua, Gang
    Zheng, Nanning
    [J]. SENSORS, 2018, 18 (07)
  • [2] Attention-based convolutional neural network for deep face recognition
    Hefei Ling
    Jiyang Wu
    Junrui Huang
    Jiazhong Chen
    Ping Li
    [J]. Multimedia Tools and Applications, 2020, 79 : 5595 - 5616
  • [3] Attention-based convolutional neural network for deep face recognition
    Ling, Hefei
    Wu, Jiyang
    Huang, Junrui
    Chen, Jiazhong
    Li, Ping
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (9-10) : 5595 - 5616
  • [4] ATSN: Attention-Based Temporal Segment Network for Action Recognition
    Sun, Yun-lei
    Zhang, Da-lin
    [J]. TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2019, 26 (06): : 1664 - 1669
  • [5] Cascade Attention-based Spatial-temporal Convolutional Neural Network for Motion Image Posture Recognition
    Zhang, Shuqi
    [J]. Journal of Computers (Taiwan), 2022, 33 (01): : 21 - 30
  • [6] EEG emotion recognition using attention-based convolutional transformer neural network
    Gong, Linlin
    Li, Mingyang
    Zhang, Tao
    Chen, Wanzhong
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 84
  • [7] An attention-based convolutional neural network for recipe recommendation
    Jia, Nan
    Chen, Jie
    Wang, Rongzheng
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 201
  • [8] Attention-Based Convolutional Neural Network for Ingredients Identification
    Chen, Shi
    Li, Ruixue
    Wang, Chao
    Liang, Jiakai
    Yue, Keqiang
    Li, Wenjun
    Li, Yilin
    [J]. ENTROPY, 2023, 25 (02)
  • [9] Attention-Based Generative Graph Convolutional Network for Skeleton-Based Human Action Recognition
    Yang, Kai
    Ding, Xiaolu
    Chen, Wai
    [J]. ICVIP 2019: PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, 2019, : 1 - 6
  • [10] Attention-based generative graph convolutional network for skeleton-based human action recognition
    Yang, Kai
    Ding, Xiaolu
    Chen, Wai
    [J]. ACM International Conference Proceeding Series, 2019, : 1 - 6