Attention-Based Temporal Weighted Convolutional Neural Network for Action Recognition

被引:51
|
作者
Zang, Jinliang [1 ]
Wang, Le [1 ]
Liu, Ziyi [1 ]
Zhang, Qilin [2 ]
Niu, Zhenxing
Hua, Gang [3 ]
Zheng, Nanning [1 ]
机构
[1] Xi An Jiao Tong Univ, Xian 710049, Shaanxi, Peoples R China
[2] HERE Technol, Chicago, IL 60606 USA
[3] Microsoft Res, Redmond, WA 98052 USA
基金
中国博士后科学基金;
关键词
Action recognition; Attention model; Convolutional neural networks; Video-level prediction; Temporal weighting;
D O I
10.1007/978-3-319-92007-8_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs). However, effective and efficient methods for incorporation of temporal information into CNNs are still being actively explored in the recent literature. Motivated by the popular recurrent attention models in the research area of natural language processing, we propose the Attention-based Temporal Weighted CNN (ATW), which embeds a visual attention model into a temporal weighted multi-stream CNN. This attention model is simply implemented as temporal weighting yet it effectively boosts the recognition performance of video representations. Besides, each stream in the proposed ATW frame- work is capable of end-to-end training, with both network parameters and temporal weights optimized by stochastic gradient descent (SGD) with back-propagation. Our experiments show that the proposed attention mechanism contributes substantially to the performance gains with the more discriminative snippets by focusing on more relevant video segments.
引用
下载
收藏
页码:97 / 108
页数:12
相关论文
共 50 条
  • [31] Landslide Risk Prediction Model Using an Attention-Based Temporal Convolutional Network Connected to a Recurrent Neural Network
    Di Zhang
    Yang, Jiacheng
    Li, Fenglin
    Han, Shuai
    Qin, Lei
    Li, Qing
    IEEE ACCESS, 2022, 10 : 37635 - 37645
  • [32] Attention-Based Temporal Encoding Network with Background-Independent Motion Mask for Action Recognition
    Weng, Zhengkui
    Jin, Zhipeng
    Chen, Shuangxi
    Shen, Quanquan
    Ren, Xiangyang
    Li, Wuzhao
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [33] A fuzzy convolutional attention-based GRU network for human activity recognition
    Khodabandelou, Ghazaleh
    Moon, Huiseok
    Amirat, Yacine
    Mohammed, Samer
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 118
  • [34] A lightweight temporal attention-based convolution neural network for driver's activity recognition in edge
    Yang, Lichao
    Du, Weixiang
    Zhao, Yifan
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
  • [35] Attention-based deep neural network for driver behavior recognition
    Xiao, Weichu
    Liu, Hongli
    Ma, Ziji
    Chen, Weihong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 132 : 152 - 161
  • [36] Attention-based 3D convolutional recurrent neural network model for multimodal emotion recognition
    Du, Yiming
    Li, Penghai
    Cheng, Longlong
    Zhang, Xuanwei
    Li, Mingji
    Li, Fengzhou
    FRONTIERS IN NEUROSCIENCE, 2024, 17
  • [37] Weighted-Dependency with Attention-Based Graph Convolutional Network for Relation Extraction
    Dong, Yihao
    Xu, Xiaolong
    NEURAL PROCESSING LETTERS, 2023, 55 (09) : 12121 - 12142
  • [38] Weighted-Dependency with Attention-Based Graph Convolutional Network for Relation Extraction
    Yihao Dong
    Xiaolong Xu
    Neural Processing Letters, 2023, 55 (9) : 12121 - 12142
  • [39] Attention-based Convolutional Neural Network for Computer Vision Color Constancy
    Koscevic, Karlo
    Subasic, Marko
    Loncaric, Sven
    PROCEEDINGS OF THE 2019 11TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2019), 2019, : 372 - 377
  • [40] Attention-Based Convolutional Recurrent Neural Network for Traffic Index Prediction
    Liu, Xiaolei
    Duan, Zhengyu
    Luo, Songwen
    Zhao, Yixin
    CICTP 2023: INNOVATION-EMPOWERED TECHNOLOGY FOR SUSTAINABLE, INTELLIGENT, DECARBONIZED, AND CONNECTED TRANSPORTATION, 2023, : 1140 - 1149