Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network

被引:28
|
作者
Wang, Le [1 ]
Zang, Jinliang [1 ]
Zhang, Qilin [2 ]
Niu, Zhenxing [3 ]
Hua, Gang [4 ]
Zheng, Nanning [1 ]
机构
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Shaanxi, Peoples R China
[2] HERE Technol, Chicago, IL 60606 USA
[3] Alibaba Grp, Hangzhou 311121, Zhejiang, Peoples R China
[4] Microsoft Res, Redmond, WA 98052 USA
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
action recognition; attention model; convolutional neural netwoks; video-level prediction; temporal weighting;
D O I
10.3390/s18071979
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs). However, effective and efficient methods for incorporation of temporal information into CNNs are still being actively explored in the recent literature. Motivated by the popular recurrent attention models in the research area of natural language processing, we propose the Attention-aware Temporal Weighted CNN (ATW CNN) for action recognition in videos, which embeds a visual attention model into a temporal weighted multi-stream CNN. This attention model is simply implemented as temporal weighting yet it effectively boosts the recognition performance of video representations. Besides, each stream in the proposed ATW CNN framework is capable of end-to-end training, with both network parameters and temporal weights optimized by stochastic gradient descent (SGD) with back-propagation. Our experimental results on the UCF-101 and HMDB-51 datasets showed that the proposed attention mechanism contributes substantially to the performance gains with the more discriminative snippets by focusing on more relevant video segments.
引用
下载
收藏
页数:18
相关论文
共 50 条
  • [41] Topology-Aware Convolutional Neural Network for Efficient Skeleton-Based Action Recognition
    Xu, Kailin
    Ye, Fanfan
    Zhong, Qiaoyong
    Xie, Di
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2866 - 2874
  • [42] Attention-aware invertible hashing network with skip connections
    Li, Shanshan
    Cai, Qiang
    Li, Zhuangzi
    Li, Haisheng
    Zhang, Naiguang
    Zhang, Xiaoyu
    PATTERN RECOGNITION LETTERS, 2020, 138 : 556 - 562
  • [43] HANet: Hybrid Attention-aware Network for Crowd Counting
    Su, Xinxing
    Yuan, Yuchen
    Su, Xiangbo
    Zou, Zhikang
    Wen, Shilei
    Zhou, Pan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7707 - 7714
  • [44] Spatiotemporal image fusion using multiscale attention-aware two-stream convolutional neural networks
    Chen, Yuehong
    Ge, Yong
    SCIENCE OF REMOTE SENSING, 2022, 6
  • [45] A Multi-Stream Attention-Aware Convolutional Neural Network: Monitoring of Sand and Dust Storms from Ordinary Urban Surveillance Cameras
    Wang, Xing
    Yang, Zhengwei
    Feng, Huihui
    Zhao, Jiuwei
    Shi, Shuaiyi
    Cheng, Lu
    REMOTE SENSING, 2023, 15 (21)
  • [46] Attention-Aware Age-Agnostic Visual Place Recognition
    Wang, Ziqi
    Li, Jiahui
    Khademi, Seyran
    van Gemert, Jan
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1437 - 1446
  • [47] Temporal attention-aware evidential recurrent network for trustworthy prediction of Alzheimer's disease progression
    Zhang, Chenran
    Bao, Qingsen
    Zhang, Feng
    Li, Ping
    Chen, Lei
    INTELLIGENT DATA ANALYSIS, 2024, 28 (03) : 751 - 768
  • [48] ARFace: Attention-Aware and Regularization for Face Recognition With Reinforcement Learning
    Zhang L.
    Sun L.
    Yu L.
    Dong X.
    Chen J.
    Cai W.
    Wang C.
    Ning X.
    IEEE Transactions on Biometrics, Behavior, and Identity Science, 2022, 4 (01): : 30 - 42
  • [49] Skeleton Based Action Recognition with Convolutional Neural Network
    Du, Yong
    Fu, Yun
    Wang, Liang
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 579 - 583
  • [50] Deep Convolutional Network with Pixel-Aware Attention for Smoke Recognition
    Guangtao Cheng
    Xue Chen
    Jiachang Gong
    Fire Technology, 2022, 58 : 1839 - 1862