ATSN: Attention-Based Temporal Segment Network for Action Recognition

被引:2
|
作者
Sun, Yun-lei [1 ]
Zhang, Da-lin [2 ]
机构
[1] China Univ Petr East China, Coll Comp & Commun Engn, Qingdao 266580, Shandong, Peoples R China
[2] Beijing Jiaotong Univ, Natl Res Ctr Railway Safety Assessment, Beijing 100044, Peoples R China
来源
TEHNICKI VJESNIK-TECHNICAL GAZETTE | 2019年 / 26卷 / 06期
关键词
action recognition; attention; Temporal Segment Network;
D O I
10.17559/TV-20190506101459
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In human action recognition, a reasonable video representation is still a problem to be solved. For humans, it is easy to focus on the prominent areas of the image in the video, focusing on the part of interest. Inspired by this, we proposed a deep Temporal Segment Network based on visual attention-ATSN. By lightly modifying the model structure, ATSN integrates the human attention mechanism into the Temporal Segment Networks, can effectively add a weight to the video representation features, pays attention to the beneficial regions in the features, and achieves more accurate action recognition. We conducted the Oilfield-7 dataset for human actions on the oilfield. The experimental results on HMDB51 and Oilfield-7 show that the ATSN had achieved excellent performance.
引用
收藏
页码:1664 / 1669
页数:6
相关论文
共 50 条
  • [1] Attention-Based Temporal Weighted Convolutional Neural Network for Action Recognition
    Zang, Jinliang
    Wang, Le
    Liu, Ziyi
    Zhang, Qilin
    Niu, Zhenxing
    Hua, Gang
    Zheng, Nanning
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2018, 2018, 519 : 97 - 108
  • [2] Attention-based spatial-temporal hierarchical ConvLSTM network for action recognition in videos
    Xue, Fei
    Ji, Hongbing
    Zhang, Wenbo
    Cao, Yi
    [J]. IET COMPUTER VISION, 2019, 13 (08) : 708 - 718
  • [3] Recurrent Temporal Sparse Autoencoder for Attention-based Action Recognition
    Xin, Miao
    Zhang, Hong
    Sun, Mingui
    Yuan, Ding
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 456 - 463
  • [4] Attention-Based Temporal Encoding Network with Background-Independent Motion Mask for Action Recognition
    Weng, Zhengkui
    Jin, Zhipeng
    Chen, Shuangxi
    Shen, Quanquan
    Ren, Xiangyang
    Li, Wuzhao
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [5] An attention-based bidirectional GRU network for temporal action proposals generation
    Xiaoxin Liao
    Jingyi Yuan
    Zemin Cai
    Jian-huang Lai
    [J]. The Journal of Supercomputing, 2023, 79 : 8322 - 8339
  • [6] An attention-based bidirectional GRU network for temporal action proposals generation
    Liao, Xiaoxin
    Yuan, Jingyi
    Cai, Zemin
    Lai, Jian-huang
    [J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (08): : 8322 - 8339
  • [7] Action Recognition of Temporal Segment Network Based on Feature Fusion
    Li H.
    Ding Y.
    Li C.
    Zhang S.
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (01): : 145 - 158
  • [8] Temporal Segment Connection Network for Action Recognition
    Li, Qian
    Yang, Wenzhu
    Chen, Xiangyang
    Yuan, Tongtong
    Wang, Yuxia
    [J]. IEEE ACCESS, 2020, 8 : 179118 - 179127
  • [9] STA-TSN: Spatial-Temporal Attention Temporal Segment Network for action recognition in video
    Yang, Guoan
    Yang, Yong
    Lu, Zhengzhi
    Yang, Junjie
    Liu, Deyang
    Zhou, Chuanbo
    Fan, Zien
    [J]. PLOS ONE, 2022, 17 (03):
  • [10] A temporal-spatial attention-based action recognition method for intelligent fault diagnosis
    Luo, Wentao
    Zhang, Jianfu
    Feng, Pingfa
    Yu, Dingwen
    Wu, Zhijun
    [J]. ISA TRANSACTIONS, 2022, 125 : 459 - 473