Adversarial Attention Networks for Early Action Recognition

Cited by: 0
Authors
Zhang, Hong-Bo [1]
Pan, Wei-Xiang [1]
Du, Ji-Xiang [2]
Lei, Qing [3,4]
Chen, Yan [2]
Liu, Jing-Hua [3,4]
Affiliations
[1] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen 361000, Peoples R China
[2] Huaqiao Univ, Fujian Key Lab Big Data Intelligence & Secur, Xiamen 361000, Peoples R China
[3] Huaqiao Univ, Xiamen Key Lab Comp Vis & Pattern Recognit, Xiamen 361000, Peoples R China
[4] Huaqiao Univ, Fujian Prov Univ, Key Lab Comp Vis & Machine Learning, Xiamen 361000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Early action recognition; adversarial attention network; cross attention generator; self attention discriminator; feature fusion module
DOI
10.1109/TETCI.2024.3437240
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Early action recognition aims to infer the ongoing action from a partially observed video, which is challenging because little information is available in the initial stages. To tackle this challenge, we introduce an adversarial attention network based on generative adversarial networks. The network leverages the characteristics of both the generator and the discriminator to generate unobserved action information from partial video input. The proposed method comprises a cross attention generator, a self attention discriminator, and a feature fusion module. The cross attention generator captures temporal relationships in the input action sequence and generates discriminative unobserved action information. The self attention discriminator applies global attention to the input sequence, capturing global context so that it can accurately evaluate the consistency of the unobserved features produced by the cross attention generator. Finally, the feature fusion module helps the model obtain richer and more comprehensive feature representations. The proposed method is evaluated on the HMDB51, UCF101, and Something-Something v2 datasets. Experimental results show that it outperforms existing methods across different observation ratios, and detailed ablation studies confirm the effectiveness of each component.
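The data flow described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the query tokens for unobserved segments, the feature sizes, the absence of learned projections and adversarial training, and the concatenation-based fusion are all hypothetical choices made to keep the example small.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    scale = math.sqrt(len(keys[0]))
    out = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[d] for w, v in zip(weights, values))
                    for d in range(len(values[0]))])
    return out


def mean_pool(seq):
    """Average a sequence of vectors into a single vector."""
    return [sum(col) / len(seq) for col in zip(*seq)]


random.seed(0)
dim, n_observed, n_unobserved = 4, 3, 2  # assumed toy sizes

# Features of the observed part of the video (random stand-ins)
observed = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_observed)]
# Hypothetical query tokens standing in for the unobserved segments
queries = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_unobserved)]

# Cross attention: unobserved-segment queries attend to observed features
generated = attention(queries, observed, observed)

# Self attention: each position of the full sequence attends to all positions,
# giving the global context a discriminator could score for consistency
full = observed + generated
context = attention(full, full, full)

# Fusion (assumed here to be concatenation of pooled features)
fused = mean_pool(observed) + mean_pool(generated)
```

Because no learned projections are used, each generated feature here is simply a weighted average of the observed features; a trained generator would add projection layers and be optimized against the discriminator.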
Pages: 14
Related papers
  • [1] Early Action Prediction With Generative Adversarial Networks
    Wang, Dong
    Yuan, Yuan
    Wang, Qi
    IEEE ACCESS, 2019, 7 : 35795 - 35804
  • [2] Nesting spatiotemporal attention networks for action recognition
    Li, Jiapeng
    Wei, Ping
    Zheng, Nanning
    NEUROCOMPUTING, 2021, 459 : 338 - 348
  • [3] Imperceptible Adversarial Attack With Multigranular Spatiotemporal Attention for Video Action Recognition
    Wu, Guoming
    Xu, Yangfan
    Li, Jun
    Shi, Zhiping
    Liu, Xianglong
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (20) : 17785 - 17796
  • [4] Adversarial Cross-Domain Action Recognition with Co-Attention
    Pan, Boxiao
    Cao, Zhangjie
    Adeli, Ehsan
    Niebles, Juan Carlos
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11815 - 11822
  • [5] Multibranch Attention Networks for Action Recognition in Still Images
    Yan, Shiyang
    Smith, Jeremy S.
    Lu, Wenjin
    Zhang, Bailing
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2018, 10 (04) : 1116 - 1125
  • [6] Cascade multi-head attention networks for action recognition
    Wang, Jiaze
    Peng, Xiaojiang
    Qiao, Yu
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 192
  • [7] Spatio-Temporal Attention Networks for Action Recognition and Detection
    Li, Jun
    Liu, Xianglong
    Zhang, Wenxuan
    Zhang, Mingyuan
    Song, Jingkuan
    Sebe, Nicu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) : 2990 - 3001
  • [8] Efficient dual attention SlowFast networks for video action recognition
    Wei, Dafeng
    Tian, Ye
    Wei, Liqing
    Zhong, Hong
    Chen, Siqian
    Pu, Shiliang
    Lu, Hongtao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 222
  • [9] Memory Attention Networks for Skeleton-based Action Recognition
    Xie, Chunyu
    Li, Ce
    Zhang, Baochang
    Chen, Chen
    Han, Jungong
    Liu, Jianzhuang
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1639 - 1645
  • [10] Hierarchical Multi-scale Attention Networks for action recognition
    Yan, Shiyang
    Smith, Jeremy S.
    Lu, Wenjin
    Zhang, Bailing
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 61 : 73 - 84