Adversarial Attention Networks for Early Action Recognition

Cited by: 0
Authors
Zhang, Hong-Bo [1]
Pan, Wei-Xiang [1]
Du, Ji-Xiang [2]
Lei, Qing [3, 4]
Chen, Yan [2]
Liu, Jing-Hua [3, 4]
Affiliations
[1] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen 361000, Peoples R China
[2] Huaqiao Univ, Fujian Key Lab Big Data Intelligence & Secur, Xiamen 361000, Peoples R China
[3] Huaqiao Univ, Xiamen Key Lab Comp Vis & Pattern Recognit, Xiamen 361000, Peoples R China
[4] Huaqiao Univ, Fujian Prov Univ, Key Lab Comp Vis & Machine Learning, Xiamen 361000, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Early action recognition; adversarial attention network; cross attention generator; self attention discriminator; feature fusion module;
DOI
10.1109/TETCI.2024.3437240
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Early action recognition endeavors to deduce the ongoing action by observing a partial video, which is a formidable challenge because of the limited information available in the initial stages. To tackle this challenge, we introduce an innovative adversarial attention network based on generative adversarial networks. This network leverages the characteristics of both the generator and the discriminator to generate unobserved action information from partial video input. The proposed method comprises a cross attention generator, a self attention discriminator, and a feature fusion module. The cross attention generator captures temporal relationships in input action sequences, generating discriminative unobserved action information. The self attention discriminator adds global attention to the input sequence, capturing global context information to accurately evaluate the consistency of the unobserved features generated by the cross attention generator. Finally, the feature fusion module helps the model obtain richer and more comprehensive feature representations. The proposed method is evaluated through experiments on the HMDB51, UCF101, and Something-Something v2 datasets. Experimental results demonstrate that the proposed approach outperforms existing methods across different observation ratios. Detailed ablation studies confirm the effectiveness of each component in the proposed method.
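The three components named in the abstract can be illustrated with a toy, pure-Python sketch. This is not the authors' implementation: the abstract gives no layer definitions, losses, or training procedure, so everything below is an assumption made for illustration. In particular, the function names (generate_unobserved, discriminator_score, fuse), the use of plain scaled dot-product attention without learned projections, the random "future query" vectors, the sigmoid scoring head, and fusion by concatenating mean-pooled features are all hypothetical choices.

```python
import math
import random

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    # Scaled dot-product attention over lists of equal-length vectors.
    scale = math.sqrt(len(keys[0]))
    out = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[d] for w, v in zip(weights, values))
                    for d in range(len(values[0]))])
    return out

def mean_pool(seq):
    # Average a sequence of vectors into one vector.
    return [sum(col) / len(seq) for col in zip(*seq)]

def generate_unobserved(observed, future_queries):
    # Cross attention (assumed form): one query per unobserved segment
    # attends over the observed segment features.
    return attention(future_queries, observed, observed)

def discriminator_score(sequence, w):
    # Self attention (assumed form): every segment attends to the whole
    # sequence; the pooled context is mapped to a score in (0, 1).
    context = attention(sequence, sequence, sequence)
    logit = sum(a * b for a, b in zip(mean_pool(context), w))
    return 1.0 / (1.0 + math.exp(-logit))

def fuse(observed, generated):
    # Feature fusion (assumed form): concatenate the pooled observed
    # and pooled generated features.
    return mean_pool(observed) + mean_pool(generated)

random.seed(0)
dim = 4
# Hypothetical features: 3 observed segments, 2 unobserved segments.
observed = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(3)]
future_queries = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(2)]
w = [random.uniform(-1, 1) for _ in range(dim)]

generated = generate_unobserved(observed, future_queries)
score = discriminator_score(observed + generated, w)
fused = fuse(observed, generated)
```

In this sketch each generated vector is a convex combination of the observed features, because the attention weights are non-negative and sum to one. A trained generator would add learned projections, and the discriminator score would drive an adversarial loss; neither is shown here.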
Pages: 14
Related papers
50 in total
  • [31] An Algorithm for the Recognition of Motion-Blurred QR Codes Based on Generative Adversarial Networks and Attention Mechanisms
    Hao Dong
    Haibin Liu
    Mingfei Li
    Fujie Ren
    Feng Xie
    International Journal of Computational Intelligence Systems, 17
  • [32] Recognition in early visual attention
    Martinez, A.
    PERCEPTION, 1999, 28 : 124 - 125
  • [33] Attention, biological motion, and action recognition
    Thompson, James
    Parasuraman, Raja
    NEUROIMAGE, 2012, 59 (01) : 4 - 13
  • [34] Residual attention unit for action recognition
    Liao, Zhongke
    Hu, Haifeng
    Zhang, Junxuan
    Yin, Chang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2019, 189
  • [35] Attention with structure regularization for action recognition
    Quan, Yuhui
    Chen, Yixin
    Xu, Ruotao
    Ji, Hui
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2019, 187
  • [36] Generative Adversarial Networks in Image Generation and Recognition
    Popuri, Anoushka
    Miller, John
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1294 - 1297
  • [37] Continual Activity Recognition with Generative Adversarial Networks
    Ye, Juan
    Nakwijit, Pakawat
    Schiemer, Martin
    Jha, Saurav
    Zambonelli, Franco
    ACM Transactions on Internet of Things, 2021, 2 (02):
  • [38] Supervised Spatial Transformer Networks for Attention Learning in Fine-grained Action Recognition
    Liu, Dichao
    Wang, Yu
    Kato, Jien
    VISAPP: PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 4, 2019, : 311 - 318
  • [39] View transform graph attention recurrent networks for skeleton-based action recognition
    Qingqing Huang
    Fengyu Zhou
    Runze Qin
    Yang Zhao
    Signal, Image and Video Processing, 2021, 15 : 599 - 606
  • [40] View transform graph attention recurrent networks for skeleton-based action recognition
    Huang, Qingqing
    Zhou, Fengyu
    Qin, Runze
    Zhao, Yang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (03) : 599 - 606