Unsupervised video-based action recognition using two-stream generative adversarial network

被引:0
|
作者
Wei Lin
Huanqiang Zeng
Jianqing Zhu
Chih-Hsien Hsia
Junhui Hou
Kai-Kuang Ma
机构
[1] Huaqiao University,School of Engineering and School of Information Science and Engineering
[2] Huaqiao University,School of Engineering
[3] Ilan University,Department of Computer Science and Information Engineering
[4] The City University of Hong Kong,Department of Computer Science
[5] Nanyang Technological University,School of Electrical and Electronic Engineering
来源
关键词
Action recognition; Two-stream generative adversarial network; Unsupervised learning;
D O I
暂无
中图分类号
学科分类号
摘要
Video-based action recognition faces many challenges, such as complex and varied dynamic motion, spatio-temporal similar action factors, and manual labeling of archived videos over large datasets. How to extract discriminative spatio-temporal action features in videos with resisting the effect of similar factors in an unsupervised manner is pivotal. For that, this paper proposes an unsupervised video-based action recognition method, called two-stream generative adversarial network (TS-GAN), which comprehensively learns the static texture and dynamic motion information inherited in videos with taking the detail information and global information into account. Specifically, the extraction of the spatio-temporal information in videos is achieved by a two-stream GAN. Considering that proper attention to detail is capable of alleviating the influence of spatio-temporal similar factors to the network, a global-detailed layer is proposed to resist similar factors via fusing intermediate features (i.e., detailed action information) and high-level semantic features (i.e., global action information). It is worthwhile of mentioning that the proposed TS-GAN does not require complex pretext tasks or the construction of positive and negative sample pairs, compared with recent unsupervised video-based action recognition methods. Extensive experiments conducted on the UCF101 and HMDB51 datasets have demonstrated that the proposed TS-GAN is superior to multiple classical and state-of-the-art unsupervised action recognition methods.
引用
收藏
页码:5077 / 5091
页数:14
相关论文
共 50 条
  • [41] Weakly supervised video action localisation via two-stream action activation network
    Yin, Chang
    Liao, Zhongke
    Hu, Haifeng
    Chen, Dihu
    ELECTRONICS LETTERS, 2019, 55 (21) : 1126 - 1127
  • [42] Unsupervised Video-Based Action Recognition With Imagining Motion and Perceiving Appearance
    Lin, Wei
    Liu, Xiaoyu
    Zhuang, Yihong
    Ding, Xinghao
    Tu, Xiaotong
    Huang, Yue
    Zeng, Huanqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) : 2245 - 2258
  • [44] Efficient Two-stream Action Recognition on FPGA
    Lin, Jia-Ming
    Lai, Kuan-Ting
    Wu, Bin-Ray
    Chen, Ming-Syan
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3070 - 3074
  • [45] Fuzzy Fusion for Two-stream Action Recognition
    Sousa e Santos, Anderson Carlos
    Maia, Helena de Almeida
    Roberto e Souza, Marcos
    Vieira, Marcelo Bernardes
    Pedrini, Helio
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 117 - 123
  • [46] A Two-Stream Method for Human Action Recognition Using Facial Action Cues
    Lai, Zhimao
    Zhang, Yan
    Liang, Xiubo
    SENSORS, 2024, 24 (21)
  • [47] Combining Pose and Trajectory for Skeleton Based Action Recognition using Two-Stream RNN
    Pan, Ge
    Song, YongHong
    Wei, ShengHua
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 4375 - 4380
  • [48] Improved human action recognition approach based on two-stream convolutional neural network model
    Congcong Liu
    Jie Ying
    Haima Yang
    Xing Hu
    Jin Liu
    The Visual Computer, 2021, 37 : 1327 - 1341
  • [49] A simulated two-stream network via multilevel distillation of reviewed features and decoupled logits for video action recognition
    Gao, Zitao
    Liu, Xiangjian
    Wang, Anna K.
    Lin, Liyu
    VISUAL COMPUTER, 2024, : 3907 - 3923
  • [50] On Evaluating Video-based Generative Adversarial Networks (GANs)
    Ronquillo, Nancy
    Harguess, Josh
    2018 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2018,