Augmented two stream network for robust action recognition adaptive to various action videos

被引:7
|
作者
Leng, Chuanjiang [1 ]
Ding, Qichuan [1 ]
Wu, Chengdong [1 ]
Chen, Ange [1 ]
机构
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110169, Peoples R China
基金
中国国家自然科学基金;
关键词
Two-stream network; Action recognition; Data skew;
D O I
10.1016/j.jvcir.2021.103344
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In video-based action recognition, using videos with different frame numbers to train a two-stream network can result in data skew problems. Moreover, extracting the key frames from a video is crucial for improving the training and recognition efficiency of action recognition systems. However, previous works suffer from problems of information loss and optical-flow interference when handling videos with different frame numbers. In this paper, an augmented two-stream network (ATSNet) is proposed to achieve robust action recognition. A frame-number-unified strategy is first incorporated into the temporal stream network to unify the frame numbers of videos. Subsequently, the grayscale statistics of the optical-flow images are extracted to filter out any invalid optical-flow images and produce the dynamic fusion weights for the two branch networks to adapt to different action videos. Experiments conducted on the UCF101 dataset demonstrate that ATSNet outperforms previously defined methods, improving the recognition accuracy by 1.13%.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Transferable two-stream convolutional neural network for human action recognition
    Xiong, Qianqian
    Zhang, Jianjing
    Wang, Peng
    Liu, Dongdong
    Gao, Robert X.
    JOURNAL OF MANUFACTURING SYSTEMS, 2020, 56 : 605 - 614
  • [22] Skeleton action recognition using Two-Stream Adaptive Graph Convolutional Networks
    Lee, James
    Kang, Suk-ju
    2021 36TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC), 2021,
  • [23] An Overview of Action Recognition in Videos
    Buric, M.
    Pobar, M.
    Kos, M. Ivasic
    2017 40TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2017, : 1098 - 1103
  • [24] Fast-slow visual network for action recognition in videos
    Hu, Heng
    Liu, Tongcun
    Feng, Hailin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (18) : 26361 - 26379
  • [25] Enhanced Spatial Stream of Two-Stream Network Using Optical Flow for Human Action Recognition
    Khan, Shahbaz
    Hassan, Ali
    Hussain, Farhan
    Perwaiz, Aqib
    Riaz, Farhan
    Alsabaan, Maazen
    Abdul, Wadood
    APPLIED SCIENCES-BASEL, 2023, 13 (14):
  • [26] Efficient Two-stream Action Recognition on FPGA
    Lin, Jia-Ming
    Lai, Kuan-Ting
    Wu, Bin-Ray
    Chen, Ming-Syan
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3070 - 3074
  • [27] Fuzzy Fusion for Two-stream Action Recognition
    Sousa e Santos, Anderson Carlos
    Maia, Helena de Almeida
    Roberto e Souza, Marcos
    Vieira, Marcelo Bernardes
    Pedrini, Helio
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 117 - 123
  • [28] A two-stream heterogeneous network for action recognition based on skeleton and RGB modalities
    Liu, Kai
    Gao, Lei
    Khan, Naimul Mefraz
    Qi, Lin
    Guan, Ling
    23RD IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2021), 2021, : 87 - 91
  • [29] Thermal infrared action recognition with two-stream shift Graph Convolutional Network
    Liu, Jishi
    Wang, Huanyu
    Wang, Junnian
    He, Dalin
    Xu, Ruihan
    Tang, Xiongfeng
    MACHINE VISION AND APPLICATIONS, 2024, 35 (04)
  • [30] Human Action Recognition based on Two-Stream Ind Recurrent Neural Network
    Ge Penghua
    Zhi Min
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069