Augmented two stream network for robust action recognition adaptive to various action videos

被引:7
|
作者
Leng, Chuanjiang [1 ]
Ding, Qichuan [1 ]
Wu, Chengdong [1 ]
Chen, Ange [1 ]
机构
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110169, Peoples R China
基金
中国国家自然科学基金;
关键词
Two-stream network; Action recognition; Data skew;
D O I
10.1016/j.jvcir.2021.103344
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In video-based action recognition, using videos with different frame numbers to train a two-stream network can result in data skew problems. Moreover, extracting the key frames from a video is crucial for improving the training and recognition efficiency of action recognition systems. However, previous works suffer from problems of information loss and optical-flow interference when handling videos with different frame numbers. In this paper, an augmented two-stream network (ATSNet) is proposed to achieve robust action recognition. A frame-number-unified strategy is first incorporated into the temporal stream network to unify the frame numbers of videos. Subsequently, the grayscale statistics of the optical-flow images are extracted to filter out any invalid optical-flow images and produce the dynamic fusion weights for the two branch networks to adapt to different action videos. Experiments conducted on the UCF101 dataset demonstrate that ATSNet outperforms previously defined methods, improving the recognition accuracy by 1.13%.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Two-Stream 3D Convolution Attentional Network for Action Recognition
    Kusumoseniarto, Raden Hadapiningsyah
    2020 JOINT 9TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2020 4TH INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2020,
  • [32] Three-Stream Network With Bidirectional Self-Attention for Action Recognition in Extreme Low Resolution Videos
    Purwanto, Didik
    Pramono, Rizard Renanda Adhi
    Chen, Yie-Tarng
    Fang, Wen-Hsien
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (08) : 1187 - 1191
  • [33] Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
    Xiong, Xin
    Min, Weidong
    Han, Qing
    Wang, Qi
    Zha, Cheng
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [34] Two-Stream 3-D convNet Fusion for Action Recognition in Videos With Arbitrary Size and Length
    Wang, Xuanhan
    Gao, Lianli
    Wang, Peng
    Sun, Xiaoshuai
    Liu, Xianglong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (03) : 634 - 644
  • [35] Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition
    Shi, Lei
    Zhang, Yifan
    Cheng, Jian
    Lu, Hanqing
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12018 - 12027
  • [36] PA-AWCNN: Two-stream Parallel Attention Adaptive Weight Network for RGB-D Action Recognition
    Yao, Lu
    Liu, Sheng
    Li, Chaonan
    Zou, Siyu
    Chen, Shengyong
    Guan, Diyi
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 8741 - 8747
  • [37] Motion Guided Feature-Augmented Network for Action Recognition
    Zheng, Zhenxing
    An, Gaoyun
    Ruan, Qiuqi
    PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 391 - 394
  • [38] A Two-Stream Method for Human Action Recognition Using Facial Action Cues
    Lai, Zhimao
    Zhang, Yan
    Liang, Xiubo
    SENSORS, 2024, 24 (21)
  • [39] Action Recognition From Thermal Videos
    Batchuluun, Ganbayar
    Nguyen, Dat Tien
    Tuyen Danh Pham
    Park, Chanhum
    Park, Kang Ryoung
    IEEE ACCESS, 2019, 7 : 103893 - 103917
  • [40] SCNN: SEQUENTIAL CONVOLUTIONAL NEURAL NETWORK FOR HUMAN ACTION RECOGNITION IN VIDEOS
    Yang, Hao
    Yuan, Chunfeng
    Xing, Junliang
    Hu, Weiming
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 355 - 359