Augmented two stream network for robust action recognition adaptive to various action videos

被引：7

作者：

Leng, Chuanjiang ^{[1
]}

Ding, Qichuan ^{[1
]}

Wu, Chengdong ^{[1
]}

Chen, Ange ^{[1
]}

机构：

[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110169, Peoples R China

来源：

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2021年 / 81卷

基金：

中国国家自然科学基金;

关键词：

Two-stream network; Action recognition; Data skew;

D O I：

10.1016/j.jvcir.2021.103344

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In video-based action recognition, using videos with different frame numbers to train a two-stream network can result in data skew problems. Moreover, extracting the key frames from a video is crucial for improving the training and recognition efficiency of action recognition systems. However, previous works suffer from problems of information loss and optical-flow interference when handling videos with different frame numbers. In this paper, an augmented two-stream network (ATSNet) is proposed to achieve robust action recognition. A frame-number-unified strategy is first incorporated into the temporal stream network to unify the frame numbers of videos. Subsequently, the grayscale statistics of the optical-flow images are extracted to filter out any invalid optical-flow images and produce the dynamic fusion weights for the two branch networks to adapt to different action videos. Experiments conducted on the UCF101 dataset demonstrate that ATSNet outperforms previously defined methods, improving the recognition accuracy by 1.13%.

引用

页数：8

共 50 条

[31] Two-Stream 3D Convolution Attentional Network for Action Recognition
Kusumoseniarto, Raden Hadapiningsyah
2020 JOINT 9TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2020 4TH INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2020,
[32] Three-Stream Network With Bidirectional Self-Attention for Action Recognition in Extreme Low Resolution Videos
Purwanto, Didik
Pramono, Rizard Renanda Adhi
Chen, Yie-Tarng
Fang, Wen-Hsien
IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (08) : 1187 - 1191
[33] Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
Xiong, Xin
Min, Weidong
Han, Qing
Wang, Qi
Zha, Cheng
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[34] Two-Stream 3-D convNet Fusion for Action Recognition in Videos With Arbitrary Size and Length
Wang, Xuanhan
Gao, Lianli
Wang, Peng
Sun, Xiaoshuai
Liu, Xianglong
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (03) : 634 - 644
[35] Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition
Shi, Lei
Zhang, Yifan
Cheng, Jian
Lu, Hanqing
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12018 - 12027
[36] PA-AWCNN: Two-stream Parallel Attention Adaptive Weight Network for RGB-D Action Recognition
Yao, Lu
Liu, Sheng
Li, Chaonan
Zou, Siyu
Chen, Shengyong
Guan, Diyi
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 8741 - 8747
[37] Motion Guided Feature-Augmented Network for Action Recognition
Zheng, Zhenxing
An, Gaoyun
Ruan, Qiuqi
PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 391 - 394
[38] A Two-Stream Method for Human Action Recognition Using Facial Action Cues
Lai, Zhimao
Zhang, Yan
Liang, Xiubo
SENSORS, 2024, 24 (21)
[39] Action Recognition From Thermal Videos
Batchuluun, Ganbayar
Nguyen, Dat Tien
Tuyen Danh Pham
Park, Chanhum
Park, Kang Ryoung
IEEE ACCESS, 2019, 7 : 103893 - 103917
[40] SCNN: SEQUENTIAL CONVOLUTIONAL NEURAL NETWORK FOR HUMAN ACTION RECOGNITION IN VIDEOS
Yang, Hao
Yuan, Chunfeng
Xing, Junliang
Hu, Weiming
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 355 - 359

← 1 2 3 4 5 →