Self-Supervised Learning for Action Recognition by Video Denoising

被引：0

作者：

Thi Thu Trang Phung ^{[1
]}

Thi Hong Thu Ma ^{[2
]}

Van Truong Nguyen ^{[3
]}

Duc Quang Vu ^{[4
]}

机构：

[1] Thai Nguyen Univ, Thai Nguyen, Vietnam

[2] Tan Trao Univ, Tuyen Quang, Vietnam

[3] Thai Nguyen Univ Educ, Thai Nguyen, Vietnam

[4] Natl Cent Univ, Dept CSIE, Taoyuan, Taiwan

来源：

2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021) | 2021年

关键词：

D O I：

10.1109/RIVF51545.2021.9642129

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep learning is a data-hungry technique that is more effective when being applied to large datasets. However, large-scale annotation datasets are not always available. A new approach, such as self-supervised learning of which labels can be automatically generated, is essential. Therefore, using self-supervised learning is a new approach to state-of-the-art methods. In this paper, we introduce a new self-supervised method namely video denoising. This method requires an autoencoder model to restore original videos. The second model is proposed, which is called the discriminator. It is used for the quality evaluation of output videos from the autoencoder. By reconstructing videos, the autoencoder is learned both spatial and temporal relations of video frames to process the downstream task easily. In the experiments, we have demonstrated that our model is well transferred to the action recognition task and outperforms state-of-the-art methods on the UCF-101 and HMDB-51 datasets.

引用

页码：76 / 81

页数：6

共 50 条

[1] Self-Supervised Video-Based Action Recognition With Disturbances
Lin, Wei
Ding, Xinghao
Huang, Yue
Zeng, Huanqiang
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2493 - 2507
[2] Contrastive Self-Supervised Learning for Skeleton Action Recognition
Gao, Xuehao
Yang, Yang
Du, Shaoyi
[J]. NEURIPS 2020 WORKSHOP ON PRE-REGISTRATION IN MACHINE LEARNING, VOL 148, 2020, 148 : 51 - 61
[3] Self-Supervised Video Pose Representation Learning for Occlusion-Robust Action Recognition
Yang, Di
Wang, Yaohui
Dantcheva, Antitza
Garattoni, Lorenzo
Francesca, Gianpiero
Bremond, Francois
[J]. 2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
[4] Part Aware Contrastive Learning for Self-Supervised Action Recognition
Hua, Yilei
Wu, Wenhan
Zheng, Ce
Lu, Aidong
Liu, Mengyuan
Chen, Chen
Wu, Shiqian
[J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 855 - 863
[5] Diffraction denoising using self-supervised learning
Markovic, Magdalena
Malehmir, Reza
Malehmir, Alireza
[J]. GEOPHYSICAL PROSPECTING, 2023, 71 (07) : 1215 - 1225
[6] Self-supervised pretext task collaborative multi-view contrastive learning for video action recognition
Bi, Shuai
Hu, Zhengping
Zhao, Mengyao
Zhang, Hehao
Di, Jirui
Sun, Zhe
[J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (07) : 3775 - 3782
[7] Self-supervised pretext task collaborative multi-view contrastive learning for video action recognition
Shuai Bi
Zhengping Hu
Mengyao Zhao
Hehao Zhang
Jirui Di
Zhe Sun
[J]. Signal, Image and Video Processing, 2023, 17 : 3775 - 3782
[8] Data-Efficient Masked Video Modeling for Self-supervised Action Recognition
Li, Qiankun
Huang, Xiaolong
Wan, Zhifan
Hu, Lanqing
Wu, Shuzhe
Zhang, Jie
Shan, Shiguang
Wang, Zengfu
[J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2723 - 2733
[9] Spatiotemporal consistency enhancement self-supervised representation learning for action recognition
Bi, Shuai
Hu, Zhengping
Zhao, Mengyao
Li, Shufang
Sun, Zhe
[J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 1485 - 1492
[10] Spatiotemporal consistency enhancement self-supervised representation learning for action recognition
Shuai Bi
Zhengping Hu
Mengyao Zhao
Shufang Li
Zhe Sun
[J]. Signal, Image and Video Processing, 2023, 17 : 1485 - 1492

← 1 2 3 4 5 →