Self-Supervised Learning for Action Recognition by Video Denoising

被引:0
|
作者
Thi Thu Trang Phung [1 ]
Thi Hong Thu Ma [2 ]
Van Truong Nguyen [3 ]
Duc Quang Vu [4 ]
机构
[1] Thai Nguyen Univ, Thai Nguyen, Vietnam
[2] Tan Trao Univ, Tuyen Quang, Vietnam
[3] Thai Nguyen Univ Educ, Thai Nguyen, Vietnam
[4] Natl Cent Univ, Dept CSIE, Taoyuan, Taiwan
关键词
D O I
10.1109/RIVF51545.2021.9642129
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning is a data-hungry technique that is more effective when being applied to large datasets. However, large-scale annotation datasets are not always available. A new approach, such as self-supervised learning of which labels can be automatically generated, is essential. Therefore, using self-supervised learning is a new approach to state-of-the-art methods. In this paper, we introduce a new self-supervised method namely video denoising. This method requires an autoencoder model to restore original videos. The second model is proposed, which is called the discriminator. It is used for the quality evaluation of output videos from the autoencoder. By reconstructing videos, the autoencoder is learned both spatial and temporal relations of video frames to process the downstream task easily. In the experiments, we have demonstrated that our model is well transferred to the action recognition task and outperforms state-of-the-art methods on the UCF-101 and HMDB-51 datasets.
引用
收藏
页码:76 / 81
页数:6
相关论文
共 50 条
  • [1] Self-Supervised Video-Based Action Recognition With Disturbances
    Lin, Wei
    Ding, Xinghao
    Huang, Yue
    Zeng, Huanqiang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2493 - 2507
  • [2] Contrastive Self-Supervised Learning for Skeleton Action Recognition
    Gao, Xuehao
    Yang, Yang
    Du, Shaoyi
    [J]. NEURIPS 2020 WORKSHOP ON PRE-REGISTRATION IN MACHINE LEARNING, VOL 148, 2020, 148 : 51 - 61
  • [3] Self-Supervised Video Pose Representation Learning for Occlusion-Robust Action Recognition
    Yang, Di
    Wang, Yaohui
    Dantcheva, Antitza
    Garattoni, Lorenzo
    Francesca, Gianpiero
    Bremond, Francois
    [J]. 2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [4] Part Aware Contrastive Learning for Self-Supervised Action Recognition
    Hua, Yilei
    Wu, Wenhan
    Zheng, Ce
    Lu, Aidong
    Liu, Mengyuan
    Chen, Chen
    Wu, Shiqian
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 855 - 863
  • [5] Diffraction denoising using self-supervised learning
    Markovic, Magdalena
    Malehmir, Reza
    Malehmir, Alireza
    [J]. GEOPHYSICAL PROSPECTING, 2023, 71 (07) : 1215 - 1225
  • [6] Self-supervised pretext task collaborative multi-view contrastive learning for video action recognition
    Bi, Shuai
    Hu, Zhengping
    Zhao, Mengyao
    Zhang, Hehao
    Di, Jirui
    Sun, Zhe
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (07) : 3775 - 3782
  • [7] Self-supervised pretext task collaborative multi-view contrastive learning for video action recognition
    Shuai Bi
    Zhengping Hu
    Mengyao Zhao
    Hehao Zhang
    Jirui Di
    Zhe Sun
    [J]. Signal, Image and Video Processing, 2023, 17 : 3775 - 3782
  • [8] Data-Efficient Masked Video Modeling for Self-supervised Action Recognition
    Li, Qiankun
    Huang, Xiaolong
    Wan, Zhifan
    Hu, Lanqing
    Wu, Shuzhe
    Zhang, Jie
    Shan, Shiguang
    Wang, Zengfu
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2723 - 2733
  • [9] Spatiotemporal consistency enhancement self-supervised representation learning for action recognition
    Bi, Shuai
    Hu, Zhengping
    Zhao, Mengyao
    Li, Shufang
    Sun, Zhe
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 1485 - 1492
  • [10] Spatiotemporal consistency enhancement self-supervised representation learning for action recognition
    Shuai Bi
    Zhengping Hu
    Mengyao Zhao
    Shufang Li
    Zhe Sun
    [J]. Signal, Image and Video Processing, 2023, 17 : 1485 - 1492