Implicitly using Human Skeleton in Self-supervised Learning: Influence on Spatio-temporal Puzzle Solving and on Video Action Recognition

被引：0

作者：

Riand, Mathieu ^{[1
,2
]}

Dolle, Laurent ^{[1
]}

Le Callet, Patrick ^{[2
]}

机构：

[1] CEA Tech Pays Loire, F-44340 Bouguenais, France

[2] Nantes Univ, Lab Sci Numer Nantes, Equipe Image Percept & Interact, Nantes, France

来源：

PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ROBOTICS, COMPUTER VISION AND INTELLIGENT SYSTEMS (ROBOVIS) | 2021年

关键词：

Self-supervised Learning; Siamese Network; Skeleton Keypoints; Action Recognition; Few-shot Learning;

D O I：

10.5220/0010689500003061

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we studied the influence of adding skeleton data on top of human actions videos when performing self-supervised learning and action recognition. We show that adding this information without additional constraints actually hurts the accuracy of the network; we argue that the added skeleton is not considered by the network and seen as a noise masking part of the natural image. We bring first results on puzzle solving and video action recognition to support this hypothesis.

引用

页码：128 / 135

页数：8

共 50 条

[1] Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning
Luo, Dezhao
Liu, Chang
Zhou, Yu
Yang, Dongbao
Ma, Can
Ye, Qixiang
Wang, Weiping
[J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11701 - 11708
[2] Self-Supervised Video Representation Learning by Uncovering Spatio-Temporal Statistics
Wang, Jiangliu
Jiao, Jianbo
Bao, Linchao
He, Shengfeng
Liu, Wei
Liu, Yun-hui
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3791 - 3806
[3] Contrastive Spatio-Temporal Pretext Learning for Self-Supervised Video Representation
Zhang, Yujia
Po, Lai-Man
Xu, Xuyuan
Liu, Mengyang
Wang, Yexin
Ou, Weifeng
Zhao, Yuzhi
Yu, Wing-Yin
[J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3380 - 3389
[4] Video Playback Rate Perception for Self-supervised Spatio-Temporal Representation Learning
Yao, Yuan
Liu, Chang
Luo, Dezhao
Zhou, Yu
Ye, Qixiang
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6547 - 6556
[5] Contrastive Self-Supervised Learning for Skeleton Action Recognition
Gao, Xuehao
Yang, Yang
Du, Shaoyi
[J]. NEURIPS 2020 WORKSHOP ON PRE-REGISTRATION IN MACHINE LEARNING, VOL 148, 2020, 148 : 51 - 61
[6] Self-Supervised Learning for Action Recognition by Video Denoising
Thi Thu Trang Phung
Thi Hong Thu Ma
Van Truong Nguyen
Duc Quang Vu
[J]. 2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 76 - 81
[7] Spatio-Temporal Self-Supervised Learning for Traffic Flow Prediction
Ji, Jiahao
Wang, Jingyuan
Huang, Chao
Wu, Junjie
Xu, Boren
Wu, Zhenhe
Zhang, Junbo
Zheng, Yu
[J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 4356 - 4364
[8] Self-Supervised Regrasping using Spatio-Temporal Tactile Features and Reinforcement Learning
Chebotar, Yevgen
Hausman, Karol
Su, Zhe
Sukhatme, Gaurav S.
Schaal, Stefan
[J]. 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 1960 - 1966
[9] Spatio-Temporal Catcher: a Self-Supervised Transformer for Deepfake Video Detection
Li, Maosen
Li, Xurong
Yu, Kun
Deng, Cheng
Huang, Heng
Mao, Feng
Xue, Hui
Li, Minghao
[J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 8707 - 8718
[10] Spatio-temporal Video Autoencoder for Human Action Recognition
Sousa e Santos, Anderson Carlos
Pedrini, Helio
[J]. PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 114 - 123

← 1 2 3 4 5 →