Self-supervised Representation Learning for Fine Grained Human Hand Action Recognition in Industrial Assembly Lines

Cited by: 1

Authors:

Sturm, Fabian [1,2]

Sathiyababu, Rahul [1]

Allipilli, Harshitha [1]

Hergenroether, Elke [2]

Siegel, Melanie [2]

Affiliations:

[1] Bosch Rexroth AG, Lise Meitner Str 4, D-89081 Ulm, Germany

[2] Univ Appl Sci Darmstadt, Schoefferstr 3, D-64295 Darmstadt, Germany

Source:

ADVANCES IN VISUAL COMPUTING, ISVC 2023, PT I | 2023 / Vol. 14361

Keywords:

Self-Supervised Learning; Human Action Recognition; Industrial Vision;

DOI:

10.1007/978-3-031-47969-4_14

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Subject classification code:

081202

Abstract:

Humans are still indispensable on industrial assembly lines, but in the event of an error, they need support from intelligent systems. In addition to the objects to be observed, it is equally important to understand the fine-grained hand movements of a human to be able to track the entire process. However, these deep learning based hand action recognition methods are very label intensive, which cannot be offered by all industrial companies due to the associated costs. This work therefore presents a self-supervised learning approach for industrial assembly processes that allows a spatio-temporal transformer architecture to be pre-trained on a variety of information from real-world video footage of daily life. Subsequently, this deep learning model is adapted to the industrial assembly task at hand using only a few labels. It is shown which known real-world datasets are best suited for representation learning of these hand actions in a regression task, and to what extent they optimize the subsequent supervised trained classification task.


Pages: 172-184

Page count: 13

50 items in total

[31] Whitening for Self-Supervised Representation Learning
Ermolov, Aleksandr
Siarohin, Aliaksandr
Sangineto, Enver
Sebe, Nicu
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[32] Self-Supervised Representation Learning for CAD
Jones, Benjamin T.
Hu, Michael
Kodnongbua, Milin
Kim, Vladimir G.
Schulz, Adriana
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21327 - 21336
[33] SELFGAIT: A SPATIOTEMPORAL REPRESENTATION LEARNING METHOD FOR SELF-SUPERVISED GAIT RECOGNITION
Liu, Yiqun
Zeng, Yi
Pu, Jian
Shan, Hongming
He, Peiyang
Zhang, Junping
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2570 - 2574
[34] Feature Decoupling in Self-supervised Representation Learning for Open Set Recognition
Jia, Jingyun
Chan, Philip K.
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[35] Self-supervised representation learning using multimodal Transformer for emotion recognition
Goetz, Theresa
Arora, Pulkit
Erick, F. X.
Holzer, Nina
Sawant, Shrutika
PROCEEDINGS OF THE 8TH INTERNATIONAL WORKSHOP ON SENSOR-BASED ACTIVITY RECOGNITION AND ARTIFICIAL INTELLIGENCE, IWOAR 2023, 2023,
[36] EXPLORING THE INTEGRATION OF SPEECH SEPARATION AND RECOGNITION WITH SELF-SUPERVISED LEARNING REPRESENTATION
Masuyama, Yoshiki
Chang, Xuankai
Zhang, Wangyou
Cornell, Samuele
Wang, Zhong-Qiu
Ono, Nobutaka
Qian, Yanmin
Watanabe, Shinji
2023 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, WASPAA, 2023,
[37] Self-supervised Detransformation Autoencoder for Representation Learning in Open Set Recognition
Jia, Jingyun
Chan, Philip K.
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 471 - 483
[38] Hierarchically Self-supervised Transformer for Human Skeleton Representation Learning
Chen, Yuxiao
Zhao, Long
Yuan, Jianbo
Tian, Yu
Xia, Zhaoyang
Geng, Shijie
Han, Ligong
Metaxas, Dimitris N.
COMPUTER VISION, ECCV 2022, PT XXVI, 2022, 13686 : 185 - 202
[39] SSRL: Self-Supervised Spatial-Temporal Representation Learning for 3D Action Recognition
Jin, Zhihao
Wang, Yifan
Wang, Qicong
Shen, Yehu
Meng, Hongying
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 274 - 285
[40] Self-Supervised Human Activity Recognition With Localized Time-Frequency Contrastive Representation Learning
Taghanaki, Setareh Rahimi
Rainbow, Michael
Etemad, Ali
IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2023, 53 (06) : 1027 - 1037
