Self-supervised Representation Learning for Fine Grained Human Hand Action Recognition in Industrial Assembly Lines

被引：1

作者：

Sturm, Fabian ^{[1
,2
]}

Sathiyababu, Rahul ^{[1
]}

Allipilli, Harshitha ^{[1
]}

Hergenroether, Elke ^{[2
]}

Siegel, Melanie ^{[2
]}

机构：

[1] Bosch Rexroth AG, Lise Meitner Str 4, D-89081 Ulm, Germany

[2] Univ Appl Sci Darmstadt, Schoefferstr 3, D-64295 Darmstadt, Germany

来源：

ADVANCES IN VISUAL COMPUTING, ISVC 2023, PT I | 2023年 / 14361卷

关键词：

Self-Supervised Learning; Human Action Recognition; Industrial Vision;

D O I：

10.1007/978-3-031-47969-4_14

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Humans are still indispensable on industrial assembly lines, but in the event of an error, they need support from intelligent systems. In addition to the objects to be observed, it is equally important to understand the fine-grained hand movements of a human to be able to track the entire process. However, these deep learning based hand action recognition methods are very label intensive, which cannot be offered by all industrial companies due to the associated costs. This work therefore presents a self-supervised learning approach for industrial assembly processes that allows a spatio-temporal transformer architecture to be pre-trained on a variety of information from real-world video footage of daily life. Subsequently, this deep learning model is adapted to the industrial assembly task at hand using only a few labels. It is shown which known real-world datasets are best suited for representation learning of these hand actions in a regression task, and to what extent they optimize the subsequent supervised trained classification task.

引用

页码：172 / 184

页数：13

共 50 条

[41] SELF-SUPERVISED CONTRASTIVE LEARNING FOR AUDIO-VISUAL ACTION RECOGNITION
Liu, Yang
Tan, Ying
Lan, Haoyuan
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1000 - 1004
[42] ColloSSL: Collaborative Self-Supervised Learning for Human Activity Recognition
Jain, Yash
Tang, Chi Ian
Min, Chulhong
Kawsar, Fahim
Mathur, Akhil
PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2022, 6 (01):
[43] Fine-Grained Self-Supervised Learning with Jigsaw puzzles for medical image classification
Park W.
Ryu J.
Comput. Biol. Med., 2024,
[44] Motion Guided Attention Learning for Self-Supervised 3D Human Action Recognition
Yang, Yang
Liu, Guangjun
Gao, Xuehao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8623 - 8634
[45] Cross-Model Cross-Stream Learning for Self-Supervised Human Action Recognition
Liu, Mengyuan
Liu, Hong
Guo, Tianyu
IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2024, 54 (06) : 743 - 752
[46] Webly Supervised Fine-Grained Image Recognition with Graph Representation and Metric Learning
Lin, Jianman
Lin, Jiantao
Gao, Yuefang
Yang, Zhijing
Chen, Tianshui
ELECTRONICS, 2022, 11 (24)
[47] Self-Distilled Self-supervised Representation Learning
Jang, Jiho
Kim, Seonhoon
Yoo, Kiyoon
Kong, Chaerin
Kim, Jangho
Kwak, Nojun
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2828 - 2838
[48] Self-Supervised Representation Learning for Skeleton-Based Group Activity Recognition
Bian, Cunling
Feng, Wei
Wang, Song
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5990 - 5998
[49] Self-supervised Learning Representation based Accent Recognition with Persistent Accent Memory
Li, Rui
Xie, Zhiwei
Xu, Haihua
Peng, Yizhou
Liu, Hexin
Huang, Hao
Chng, Eng Siong
INTERSPEECH 2023, 2023, : 1968 - 1972
[50] Applying Self-Supervised Representation Learning for Emotion Recognition Using Physiological Signals
Quispe, Kevin G. Montero G.
Utyiama, Daniel M. S.
dos Santos, Eulanda M. M.
Oliveira, Horacio A. B. F.
Souto, Eduardo J. P.
SENSORS, 2022, 22 (23)

← 1 2 3 4 5 →