Self-supervised Representation Learning for Fine Grained Human Hand Action Recognition in Industrial Assembly Lines

被引:1
|
作者
Sturm, Fabian [1 ,2 ]
Sathiyababu, Rahul [1 ]
Allipilli, Harshitha [1 ]
Hergenroether, Elke [2 ]
Siegel, Melanie [2 ]
机构
[1] Bosch Rexroth AG, Lise Meitner Str 4, D-89081 Ulm, Germany
[2] Univ Appl Sci Darmstadt, Schoefferstr 3, D-64295 Darmstadt, Germany
关键词
Self-Supervised Learning; Human Action Recognition; Industrial Vision;
D O I
10.1007/978-3-031-47969-4_14
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Humans are still indispensable on industrial assembly lines, but in the event of an error, they need support from intelligent systems. In addition to the objects to be observed, it is equally important to understand the fine-grained hand movements of a human to be able to track the entire process. However, these deep learning based hand action recognition methods are very label intensive, which cannot be offered by all industrial companies due to the associated costs. This work therefore presents a self-supervised learning approach for industrial assembly processes that allows a spatio-temporal transformer architecture to be pre-trained on a variety of information from real-world video footage of daily life. Subsequently, this deep learning model is adapted to the industrial assembly task at hand using only a few labels. It is shown which known real-world datasets are best suited for representation learning of these hand actions in a regression task, and to what extent they optimize the subsequent supervised trained classification task.
引用
收藏
页码:172 / 184
页数:13
相关论文
共 50 条
  • [41] SELF-SUPERVISED CONTRASTIVE LEARNING FOR AUDIO-VISUAL ACTION RECOGNITION
    Liu, Yang
    Tan, Ying
    Lan, Haoyuan
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1000 - 1004
  • [42] ColloSSL: Collaborative Self-Supervised Learning for Human Activity Recognition
    Jain, Yash
    Tang, Chi Ian
    Min, Chulhong
    Kawsar, Fahim
    Mathur, Akhil
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2022, 6 (01):
  • [43] Fine-Grained Self-Supervised Learning with Jigsaw puzzles for medical image classification
    Park W.
    Ryu J.
    Comput. Biol. Med., 2024,
  • [44] Motion Guided Attention Learning for Self-Supervised 3D Human Action Recognition
    Yang, Yang
    Liu, Guangjun
    Gao, Xuehao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8623 - 8634
  • [45] Cross-Model Cross-Stream Learning for Self-Supervised Human Action Recognition
    Liu, Mengyuan
    Liu, Hong
    Guo, Tianyu
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2024, 54 (06) : 743 - 752
  • [46] Webly Supervised Fine-Grained Image Recognition with Graph Representation and Metric Learning
    Lin, Jianman
    Lin, Jiantao
    Gao, Yuefang
    Yang, Zhijing
    Chen, Tianshui
    ELECTRONICS, 2022, 11 (24)
  • [47] Self-Distilled Self-supervised Representation Learning
    Jang, Jiho
    Kim, Seonhoon
    Yoo, Kiyoon
    Kong, Chaerin
    Kim, Jangho
    Kwak, Nojun
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2828 - 2838
  • [48] Self-Supervised Representation Learning for Skeleton-Based Group Activity Recognition
    Bian, Cunling
    Feng, Wei
    Wang, Song
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5990 - 5998
  • [49] Self-supervised Learning Representation based Accent Recognition with Persistent Accent Memory
    Li, Rui
    Xie, Zhiwei
    Xu, Haihua
    Peng, Yizhou
    Liu, Hexin
    Huang, Hao
    Chng, Eng Siong
    INTERSPEECH 2023, 2023, : 1968 - 1972
  • [50] Applying Self-Supervised Representation Learning for Emotion Recognition Using Physiological Signals
    Quispe, Kevin G. Montero G.
    Utyiama, Daniel M. S.
    dos Santos, Eulanda M. M.
    Oliveira, Horacio A. B. F.
    Souto, Eduardo J. P.
    SENSORS, 2022, 22 (23)