Diffusion-Based Unsupervised Pre-training for Automated Recognition of Vitality Forms

Authors
Canovi, Noemi [1]
Montagna, Federico [1]
Niewiadomski, Radoslaw [2]
Sciutti, Alessandra [3]
Di Cesare, Giuseppe [3, 4]
Beyan, Cigdem [5]
Affiliations
[1] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[2] Univ Genoa, Dept Informat Bioengn Robot & Syst Engn, Genoa, Italy
[3] Ist Italiano Tecnol, CONTACT Unit, Genoa, Italy
[4] Univ Parma, Dept Med & Surg, Parma, Italy
[5] Univ Verona, Dept Comp Sci, Verona, Italy
Funding
European Research Council
Keywords
Vitality forms; nonverbal communication; unsupervised pre-training; diffusion models; autoencoders; gestures; actions; trajectory; EXPRESSION;
DOI
10.1145/3656650.3656689
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Social communication involves interpreting nonverbal behaviors and detecting and anticipating others' actions and intentions. Actions convey not only the goal and motor intention but also the form, i.e., variations in how the action is executed. These variations, termed vitality forms, communicate attitudes during interactions, such as being gentle, calm, vigorous, or rude. Automatic vitality form recognition has potential applications in social robotics, social skills training, and therapy, yet it remains a rarely studied topic. This paper introduces an unsupervised pre-training approach that takes 2D body keypoint trajectories as input and uses diffusion models to learn more effective features for representing these trajectories. The features learned by the diffusion model's encoder are then used to train a multilayer perceptron for vitality form recognition. Experimental analysis shows that the proposed method achieves superior performance not only across various videos but also on action classes not encountered during training.
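The abstract does not say which diffusion formulation is used. As a rough illustration only, the sketch below assumes the standard DDPM forward (noising) process applied to a toy keypoint trajectory. The schedule values, the trajectory, and all function names are hypothetical and are not taken from the paper; the encoder and the multilayer perceptron are omitted.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances beta_1..beta_T (assumed values)."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alpha_bar(betas):
    """alpha_bar_t is the running product of (1 - beta_s) for s <= t."""
    alpha_bars, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def add_noise(trajectory, alpha_bar_t, rng):
    """Forward diffusion: x_t = sqrt(a) * x_0 + sqrt(1 - a) * eps, eps ~ N(0, I).

    trajectory: list of frames, each a flat list of 2D keypoint coordinates.
    Returns (noisy trajectory, noise); in DDPM-style training the noise is
    the regression target for the denoising network.
    """
    signal_scale = math.sqrt(alpha_bar_t)
    noise_scale = math.sqrt(1.0 - alpha_bar_t)
    noisy, noise = [], []
    for frame in trajectory:
        eps = [rng.gauss(0.0, 1.0) for _ in frame]
        noisy.append([signal_scale * x + noise_scale * e
                      for x, e in zip(frame, eps)])
        noise.append(eps)
    return noisy, noise


rng = random.Random(0)
# Hypothetical toy input: 4 frames, 2 keypoints stored as (x1, y1, x2, y2).
trajectory = [[0.1 * t, 0.2 * t, 0.5, 0.5 - 0.05 * t] for t in range(4)]
betas = linear_beta_schedule(1000)
alpha_bars = cumulative_alpha_bar(betas)
t = 500
noisy, noise = add_noise(trajectory, alpha_bars[t], rng)
```

In a pre-training setup of this kind, a network would be trained to predict `noise` from `noisy` and `t` without any labels, and its encoder activations could then serve as the input features for a small supervised classifier.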
Pages: 9