Towards Real-time Co-speech Gesture Generation in Online Interaction in Social XR

被引:1
|
作者
Krome, Niklas [1 ]
Kopp, Stefan [1 ]
机构
[1] Bielefeld Univ, Bielefeld, Germany
关键词
extended reality; social interaction; animation; gesture generation; BEHAVIOR; QUALITY;
D O I
10.1145/3570945.3607315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extended Reality (XR) has a potential to allow social interaction for people that are distant from one another, in educational, clinical or co-working applications, as well as for scientific studies. However, a full-blown embodied social presence and interaction via avatars in XR requires motion tracking hardware that many users do not have. At the same time, modern machine learning approaches enable the synthesis of natural and life-like nonverbal behavior, but only in offline settings and with considerable lag. We evaluate the applicability of current gesture generation systems for online interaction in social XR. We define a set of requirements for real-time-capable gesture generation and propose an approach to employ a state-of-the-art model in a real-time XR interaction pipeline. To test the model under conditions of online interaction, we divide an input audio stream into chunks of different lengths and stitch the resulting gesture animations together to form continuous motion. We evaluate the quality of the resulting multimodal avatar behavior in a user study. Our results show a significant trade-off between real-time generation capabilities and gesture quality. Suggestions for future improvement to retain model performance during online interaction in Social XR are made. A project page with videos of the generated gestures is available at https://nkrome.github.io/CAGE.html.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] REAL-TIME BRAIN MACHINE INTERACTION VIA SOCIAL ROBOT GESTURE CONTROL
    Abiri, Reza
    Borhani, Soheil
    Zhao, Xiaopeng
    Jiang, Yang
    PROCEEDINGS OF THE ASME 10TH ANNUAL DYNAMIC SYSTEMS AND CONTROL CONFERENCE, 2017, VOL 1, 2017,
  • [32] Improving Co-speech gesture rule-map generation via wild pose matching with gesture units.
    Ali, Ghazanfar
    Hwang, Jae-In
    SIGGRAPH ASIA 2022 POSTERS, SA 2022, 2022,
  • [33] The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation
    Yoon, Youngwoo
    Wolfert, Pieter
    Kucherenko, Taras
    Viegas, Carla
    Nikolov, Teodor
    Tsakov, Mihail
    Henter, Gustav Eje
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022, 2022, : 736 - 747
  • [34] Towards an Interaction Stimulator Social Robot in the Parent-Child Interaction Therapy based on Real-time Speech Processing
    Vincze, David
    Niitsuma, Mihoko
    2024 33RD IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, ROMAN 2024, 2024, : 886 - 892
  • [35] MDG:Multilingual Co-speech Gesture Generation with Low-level Audio Representation and Diffusion Models
    Yang, Jie
    Bao, Feilong
    2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 210 - 215
  • [36] Hybrid Seq2Seq Architecture for 3D Co-Speech Gesture Generation
    Saleh, Khaled
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022, 2022, : 748 - 752
  • [37] EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation
    Qi, Xingqun
    Liu, Chen
    Li, Lincheng
    Hou, Jie
    Xin, Haoran
    Yu, Xin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 10420 - 10430
  • [38] An Approach to Dynamic Gesture Recognition for Real-Time Interaction
    Zhao, Jinli
    Chen, Tianding
    SIXTH INTERNATIONAL SYMPOSIUM ON NEURAL NETWORKS (ISNN 2009), 2009, 56 : 369 - 377
  • [39] Real-Time Hand Gesture Recognition for Human Robot Interaction
    Correa, Mauricio
    Ruiz-del-Solar, Javier
    Verschae, Rodrigo
    Lee-Ferny, Jong
    Castillo, Nelson
    ROBOCUP 2009: ROBOT SOCCER WORLD CUP XIII, 2010, 5949 : 46 - 57
  • [40] Approach to tracking deformable hand gesture for real-time interaction
    Laboratory of Human-Computer Interaction and Intelligent Information Processing, Institute of Software, Chinese Academy of Sciences, Beijing 100080, China
    Ruan Jian Xue Bao, 2007, 10 (2423-2433):