Towards Real-time Co-speech Gesture Generation in Online Interaction in Social XR

被引:1
|
作者
Krome, Niklas [1 ]
Kopp, Stefan [1 ]
机构
[1] Bielefeld Univ, Bielefeld, Germany
关键词
extended reality; social interaction; animation; gesture generation; BEHAVIOR; QUALITY;
D O I
10.1145/3570945.3607315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extended Reality (XR) has a potential to allow social interaction for people that are distant from one another, in educational, clinical or co-working applications, as well as for scientific studies. However, a full-blown embodied social presence and interaction via avatars in XR requires motion tracking hardware that many users do not have. At the same time, modern machine learning approaches enable the synthesis of natural and life-like nonverbal behavior, but only in offline settings and with considerable lag. We evaluate the applicability of current gesture generation systems for online interaction in social XR. We define a set of requirements for real-time-capable gesture generation and propose an approach to employ a state-of-the-art model in a real-time XR interaction pipeline. To test the model under conditions of online interaction, we divide an input audio stream into chunks of different lengths and stitch the resulting gesture animations together to form continuous motion. We evaluate the quality of the resulting multimodal avatar behavior in a user study. Our results show a significant trade-off between real-time generation capabilities and gesture quality. Suggestions for future improvement to retain model performance during online interaction in Social XR are made. A project page with videos of the generated gestures is available at https://nkrome.github.io/CAGE.html.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Real-time Social Touch Gesture Recognition for Sensate Robots
    Knight, Heather
    Toscano, Robert
    Stiehl, Walter D.
    Chang, Angela
    Wang, Yi
    Breazeal, Cynthia
    2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 3715 - 3720
  • [42] TOWARDS REAL-TIME MISPRONUNCIATION DETECTION IN KIDS' SPEECH
    Plantinga, Peter
    Fosler-Lussier, Eric
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 690 - 696
  • [43] Towards a Real-Time Speech Neuroprosthesis to Restore Speech in a Person With ALS
    Wairagkar, Maitreyee
    Card, Nicholas
    Iacobacci, Carrina
    Hochberg, Leigh R.
    Stavisky, Sergey
    Brandman, David M.
    NEUROSURGERY, 2025, 71 : 58 - 58
  • [44] TAG2G: A Diffusion-Based Approach to Interlocutor-Aware Co-Speech Gesture Generation
    Favali, Filippo
    Schmuck, Viktor
    Villani, Valeria
    Celiktutan, Oya
    ELECTRONICS, 2024, 13 (17)
  • [45] Real-time Generation of Dynamic Wave Interaction
    Shu Dan
    Wang Chang-bo
    Li, Haishen
    Fu Yinyu
    Wang Zhaomeng
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 3143 - 3146
  • [46] Real-time Hand Gesture Communication System in Hindi for Speech and Hearing Impaired
    Chaman, Shilpa
    D'souza, Dylan
    D'mello, Benz
    Bhavsar, Karan
    D'souza, Jolton
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1954 - 1958
  • [47] A Real-Time Robot Motion Generation System Based on Human Gesture
    Haramaki, Toshiyuki
    Goto, Kenta
    Tsutsumi, Hokuto
    Yatsuda, Akihito
    Nishino, Hiroaki
    ADVANCES ON BROADBAND AND WIRELESS COMPUTING, COMMUNICATION AND APPLICATIONS, BWCCA-2018, 2019, 25 : 135 - 146
  • [48] Real-Time Hand Gesture Detection and Recognition for Human Computer Interaction
    Yadav, Kapil
    Bhattacharya, Jhilik
    INTELLIGENT SYSTEMS TECHNOLOGIES AND APPLICATIONS, VOL 1, 2016, 384 : 559 - 567
  • [49] Networked Gesture Tracking System Based on Immersive Real-Time Interaction
    Li, Jie
    Wang, Zhelong
    Jiang, Yongmei
    Qiu, Sen
    Wang, JiaXing
    Tang, Kai
    2017 IEEE 21ST INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2017, : 139 - 144
  • [50] Real-time online photonic random number generation
    Li, Pu
    Zhang, Jianguo
    Sang, Luxiao
    Liu, Xianglian
    Guo, Yanqiang
    Guo, Xiaomin
    Wang, Anbang
    Shore, K. Alan
    Wang, Yuncai
    OPTICS LETTERS, 2017, 42 (14) : 2699 - 2702