Real-Time Conversational Gaze Synthesis for Avatars

Cited by: 3
Authors
Canales, Ryan [1]
Jain, Eakta [2]
Joerg, Sophie [1, 3]
Affiliations
[1] Clemson Univ, Clemson, SC 29634 USA
[2] Univ Florida, Gainesville, FL USA
[3] Univ Bamberg, Bamberg, Germany
Funding
National Science Foundation (USA);
Keywords
gaze animation; avatars; motion perception; virtual reality; EYE GAZE; MODEL;
DOI
10.1145/3623264.3624446
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Eye movement plays an important role in face-to-face communication. In this work, we present a deep learning approach for synthesizing the eye movements of avatars for two-party conversations and evaluate viewer perception of different types of eye motions. We aim to synthesize believable gaze behavior based on head motions and audio features as they would typically be available in virtual reality applications. To this end, we captured the head motion, eye motion, and audio of several two-party conversations and trained an RNN-based model to predict where an avatar looks in a two-person conversational scenario. We evaluated our approach with a user study on the perceived quality of the eye animation and compared our method with other eye animation methods. While our model was not rated highest, our model and our user study lead to a series of insights on model features, viewer perception, and study design that we present.
Pages: 7
Related papers
50 in total
  • [1] Real-time Gaze Transition Entropy
    Ebeid, Islam Akef
    Gwizdka, Jacek
    2018 ACM SYMPOSIUM ON EYE TRACKING RESEARCH & APPLICATIONS (ETRA 2018), 2018,
  • [2] Real-Time Human Gaze Estimation
    Rowntree, Thomas
    Pontecorvo, Carmine
    Reid, Ian
    2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 98 - 104
  • [3] Perpetual Humanoid Control for Real-time Simulated Avatars
    Luo, Zhengyi
    Cao, Jinkun
    Winkler, Alexander
    Kitani, Kris
    Xu, Weipeng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10861 - 10870
  • [4] paGAN: Real-time Avatars Using Dynamic Textures
    Nagano, Koki
    Seo, Jaewoo
    Xing, Jun
    Wei, Lingyu
    Li, Zimo
    Saito, Shunsuke
    Agarwal, Aviral
    Fursund, Jens
    Li, Hao
    ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (06):
  • [5] AI-empowered Pose Reconstruction for Real-time Synthesis of Remote Metaverse Avatars
    Gu, Xingci
    Yuan, Ye
    Yang, Jianjun
    Li, Longjiang
    2024 21ST INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING, JCSSE 2024, 2024, : 86 - 93
  • [6] paGAN: Real-time Avatars Using Dynamic Textures
    Nagano, Koki
    Seo, Jaewoo
    Xing, Jun
    Wei, Lingyu
    Li, Zimo
    Saito, Shunsuke
    Agarwal, Aviral
    Fursund, Jens
    Li, Hao
    SIGGRAPH ASIA'18: SIGGRAPH ASIA 2018 TECHNICAL PAPERS, 2018,
  • [7] ConvLogRecaller: A Real-Time Conversational Lifelog Recaller
    Lee, Yuan-Chi
    Yen, An-Zi
    Huang, Hen-Hsen
    Chen, Hsin-Hsi
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2724 - 2728
  • [8] ConvLogMiner: A Real-Time Conversational Lifelog Miner
    Kao, Pei-Wei
    Yen, An-Zi
    Huang, Hen-Hsen
    Chen, Hsin-Hsi
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4992 - 4995
  • [9] Real-Time Multi-Map Saliency-Driven Gaze Behavior for Non-Conversational Characters
    Goude, Ific
    Bruckert, Alexandre
    Olivier, Anne-Helene
    Pettre, Julien
    Cozot, Remi
    Bouatouch, Kadi
    Christie, Marc
    Hoyet, Ludovic
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (07) : 3871 - 3883
  • [10] Real-Time Gaze Tracking for Public Displays
    Sippl, Andreas
    Holzmann, Clemens
    Zachhuber, Doris
    Ferscha, Alois
    AMBIENT INTELLIGENCE, 2010, 6439 : 167 - +