Real-Time Conversational Gaze Synthesis for Avatars

Cited by: 3

Authors:

Canales, Ryan [1]

Jain, Eakta [2]

Joerg, Sophie [1,3]

Affiliations:

[1] Clemson Univ, Clemson, SC 29634 USA

[2] Univ Florida, Gainesville, FL USA

[3] Univ Bamberg, Bamberg, Germany

Source:

15TH ANNUAL ACM SIGGRAPH CONFERENCE ON MOTION, INTERACTION AND GAMES, MIG 2023 | 2023

Funding:

National Science Foundation (USA);

Keywords:

gaze animation; avatars; motion perception; virtual reality; EYE GAZE; MODEL;

DOI:

10.1145/3623264.3624446

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Eye movement plays an important role in face-to-face communication. In this work, we present a deep learning approach for synthesizing the eye movements of avatars for two-party conversations and evaluate viewer perception of different types of eye motions. We aim to synthesize believable gaze behavior based on head motions and audio features as they would typically be available in virtual reality applications. To this end, we captured the head motion, eye motion, and audio of several two-party conversations and trained an RNN-based model to predict where an avatar looks in a two-person conversational scenario. We evaluated our approach with a user study on the perceived quality of the eye animation and compared our method with other eye animation methods. While our model was not rated highest, our model and our user study lead to a series of insights on model features, viewer perception, and study design that we present.

Pages: 7
