Synthesizing multimodal utterances for conversational agents

Cited by: 121

Authors:

Kopp, S [1]

Wachsmuth, I [1]

Affiliations:

[1] Univ Bielefeld, Fac Technol, Artificial Intelligence Grp, D-33594 Bielefeld, Germany

Source:

COMPUTER ANIMATION AND VIRTUAL WORLDS | 2004 / Vol. 15 / Issue 01

Keywords:

multimodal conversational agents; gesture animation; model-based computer animation; motion control

DOI:

10.1002/cav.6

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Subject classification codes:

081202 ; 0835 ;

Abstract:

Conversational agents are supposed to combine speech with non-verbal modalities for intelligible multimodal utterances. In this paper, we focus on the generation of gesture and speech from XML-based descriptions of their overt form. An incremental production model is presented that combines the synthesis of synchronized gestural, verbal, and facial behaviors with mechanisms for linking them in fluent utterances with natural co-articulation and transition effects. In particular, an efficient kinematic approach for animating hand gestures from shape specifications is presented, which provides fine adaptation to temporal constraints that are imposed by cross-modal synchrony. Copyright (C) 2004 John Wiley & Sons, Ltd.

Pages: 39-52

Page count: 14

50 records in total

[41] Multimodal Conversational Interaction with a Humanoid Robot
Csapo, Adam
Gilmartin, Emer
Grizou, Jonathan
Han, JingGuang
Meena, Raveesh
Anastasiou, Dimitra
Jokinen, Kristiina
Wilcock, Graham
3RD IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM 2012), 2012, : 666 - 671
[42] A Dependency-Aware Utterances Permutation Strategy to Improve Conversational Evaluation
Faggioli, Guglielmo
Ferrante, Marco
Ferro, Nicola
Perego, Raffaele
Tonellotto, Nicola
ADVANCES IN INFORMATION RETRIEVAL, PT I, 2022, 13185 : 184 - 198
[43] Grammatical characteristics of children's conversational utterances that contain disfluency clusters
Logan, KJ
LaSalle, LR
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 1999, 42 (01) : 80 - 91
[44] Analysing Utterances in LLM-Based User Simulation for Conversational Search
Sekulic, Ivan
Aliannejadi, Mohammad
Crestani, Fabio
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2024, 15 (03)
[45] Multimodal output for a conversational telephony system
Mast, M
Günther, C
Kunzmann, S
Ross, T
2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 293 - 296
[46] Conversational Forensics Building conversational pedagogical agents with attitude
Fakinlede, Ireti
Kumar, Vive
Wen, Dunwei
Kinshuk
2013 IEEE FIFTH INTERNATIONAL CONFERENCE ON TECHNOLOGY FOR EDUCATION (T4E 2013), 2013, : 65 - 68
[47] Prediction of Various Backchannel Utterances Based on Multimodal Information
Onishi, Toshiki
Azuma, Naoki
Kinoshita, Shunichi
Ishii, Ryo
Fukayama, Atsushi
Nakamura, Takao
Miyata, Akihiro
PROCEEDINGS OF THE 23RD ACM INTERNATIONAL CONFERENCE ON INTELLIGENT VIRTUAL AGENTS, IVA 2023, 2023,
[48] Conversational AI: Dialogue Systems, Conversational Agents, and Chatbots
Seminck, Olga
COMPUTATIONAL LINGUISTICS, 2023, 49 (01) : 257 - 259
[49] Conversational psychological agents. A study of rational and psychological behaviors of conversational assistant agents
Bouchet F.
Sansonnet J.-P.
Revue d'Intelligence Artificielle, 2011, 25 (05) : 591 - 623
[50] Exploring persuasive potential of embodied conversational agents utilizing synthetic embodied conversational agents
Shearer, John
Olivier, Patrick
De Boni, Marco
Hurling, Robert
PERSUASIVE TECHNOLOGY, 2007, 4744 : 210 - 213
