Synthesizing multimodal utterances for conversational agents

Cited by: 121
Authors
Kopp, S [1]
Wachsmuth, P [1]
Affiliations
[1] Univ Bielefeld, Fac Technol, Artificial Intelligence Grp, D-33594 Bielefeld, Germany
Keywords
multimodal conversational agents; gesture animation; model-based computer animation; motion control
DOI
10.1002/cav.6
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Conversational agents are supposed to combine speech with non-verbal modalities for intelligible multimodal utterances. In this paper, we focus on the generation of gesture and speech from XML-based descriptions of their overt form. An incremental production model is presented that combines the synthesis of synchronized gestural, verbal, and facial behaviors with mechanisms for linking them in fluent utterances with natural co-articulation and transition effects. In particular, an efficient kinematic approach for animating hand gestures from shape specifications is presented, which provides fine adaptation to temporal constraints that are imposed by cross-modal synchrony. Copyright © 2004 John Wiley & Sons, Ltd.
Pages: 39-52
Page count: 14