Towards an Integrated Model of Speech and Gesture Production for Multi-Modal Robot Behavior

被引:0
|
作者
Salem, Maha [1 ]
Kopp, Stefan [2 ]
Wachsmuth, Ipke [3 ]
Joublin, Frank [4 ]
机构
[1] Res Inst Cognit & Robot, Bielefeld, Germany
[2] Univ Bielefeld, Sociable Agents Grp, Bielefeld, Germany
[3] Univ Bielefeld, Artificial Intelligence Grp, Bielefeld, Germany
[4] Honda Res Inst Europe, Offenbach, Germany
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The generation of communicative, speech-accompanying robot gesture is still largely unexplored. We present an approach to enable the humanoid robot ASIMO to flexibly produce speech and co-verbal gestures at run-time, while not being limited to a pre-defined repertoire of motor actions. Since much research has already been dedicated to this challenge within the domain of virtual conversational agents, we build upon the experience gained from the development of a speech and gesture production model used for the virtual human Max. We propose a robot control architecture building upon the Articulated Communicator Engine (ACE) that was developed to allow virtual agents to flexibly realize planned multi-modal behavior representations on the spot. Our approach tightly couples ACE with ASIMO's perceptuo-motor system, combining conceptual representation and planning with motor control primitives for speech and arm movements of a physical robot body. First results of both gesture production and speech synthesis using ACE and the MARY text-to-speech system are presented and discussed.
引用
收藏
页码:614 / 619
页数:6
相关论文
共 50 条
  • [1] Multi-modal fusion of Speech-Gesture using Integrated Probability Density Distribution
    Lee, Chi-geun
    Han, Mun-sung
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL I, PROCEEDINGS, 2008, : 361 - 364
  • [2] MULTI-MODAL FUSION OF SPEECH-GESTURE USING INTEGRATED PROBABILITY DENSITY FUNCTION
    Lee, Chi-Geun
    Han, Mun-Sung
    Bae, Chang-Seok
    Kim, Jin-Tae
    [J]. BIOSIGNALS 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIO-INSPIRED SYSTEMS AND SIGNAL PROCESSING, 2009, : 211 - 215
  • [3] Multi-modal Gesture Recognition using Integrated Model of Motion, Audio and Video
    GOUTSU Yusuke
    KOBAYASHI Takaki
    OBARA Junya
    KUSAJIMA Ikuo
    TAKEICHI Kazunari
    TAKANO Wataru
    NAKAMURA Yoshihiko
    [J]. Chinese Journal of Mechanical Engineering, 2015, (04) : 657 - 665
  • [4] Multi-modal Gesture Recognition using Integrated Model of Motion, Audio and Video
    Goutsu, Yusuke
    Kobayashi, Takaki
    Obara, Junya
    Kusajima, Ikuo
    Takeichi, Kazunari
    Takano, Wataru
    Nakamura, Yoshihiko
    [J]. CHINESE JOURNAL OF MECHANICAL ENGINEERING, 2015, 28 (04) : 657 - 665
  • [5] Multi-modal gesture recognition using integrated model of motion, audio and video
    Yusuke Goutsu
    Takaki Kobayashi
    Junya Obara
    Ikuo Kusajima
    Kazunari Takeichi
    Wataru Takano
    Yoshihiko Nakamura
    [J]. Chinese Journal of Mechanical Engineering, 2015, 28 : 657 - 665
  • [6] Multi-modal Gesture Recognition using Integrated Model of Motion, Audio and Video
    GOUTSU Yusuke
    KOBAYASHI Takaki
    OBARA Junya
    KUSAJIMA Ikuo
    TAKEICHI Kazunari
    TAKANO Wataru
    NAKAMURA Yoshihiko
    [J]. Chinese Journal of Mechanical Engineering, 2015, 28 (04) : 657 - 665
  • [7] Mixed Reality Deictic Gesture for Multi-Modal Robot Communication
    Williams, Tom
    Bussing, Matthew
    Cabrol, Sebastian
    Boyle, Elizabeth
    Nhan Tran
    [J]. HRI '19: 2019 14TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2019, : 191 - 201
  • [8] Gesture desk - An integrated multi-modal gestural workplace for sonification
    Hermann, T
    Henning, T
    Ritter, H
    [J]. GESTURE-BASED COMMUNICATION IN HUMAN-COMPUTER INTERACTION, 2003, 2915 : 369 - 379
  • [9] Erratum to: Multi-modal Gesture Recognition using Integrated Model of Motion, Audio and Video
    GOUTSU Yusuke
    KOBAYASHI Takaki
    OBARA Junya
    KUSAJIMA Ikuo
    TAKEICHI Kazunari
    TAKANO Wataru
    NAKAMURA Yoshihiko
    [J]. Chinese Journal of Mechanical Engineering, 2017, 30 : 1473 - 1473
  • [10] Towards a multi-modal perceptual model
    Hollier, MP
    Voelcker, R
    [J]. BT TECHNOLOGY JOURNAL, 1997, 15 (04): : 162 - 171