A robot that can engage in both task-oriented and non-task-oriented dialogues

被引:6
|
作者
Nakano, Mikio [1 ]
Hoshino, Atsushi [2 ]
Takeuchi, Johane [1 ]
Hasegawa, Yuji [1 ]
Torii, Toyotaka [1 ]
Nakadai, Kazuhiro [1 ]
Kato, Kazuhiko [2 ]
Tsujino, Hiroshi [1 ]
机构
[1] Honda Res Inst Japan Co Ltd, 8-1 Honcho, Wako, Saitama 3510188, Japan
[2] Univ Tsukuba, Inst Informat Sci & Elect, Oho, Ibaraki 305, Japan
关键词
D O I
10.1109/ICHR.2006.321304
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a new type of conversational humanoid robot, which can engage in both task-oriented dialogues and non-task-oriented dialogues. Most previously built conversational robots can engage in either task-oriented dialogues for accurately understanding human requests or non-task-oriented dialogues to allow humans to enjoy conversations. Since both are beneficial functionalities for a humanoid robot as a human partner, it is desirable for one humanoid robot to be able to engage in both types of dialogues. Our model is based on a multiexpert model, which features control modules called experts each or which is specialized to perform certain kinds of tasks through performing physical actions and engaging in dialogues. One of the experts takes charge in understanding human utterances and deciding robot utterances or actions. Non-task-oriented dialogue functionality is incorporated into this model by building an expert called the chat expert which is dedicated to non-task-oriented dialogues. The chat expert utilizes the outputs of a large-vocabulary speech recognizer, while other task-oriented experts utilize the outputs of a small-vocabulary speech recognizer. By selecting an appropriate expert according to the speech recognition result and dialogue context, we can alleviate degradation in speech recognition accuracy in spite of incorporating a large-vocabulary speech recognizer. The chat expert is dealt with as the default expert that has the responsibility to reply to human utterances. If a human utterance is considered to be a request for a task with a high plausibility, the expert for understanding the request is selected. The implemented system, which has been combined with Honda ASIMO, demonstrates that it can dynamically change dialogue strategy based on speech recognition results.
引用
收藏
页码:404 / +
页数:3
相关论文
共 50 条
  • [11] Contextual Semantic Parsing for Multilingual Task-Oriented Dialogues
    Moradshahi, Mehrad
    Tsai, Victoria
    Campagna, Giovanni
    Lam, Monica S.
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 902 - 915
  • [12] Flexibly-Structured Model for Task-Oriented Dialogues
    Shu, Lei
    Molino, Piero
    Namazifar, Mahdi
    Xu, Hu
    Liu, Bing
    Zheng, Huaixiu
    Tur, Gokhan
    20TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2019), 2019, : 178 - 187
  • [13] PentoRef: A Corpus of Spoken References in Task-oriented Dialogues
    Zarriess, Sina
    Hough, Julian
    Kennington, Casey
    Manuvinakurike, Ramesh
    DeVault, David
    Fernandez, Raquel
    Schlangen, David
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 125 - 131
  • [14] Interactive teaching of task-oriented robot grasps
    Aleotti, Jacopo
    Caselli, Stefano
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2010, 58 (05) : 539 - 550
  • [15] Task-oriented Internet robot system architecture
    Zhou, Wei
    Su, Jianbo
    Gaojishu Tongxin/Chinese High Technology Letters, 2005, 15 (07): : 29 - 34
  • [16] Research on Task-Oriented Robot Action Generalization
    Xiao J.
    Yuan H.
    Liu H.
    Zhao W.
    Li X.
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2024, 6 (575-587): : 575 - 587
  • [17] Towards Universal Dialogue Act Tagging for Task-Oriented Dialogues
    Paul, Shachi
    Goel, Rahul
    Hakkani-Tur, Dilek
    INTERSPEECH 2019, 2019, : 1453 - 1457
  • [18] Modeling Multi-Action Policy for Task-Oriented Dialogues
    Shu, Lei
    Xu, Hu
    Liu, Bing
    Molino, Piero
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1304 - 1310
  • [19] Cooperative and Uncooperative Behaviour in Task-oriented Dialogues with Social Robots
    Wilcock, Graham
    Jokinen, Kristiina
    2022 31ST IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (IEEE RO-MAN 2022), 2022, : 763 - 768
  • [20] TOD-Flow: Modeling the Structure of Task-Oriented Dialogues
    Sohn, Sungryull
    Lyu, Yiwei
    Liu, Anthony Zhe
    Logeswaran, Lajanugen
    Kim, Dong-Ki
    Shim, Dongsub
    Lee, Honglak
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 3355 - 3371