Evaluation of Real-Time Deep Learning Turn-Taking Models for Multiple Dialogue Scenarios

被引:22
|
作者
Lala, Divesh [1 ]
Inoue, Koji [1 ]
Kawahara, Tatsuya [1 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
关键词
dialogue systems; turn-taking; evaluation methods; deep learning; neural networks;
D O I
10.1145/3242969.3242994
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The task of identifying when to take a conversational turn is an important function of spoken dialogue systems. The turn-taking system should also ideally be able to handle many types of dialogue, from structured conversation to spontaneous and unstructured discourse. Our goal is to determine how much a generalized model trained on many types of dialogue scenarios would improve on a model trained only for a specific scenario. To achieve this goal we created a large corpus of Wizard-of-Oz conversation data which consisted of several different types of dialogue sessions, and then compared a generalized model with scenario-specific models. For our evaluation we go further than simply reporting conventional metrics, which we show are not informative enough to evaluate turn-taking in a real-time system. Instead, we process results using a performance curve of latency and false cut-in rate, and further improve our model's real-time performance using a finite-state turn-taking machine. Our results show that the generalized model greatly outperformed the individual model for attentive listening scenarios but was worse in job interview scenarios. This implies that a model based on a large corpus is better suited to conversation which is more user-initiated and unstructured. We also propose that our method of evaluation leads to more informative performance metrics in a real-time system.
引用
收藏
页码:78 / 86
页数:9
相关论文
共 50 条
  • [1] Real-Time Multimodal Turn-taking Prediction to Enhance Cooperative Dialogue during Human-Agent Interaction
    Bae, Young-Ho
    Bennett, Casey C.
    [J]. 2023 32ND IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, RO-MAN, 2023, : 2037 - 2044
  • [2] Real-Time Changes to Social Dynamics in Human-Robot Turn-Taking
    Smith, Justin S.
    Chao, Crystal
    Thomaz, Andrea L.
    [J]. 2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 3024 - 3029
  • [3] Incremental Learning and Forgetting in Stochastic Turn-Taking Models
    Laskowski, Kornel
    Edlund, Jens
    Heldner, Mattias
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2080 - 2083
  • [4] Encouragement of Turn-Taking by Real-Time Feedback Impacts Creative Idea Generation in Dyads
    Hosseini, Sarinasadat
    Deng, Xiaoqi
    Miyake, Yoshihiro
    Nozawa, Takayuki
    [J]. IEEE ACCESS, 2021, 9 : 57976 - 57988
  • [5] Turn-Taking Strategies for Human-Robot Peer-Learning Dialogue
    Das, Ranjini
    Pon-Barry, Heather
    [J]. 19TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2018), 2018, : 119 - 129
  • [6] A methodology for turn-taking capabilities enhancement in Spoken Dialogue Systems using Reinforcement Learning
    Khouzaimi, Hatim
    Laroche, Romain
    Lefevre, Fabrice
    [J]. COMPUTER SPEECH AND LANGUAGE, 2018, 47 : 93 - 111
  • [7] GATED MULTIMODAL FUSION WITH CONTRASTIVE LEARNING FOR TURN-TAKING PREDICTION IN HUMAN-ROBOT DIALOGUE
    Yang, Jiudong
    Wang, Peiying
    Zhu, Yi
    Feng, Mingchao
    Chen, Meng
    He, Xiaodong
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7747 - 7751
  • [8] Design and Evaluation of Deep Learning Models for Real-Time Credibility Assessment in Twitter
    Kaufhold, Marc-Andre
    Bayer, Markus
    Hartung, Daniel
    Reuter, Christian
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 396 - 408
  • [9] Dialogue Efficiency Evaluation of Turn-Taking Phenomena in a Multi-layer Incremental Simulated Environment
    Khouzaimi, Hatim
    Laroche, Romain
    Lefevre, Fabrice
    [J]. HCI INTERNATIONAL 2015 - POSTERS' EXTENDED ABSTRACTS, PT I, 2015, 528 : 753 - 758
  • [10] Comparing how students collaborate to learn about the self and relationships in a real-time non-turn-taking online and turn-taking face-to-face environment
    Lobel, M
    Neubauer, M
    Swedburg, R
    [J]. JOURNAL OF COMPUTER-MEDIATED COMMUNICATION, 2005, 10 (04):