Evaluation of Real-Time Deep Learning Turn-Taking Models for Multiple Dialogue Scenarios

被引:22
|
作者
Lala, Divesh [1 ]
Inoue, Koji [1 ]
Kawahara, Tatsuya [1 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
关键词
dialogue systems; turn-taking; evaluation methods; deep learning; neural networks;
D O I
10.1145/3242969.3242994
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The task of identifying when to take a conversational turn is an important function of spoken dialogue systems. The turn-taking system should also ideally be able to handle many types of dialogue, from structured conversation to spontaneous and unstructured discourse. Our goal is to determine how much a generalized model trained on many types of dialogue scenarios would improve on a model trained only for a specific scenario. To achieve this goal we created a large corpus of Wizard-of-Oz conversation data which consisted of several different types of dialogue sessions, and then compared a generalized model with scenario-specific models. For our evaluation we go further than simply reporting conventional metrics, which we show are not informative enough to evaluate turn-taking in a real-time system. Instead, we process results using a performance curve of latency and false cut-in rate, and further improve our model's real-time performance using a finite-state turn-taking machine. Our results show that the generalized model greatly outperformed the individual model for attentive listening scenarios but was worse in job interview scenarios. This implies that a model based on a large corpus is better suited to conversation which is more user-initiated and unstructured. We also propose that our method of evaluation leads to more informative performance metrics in a real-time system.
引用
收藏
页码:78 / 86
页数:9
相关论文
共 50 条
  • [41] Real-Time Guitar Amplifier Emulation with Deep Learning
    Wright, Alec
    Damskagg, Eero-Pekka
    Juvela, Lauri
    Valimaki, Vesa
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (03):
  • [42] Real-time Yield Estimation based on Deep Learning
    Rahnemoonfar, Maryam
    Sheppard, Clay
    [J]. AUTONOMOUS AIR AND GROUND SENSING SYSTEMS FOR AGRICULTURAL OPTIMIZATION AND PHENOTYPING II, 2017, 10218
  • [43] Unsupervised Deep Representation Learning for Real-Time Tracking
    Ning Wang
    Wengang Zhou
    Yibing Song
    Chao Ma
    Wei Liu
    Houqiang Li
    [J]. International Journal of Computer Vision, 2021, 129 : 400 - 418
  • [44] Real-time Facemask Recognition Using Deep Learning
    Sasikumar, R.
    Shanmugaraja, P.
    Kailash, K.
    Reddy, M. Prudhvi Charan
    Jagadeesh, S. Nikhil
    [J]. REVISTA GEINTEC-GESTAO INOVACAO E TECNOLOGIAS, 2021, 11 (02): : 2079 - 2085
  • [45] Deep learning for real-time image steganalysis: a survey
    Feng Ruan
    Xing Zhang
    Dawei Zhu
    Zhanyang Xu
    Shaohua Wan
    Lianyong Qi
    [J]. Journal of Real-Time Image Processing, 2020, 17 : 149 - 160
  • [46] Robust Real-Time Traffic Surveillance with Deep Learning
    Fernandez, Jessica
    Canas, Jose M.
    Fernandez, Vanessa
    Paniego, Sergio
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [47] A Conceptual Deep Learning Model for Real-Time Routing
    Ikidid, Abdelouafi
    El Fazziki, Abdelaziz
    Sadgal, Mohammed
    El Ghazouani, Mohamed
    Ichahane, My Youssef
    [J]. 2022 16TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS, SITIS, 2022, : 453 - 456
  • [48] Deep Learning for Real-Time Neural Decoding of Grasp
    Viviani, Paolo
    Gesmundo, Ilaria
    Ghinato, Elios
    Agudelo-Toro, Andres
    Vercellino, Chiara
    Vitali, Giacomo
    Bergamasco, Letizia
    Scionti, Alberto
    Ghislieri, Marco
    Agostini, Valentina
    Terzo, Olivier
    Scherberger, Hansjoerg
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 379 - 393
  • [49] Deep Bilateral Learning for Real-Time Image Enhancement
    Gharbi, Michael
    Chen, Jiawen
    Barron, Jonathan T.
    Hasinoff, Samuel W.
    Durand, Fredo
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04):
  • [50] Real-Time Lane Detection Based on Deep Learning
    Baek, Sun-Woo
    Kim, Myeong-Jun
    Suddamalla, Upendra
    Wong, Anthony
    Lee, Bang-Hyon
    Kim, Jung-Ha
    [J]. JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2022, 17 (01) : 655 - 664