"HOW ROBUST R U?": EVALUATING TASK-ORIENTED DIALOGUE SYSTEMS ON SPOKEN CONVERSATIONS

Cited by: 3
Authors
Kim, Seokhwan [1]
Liu, Yang [1]
Jin, Di [1]
Papangelis, Alexandros [1]
Gopalakrishnan, Karthik [1]
Hedayatnia, Behnam [1]
Hakkani-Tur, Dilek [1]
Affiliations
[1] Amazon Alexa AI, Sunnyvale, CA 94089 USA
Keywords
spoken dialogue systems; dialogue state tracking; knowledge-grounded dialogue generation; NETWORKS;
DOI
10.1109/ASRU51503.2021.9688274
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Most prior work in dialogue modeling has focused on written conversations, largely because of the data sets available. However, written dialogues do not fully capture the nature of spoken conversations, nor the speech recognition errors that arise in practical spoken dialogue systems. This work presents a new benchmark on spoken task-oriented conversations, intended for studying multi-domain dialogue state tracking and knowledge-grounded dialogue modeling. We report that, as expected, existing state-of-the-art models trained on written conversations do not perform well on our spoken data. Furthermore, we observe improvements in task performance when leveraging n-best speech recognition hypotheses, for example by combining predictions based on individual hypotheses. Our data set enables speech-based benchmarking of task-oriented dialogue systems.
Pages: 1147-1154
Page count: 8
Related papers
50 in total
  • [31] Initiative conflicts in task-oriented dialogue
    Yang, Fan
    Heeman, Peter A.
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (02): : 175 - 189
  • [32] Estimating Uncertainty in Task-Oriented Dialogue
    Kontogiorgos, Dimosthenis
    Pereira, Andre
    Gustafson, Joakim
    ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 414 - 418
  • [33] Incremental Learning from Scratch for Task-Oriented Dialogue Systems
    Wang, Weikang
    Zhang, Jiajun
    Li, Qian
    Hwang, Mei-Yuh
    Zong, Chengqing
    Li, Zhifei
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3710 - 3720
  • [34] A New Task for Predicting Emotions and Dialogue Strategies in Task-Oriented Dialogue
    Vanel, Lorraine
    Yacoubi, Alya
    Clavel, Chloe
    2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, ACII, 2023,
  • [35] Emotion detection in task-oriented spoken dialogs
    Devillers, L
    Lamel, L
    Vasilescu, I
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 549 - 552
  • [36] TaskDiff: A Similarity Metric for Task-Oriented Conversations
    Bhaumik, Ankita
    Venkateswaran, Praveen
    Rizk, Yara
    Isahagian, Vatche
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 16234 - 16240
  • [37] Recent Neural Methods on Dialogue State Tracking for Task-Oriented Dialogue Systems: A Survey
    Balaraman, Vevake
    Sheikhalishahi, Seyedmostafa
    Magnini, Bernardo
    SIGDIAL 2021: 22ND ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2021), 2021, : 239 - 251
  • [38] DiactTOD: Learning Generalizable Latent Dialogue Acts for Controllable Task-Oriented Dialogue Systems
    Wu, Qingyang
    Gung, James
    Shu, Raphael
    Zhang, Yi
    24TH MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE, SIGDIAL 2023, 2023, : 255 - 267
  • [39] Are Current Task-Oriented Dialogue Systems Able to Satisfy Impolite Users?
    Hu, Zhiqiang
    Chen, Nancy F.
    Lee, Roy Ka-Wei
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2025,
  • [40] Dual-Feedback Knowledge Retrieval for Task-Oriented Dialogue Systems
    Shi, Tianyuan
    Li, Liangzhi
    Lin, Zijian
    Yang, Tao
    Quan, Xiaojun
    Wang, Qifan
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 6566 - 6580