"HOW ROBUST R U?": EVALUATING TASK-ORIENTED DIALOGUE SYSTEMS ON SPOKEN CONVERSATIONS

被引：3

作者：

Kim, Seokhwan ^{[1
]}

Liu, Yang ^{[1
]}

Fin, Di ^{[1
]}

Papangelis, Alexandros ^{[1
]}

Gopalakrishnan, Karthik ^{[1
]}

Hedayatnia, Behnam ^{[1
]}

Hakkani-Tur, Dilek ^{[1
]}

机构：

[1] Amazon Alexa AI, Sunnyvale, CA 94089 USA

来源：

2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU) | 2021年

关键词：

spoken dialogue systems; dialogue state tracking; knowledge-grounded dialogue generation; NETWORKS;

D O I：

10.1109/ASRU51503.2021.9688274

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most prior work in dialogue modeling has been on written conversations mostly because of existing data sets. However, written dialogues are not sufficient to fully capture the nature of spoken conversations as well as the potential speech recognition errors in practical spoken dialogue systems. This work presents a new benchmark on spoken task-oriented conversations, which is intended to study multi-domain dialogue state tracking and knowledge-grounded dialogue modeling. We report that the existing state-of-the-art models trained on written conversations are not performing well on our spoken data, as expected. Furthermore, we observe improvements in task performances when leveraging n-best speech recognition hypotheses such as by combining predictions based on individual hypotheses. Our data set enables speech-based bench-marking of task-oriented dialogue systems.

引用

页码：1147 / 1154

页数：8

共 50 条

[31] Initiative conflicts in task-oriented dialogue
Yang, Fan
Heeman, Peter A.
COMPUTER SPEECH AND LANGUAGE, 2010, 24 (02): : 175 - 189
[32] Estimating Uncertainty in Task-Oriented Dialogue
Kontogiorgos, Dimosthenis
Pereira, Andre
Gustafson, Joakim
ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 414 - 418
[33] Incremental Learning from Scratch for Task-Oriented Dialogue Systems
Wang, Weikang
Zhang, Jiajun
Li, Qian
Hwang, Mei-Yuh
Zong, Chengqing
Li, Zhifei
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3710 - 3720
[34] A New Task for Predicting Emotions and Dialogue Strategies in Task-Oriented Dialogue
Vanel, Lorraine
Yacoubi, Alya
Clavel, Chloe
2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, ACII, 2023,
[35] Emotion detection in task-oriented spoken dialogs
Devillers, L
Lamel, L
Vasilescu, I
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 549 - 552
[36] TaskDiff: A Similarity Metric for Task-Oriented Conversations
Bhaumik, Ankita
Venkateswaran, Praveen
Rizk, Yara
Isahagian, Vatche
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 16234 - 16240
[37] Recent Neural Methods on Dialogue State Tracking for Task-Oriented Dialogue Systems: A Survey
Balaraman, Vevake
Sheikhalishahi, Seyedmostafa
Magnini, Bernardo
SIGDIAL 2021: 22ND ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2021), 2021, : 239 - 251
[38] DiactTOD: Learning Generalizable Latent Dialogue Acts for Controllable Task-Oriented Dialogue Systems
Wu, Qingyang
Gung, James
Shu, Raphael
Zhang, Yi
24TH MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE, SIGDIAL 2023, 2023, : 255 - 267
[39] Are Current Task-Oriented Dialogue Systems Able to Satisfy Impolite Users?
Hu, Zhiqiang
Chen, Nancy F.
Lee, Roy Ka-Wei
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2025,
[40] Dual-Feedback Knowledge Retrieval for Task-Oriented Dialogue Systems
Shi, Tianyuan
Li, Liangzhi
Lin, Zijian
Yang, Tao
Quan, Xiaojun
Wang, Qifan
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 6566 - 6580

← 1 2 3 4 5 →