"HOW ROBUST R U?": EVALUATING TASK-ORIENTED DIALOGUE SYSTEMS ON SPOKEN CONVERSATIONS

Cited by: 3
Authors
Kim, Seokhwan [1]
Liu, Yang [1]
Jin, Di [1]
Papangelis, Alexandros [1]
Gopalakrishnan, Karthik [1]
Hedayatnia, Behnam [1]
Hakkani-Tur, Dilek [1]
Affiliations
[1] Amazon Alexa AI, Sunnyvale, CA 94089 USA
Keywords
spoken dialogue systems; dialogue state tracking; knowledge-grounded dialogue generation; NETWORKS;
DOI
10.1109/ASRU51503.2021.9688274
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Most prior work in dialogue modeling has focused on written conversations, largely because of the available data sets. However, written dialogues are not sufficient to fully capture the nature of spoken conversations or the speech recognition errors that arise in practical spoken dialogue systems. This work presents a new benchmark on spoken task-oriented conversations, intended for studying multi-domain dialogue state tracking and knowledge-grounded dialogue modeling. We report that existing state-of-the-art models trained on written conversations do not perform well on our spoken data, as expected. Furthermore, we observe improvements in task performance when leveraging n-best speech recognition hypotheses, for example by combining predictions based on individual hypotheses. Our data set enables speech-based benchmarking of task-oriented dialogue systems.
Pages: 1147-1154
Page count: 8