Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

被引:0
|
作者
Qin, Libo [1 ]
Xie, Tianbao [1 ]
Huang, Shijue [1 ]
Chen, Qiguang [1 ]
Xu, Xiao [1 ]
Che, Wanxiang [1 ]
机构
[1] Harbin Inst Technol, Res Ctr Social Comp & Informat Retrieval, Harbin, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Consistency Identification has obtained remarkable success on open-domain dialogue, which can be used for preventing inconsistent response generation. However, in contrast to the rapid development in open-domain dialogue, few efforts have been made to the task-oriented dialogue direction. In this paper, we argue that consistency problem is more urgent in task-oriented domain. To facilitate the research, we introduce CI-ToD, a novel dataset for Consistency Identification in Task-oriented Dialog system. In addition, we not only annotate the single label to enable the model to judge whether the system response is contradictory, but also provide more fine-grained labels (i.e., Dialogue History Inconsistency, User Query Inconsistency and Knowledge Base Inconsistency) to encourage model to know what inconsistent sources lead to it. Empirical results show that state-of-the-art methods only achieve 51.3%, which is far behind the human performance of 93.2%, indicating that there is ample room for improving consistency identification ability. Finally, we conduct exhaustive experiments and qualitative analysis to comprehend key challenges and provide guidance for future directions. All datasets and models are publicly available at https://github.com/yizhen20133868/CI-ToD.
引用
收藏
页码:2357 / 2367
页数:11
相关论文
共 7 条
  • [1] ToD4IR: A Humanised Task-Oriented Dialogue System for Industrial Robots
    Li, Chen
    Zhang, Xiaochun
    Chrysostomou, Dimitrios
    Yang, Hongji
    IEEE ACCESS, 2022, 10 : 91631 - 91649
  • [2] DORA: Towards policy optimization for task-oriented dialogue system with efficient context
    Jeon, Hyunmin
    Lee, Gary Geunbae
    COMPUTER SPEECH AND LANGUAGE, 2022, 72
  • [3] DORA: Towards policy optimization for task-oriented dialogue system with efficient context
    Jeon, Hyunmin
    Lee, Gary Geunbae
    Computer Speech and Language, 2022, 72
  • [4] STOD: Towards Scalable Task-Oriented Dialogue System on MultiWOZ-API
    Lu, Hengtong
    Yuan, Caixia
    Wang, Xiaojie
    APPLIED SCIENCES-BASEL, 2024, 14 (12):
  • [5] KMc-ToD: Structure knowledge enhanced multi-copy network for task-oriented dialogue system
    Ding, Zeyuan
    Yang, Zhihao
    Qiao, Yinbo
    Lin, Hongfei
    KNOWLEDGE-BASED SYSTEMS, 2024, 293
  • [6] Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue
    Gupta, Raghav
    Lee, Harrison
    Zhao, Jeffrey
    Rastogi, Abhinav
    Cao, Yuan
    Wu, Yonghui
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 4541 - 4549
  • [7] Towards Complex Scenarios: Building End-to-End Task-Oriented Dialogue System across Multiple Knowledge Bases
    Qin, Libo
    Li, Zhouyang
    Yu, Qiying
    Wang, Lehan
    Che, Wanxiang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13483 - 13491