Proxy Indicators for the Quality of Open-domain Dialogues

被引:0
|
作者
Nedelchev, Rostislav [1 ]
Lehmann, Jens [1 ,2 ,3 ]
Usbeck, Ricardo [1 ,4 ]
机构
[1] Univ Bonn, Smart Data Analyt Grp, Bonn, Germany
[2] Fraunhofer IAIS, St Augustin, Germany
[3] Fraunhofer IAIS, Dresden, Germany
[4] Univ Hamburg, Semant Syst Grp, Hamburg, Germany
基金
欧盟地平线“2020”;
关键词
DATASET;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The automatic evaluation of open-domain dialogues remains a largely unsolved challenge. Thus, despite the abundance of work done in the field, human judges have to evaluate dialogues' quality. As a consequence, performing such evaluations at scale is usually expensive. This work investigates using a deep-learning model trained on the General Language Understanding Evaluation (GLUE) benchmark to serve as a quality indication of open-domain dialogues. The aim is to use the various GLUE tasks as different perspectives on judging the quality of conversation, thus reducing the need for additional training data or responses that serve as quality references. Due to this nature, the method can infer various quality metrics and derive a component-based overall score. We achieve statistically significant correlation coefficients of up to 0.7.
引用
收藏
页码:7834 / 7855
页数:22
相关论文
共 50 条
  • [1] Implicit Discourse Relation Identification for Open-domain Dialogues
    Ma, Mingyu Derek
    Bowden, Kevin K.
    Wu, Jiaqi
    Cui, Wen
    Walker, Marilyn
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 666 - 672
  • [2] Automatic summarization of open-domain multiparty dialogues in diverse genres
    Zechner, K
    [J]. COMPUTATIONAL LINGUISTICS, 2002, 28 (04) : 447 - 485
  • [3] Fusing Task-Oriented and Open-Domain Dialogues in Conversational Agents
    Young, Tom
    Xing, Frank
    Pandelea, Vlad
    Ni, Jinjie
    Cambria, Erik
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11622 - 11629
  • [4] Identifying Untrustworthy Samples: Data Filtering for Open-domain Dialogues with Bayesian Optimization
    Shen, Lei
    Zhan, Haolan
    Shen, Xin
    Chen, Hongshen
    Zhao, Xiaofang
    Zhu, Xiaodan
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 1598 - 1608
  • [5] Intent-calibrated Self-training for Answer Selection in Open-domain Dialogues
    Deng, Wentao
    Pei, Jiahuan
    Ren, Zhaochun
    Chen, Zhumin
    Ren, Pengjie
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2023, 11 : 1232 - 1249
  • [6] Evaluating Open-Domain Dialogues in Latent Space with Next Sentence Prediction and Mutual Information
    Zhao, Kun
    Yang, Bohao
    Lin, Chenghua
    Rong, Wenge
    Villavicencio, Aline
    Cui, Xiaohui
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 562 - 574
  • [7] Personality prediction from task-oriented and open-domain human–machine dialogues
    Ao Guo
    Ryu Hirai
    Atsumoto Ohashi
    Yuya Chiba
    Yuiko Tsunomori
    Ryuichiro Higashinaka
    [J]. Scientific Reports, 14
  • [8] Personality prediction from task-oriented and open-domain human-machine dialogues
    Guo, Ao
    Hirai, Ryu
    Ohashi, Atsumoto
    Chiba, Yuya
    Tsunomori, Yuiko
    Higashinaka, Ryuichiro
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)
  • [9] Improving Response Quality with Backward Reasoning in Open-domain Dialogue Systems
    Li, Ziming
    Kiseleva, Julia
    de Rijke, Maarten
    [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1940 - 1944
  • [10] Advances in open-domain question answering
    Zhang, Zhi-Chang
    Zhang, Yu
    Liu, Ting
    Li, Sheng
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2009, 37 (05): : 1058 - 1069