Intent-calibrated Self-training for Answer Selection in Open-domain Dialogues

被引:0
|
作者
Deng, Wentao [1 ]
Pei, Jiahuan [2 ]
Ren, Zhaochun [1 ]
Chen, Zhumin [1 ]
Ren, Pengjie [1 ]
机构
[1] Shandong Univ, Qingdao, Peoples R China
[2] Ctr Wiskunde & Informat, Amsterdam, Netherlands
基金
国家重点研发计划;
关键词
Compendex;
D O I
10.1162/tacl_a_00599
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Answer selection in open-domain dialogues aims to select an accurate answer from candidates. The recent success of answer selection models hinges on training with large amounts of labeled data. However, collecting large-scale labeled data is labor-intensive and time-consuming. In this paper, we introduce the predicted intent labels to calibrate answer labels in a self-training paradigm. Specifically, we propose intent-calibrated self-training (ICAST) to improve the quality of pseudo answer labels through the intent-calibrated answer selection paradigm, in which we employ pseudo intent labels to help improve pseudo answer labels. We carry out extensive experiments on two benchmark datasets with open-domain dialogues. The experimental results show that ICAST outperforms baselines consistently with 1%, 5%, and 10% labeled data. Specifically, it improves 2.06% and 1.00% of F1 score on the two datasets, compared with the strongest baseline with only 5% labeled data.
引用
收藏
页码:1232 / 1249
页数:18
相关论文
共 50 条
  • [1] Proxy Indicators for the Quality of Open-domain Dialogues
    Nedelchev, Rostislav
    Lehmann, Jens
    Usbeck, Ricardo
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7834 - 7855
  • [2] Reading Wikipedia to Answer Open-Domain Questions
    Chen, Danqi
    Fisch, Adam
    Weston, Jason
    Bordes, Antoine
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1870 - 1879
  • [3] Implicit Discourse Relation Identification for Open-domain Dialogues
    Ma, Mingyu Derek
    Bowden, Kevin K.
    Wu, Jiaqi
    Cui, Wen
    Walker, Marilyn
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 666 - 672
  • [4] Automatic summarization of open-domain multiparty dialogues in diverse genres
    Zechner, K
    [J]. COMPUTATIONAL LINGUISTICS, 2002, 28 (04) : 447 - 485
  • [5] Domain Adaptive Action Recognition with Integrated Self-training and Feature Selection
    Suzuki, Takafumi
    Kato, Jien
    Wang, Yu
    Mase, Kenji
    [J]. 2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 105 - 109
  • [6] Fusing Task-Oriented and Open-Domain Dialogues in Conversational Agents
    Young, Tom
    Xing, Frank
    Pandelea, Vlad
    Ni, Jinjie
    Cambria, Erik
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11622 - 11629
  • [7] Cycle Self-Training for Domain Adaptation
    Liu, Hong
    Wang, Jianmin
    Long, Mingsheng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [8] Self-Training with Selection-by-Rejection
    Zhou, Yan
    Kantarcioglu, Murat
    Thuraisingham, Bhavani
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 795 - 803
  • [9] What's in a Name? Answer Equivalence For Open-Domain Question Answering
    Si, Chenglei
    Zhao, Chen
    Boyd-Graber, Jordan
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 9623 - 9629
  • [10] Identifying Untrustworthy Samples: Data Filtering for Open-domain Dialogues with Bayesian Optimization
    Shen, Lei
    Zhan, Haolan
    Shen, Xin
    Chen, Hongshen
    Zhao, Xiaofang
    Zhu, Xiaodan
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 1598 - 1608