A Pre-training Strategy for Zero-Resource Response Selection in Knowledge-Grounded Conversations

被引:0
|
作者
Tao, Chongyang [1 ]
Chen, Changyu [2 ]
Feng, Jiazhan [1 ]
Wen, Ji-Rong [2 ,3 ]
Yan, Rui [2 ,3 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
[3] Beijing Acad Artificial Intelligence, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, many studies are emerging towards building a retrieval-based dialogue system that is able to effectively leverage background knowledge (e.g., documents) when conversing with humans. However, it is non-trivial to collect large-scale dialogues that are naturally grounded on the background documents, which hinders the effective and adequate training of knowledge selection and response matching. To overcome the challenge, we consider decomposing the training of the knowledge-grounded response selection into three tasks including: 1) query-passage matching task; 2) query-dialogue history matching task; 3) multi-turn response matching task, and joint learning all these tasks in a unified pre-trained language model. The former two tasks could help the model in knowledge selection and comprehension, while the last task is designed for matching the proper response with the given query and background knowledge (dialogue history). By this means, the model can be learned to select relevant knowledge and distinguish proper response, with the help of ad-hoc retrieval corpora and a large number of ungrounded multi-turn dialogues. Experimental results on two benchmarks of knowledge-grounded response selection indicate that our model can achieve comparable performance with several existing methods that rely on crowd-sourced data for training.
引用
收藏
页码:4446 / 4457
页数:12
相关论文
共 14 条
  • [1] Zero-Resource Knowledge-Grounded Dialogue Generation
    Li, Linxiao
    Xu, Can
    Wu, Wei
    Zhao, Yufan
    Zhao, Xueliang
    Tao, Chongyang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation
    Chen, Wenhu
    Su, Yu
    Yan, Xifeng
    Wang, William Yang
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8635 - 8648
  • [3] GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation
    Zhang, Jing
    Zhang, Xiaokang
    Zhang-Li, Daniel
    Yu, Jifan
    Yao, Zijun
    Ma, Zeyao
    Xu, Yiqi
    Wang, Haohua
    Zhang, Xiaohan
    Lin, Nianyi
    Lu, Sunrui
    Li, Juanzi
    Tang, Jie
    [J]. PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5564 - 5575
  • [4] Enhancing Knowledge Selection with Data Processing Based on Multiple Turns of Dialog in Knowledge-Grounded Open-Domain Conversations
    Joo, Eojin
    Lee, Do Kyung
    Youn, Junyoung
    Sung, Joo-Won
    Choi, Ho-Jin
    [J]. 2024 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, IEEE BIGCOMP 2024, 2024, : 365 - 366
  • [5] Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots
    Gu, Jia-Chen
    Ling, Zhen-Hua
    Liu, Quan
    Chen, Zhigang
    Zhu, Xiaodan
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1412 - 1422
  • [6] Augmenting Low-Resource Text Classification with Graph-Grounded Pre-training and Prompting
    Wen, Zhihao
    Fang, Yuan
    [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 506 - 516
  • [7] Global and local interaction matching model for knowledge-grounded response selection in retrieval-based chatbots
    Ma, Hui
    Wang, Jian
    Lin, Hongfei
    Yang, Liang
    [J]. NEUROCOMPUTING, 2022, 497 : 39 - 49
  • [8] Emotion-Aware Multimodal Pre-training for Image-Grounded Emotional Response Generation
    Tian, Zhiliang
    Wen, Zhihua
    Wu, Zhenghao
    Song, Yiping
    Tang, Jintao
    Li, Dongsheng
    Zhang, Nevin L.
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT III, 2022, : 3 - 19
  • [9] Improving Dialog System Grounded with Unstructured Knowledge by Domain Adaptive Pre-Training and Post-Ranking
    Zeng, Zhi
    Lam, Kam-Yiu
    Chow, Chi-Yin
    Li, Ning
    [J]. INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2021, 18 (06)
  • [10] Consecutive Pre-Training: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain
    Zhang, Tong
    Gao, Peng
    Dong, Hao
    Zhuang, Yin
    Wang, Guanqun
    Zhang, Wei
    Chen, He
    [J]. REMOTE SENSING, 2022, 14 (22)