GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation

被引:0
|
作者
Zhang, Jing [1 ]
Zhang, Xiaokang [1 ]
Zhang-Li, Daniel [2 ]
Yu, Jifan [2 ]
Yao, Zijun [2 ]
Ma, Zeyao [3 ]
Xu, Yiqi [1 ]
Wang, Haohua [2 ]
Zhang, Xiaohan [4 ]
Lin, Nianyi [2 ]
Lu, Sunrui [2 ]
Li, Juanzi [2 ]
Tang, Jie [2 ]
机构
[1] Renmin Univ China, Beijing, Peoples R China
[2] Tsinghua Univ, Beijing, Peoples R China
[3] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[4] ZHIPU AI, Beijing, Peoples R China
关键词
Dialogue System; Dialogue Evaluation; Large Language Model;
D O I
10.1145/3580305.3599832
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present GLM-Dialog, a large-scale language model (LLM) with 10B parameters capable of knowledge-grounded conversation in Chinese using a search engine to access the Internet knowledge. GLM-Dialog offers a series of applicable techniques for exploiting various external knowledge including both helpful and noisy knowledge, enabling the creation of robust knowledge-grounded dialogue LLMs with limited proper datasets. To evaluate the GLM-Dialog more fairly, we also propose a novel evaluation method to allow humans to converse with multiple deployed bots simultaneously and compare their performance implicitly instead of explicitly rating using multidimensional metrics. Comprehensive evaluations from automatic to human perspective demonstrate the advantages of GLM-Dialog comparing with existing open source Chinese dialogue models. We release both the model checkpoint and source code, and also deploy it as a WeChat application to interact with users(1). We offer our evaluation platform online(2) in an effort to prompt the development of open source models and reliable dialogue evaluation systems. All the source code is available on Github(3).
引用
收藏
页码:5564 / 5575
页数:12
相关论文
共 34 条
  • [21] Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation
    He, Wanwei
    Dai, Yinpei
    Yang, Min
    Sun, Jian
    Huang, Fei
    Si, Luo
    Li, Yongbin
    [J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 187 - 200
  • [22] KC2UM: Knowledge-Conversation Cyclic Utilization Mechanism for Knowledge-Grounded Dialogue Generation
    Sun, Yajing
    Hu, Yue
    Xing, Luxi
    Peng, Wei
    Xie, Yuqiang
    Zhang, Xingsheng
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [23] EM Pre-training for Multi-party Dialogue Response Generation
    Li, Yiyang
    Zhao, Hai
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 92 - 103
  • [24] A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue Systems
    Kim, San
    Jang, Jin Yea
    Jung, Minyoung
    Shin, Saim
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 352 - 365
  • [25] Knowledge Grounded Pre-Trained Model For Dialogue Response Generation
    Wang, Yanmeng
    Rong, Wenge
    Zhang, Jianfei
    Ouyang, Yuanxin
    Xiong, Zhang
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [26] PRAL: A Tailored Pre-Training Model for Task-Oriented Dialog Generation
    Gu, Jing
    Wu, Qingyang
    Wu, Chongruo
    Shi, Weiyan
    Yu, Zhou
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 305 - 313
  • [27] KnowPrefix-Tuning: A Two-Stage Prefix-Tuning Framework for Knowledge-Grounded Dialogue Generation
    Bai, Jiaqi
    Yan, Zhao
    Yang, Ze
    Yang, Jian
    Liang, Xinnian
    Guo, Hongcheng
    Li, Zhoujun
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 525 - 542
  • [28] A Pre-Training Based Personalized Dialogue Generation Model with Persona-Sparse Data
    Zheng, Yinhe
    Zhang, Rongsheng
    Mao, Xiaoxi
    Huang, Minlie
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9693 - 9700
  • [29] Emotion-Aware Multimodal Pre-training for Image-Grounded Emotional Response Generation
    Tian, Zhiliang
    Wen, Zhihua
    Wu, Zhenghao
    Song, Yiping
    Tang, Jintao
    Li, Dongsheng
    Zhang, Nevin L.
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT III, 2022, : 3 - 19
  • [30] Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training
    Agarwal, Oshin
    Ge, Heming
    Shakeri, Siamak
    Al-Rfou, Rami
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 3554 - 3565