Adaboost with Auto-Evaluation for Conversational Models

被引:0
|
作者
Li, Juncen [1 ]
Lu, Ping [2 ,3 ]
Zhou, Ganbin [2 ,3 ]
Lin, Fen [1 ]
Niu, Cheng [1 ]
机构
[1] Tencent, WeChat Search Applicat Dept, Shenzhen, Peoples R China
[2] Chinese Acad Sci, Key Lab Intelligent Informat Proc, CAS, Inst Comp Technol, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a boosting method for conversational models to generate more human-like dialogs. In our method, we consider the existing conversational models as weak generators and apply the Adaboost to update those models. However, conventional Adaboost cannot be directly applied on conversational models, since conventional Adaboost cannot adaptively adjust the weight on the instance for subsequent learning. This results from the conventional methods based on the simple comparison between the true output y (to an input x) and its corresponding predicted output y', cannot effectively evaluate the learning performance on x. To address this issue, we develop the Adaboost with Auto-Evaluation (called AwE). In AwE, an auto-evaluator is proposed to evaluate the predicted results, which makes Adaboost applicable to conversational models. Furthermore, we present the theoretical analysis that the training error drops exponentially fast only if certain assumption over the proposed auto-evaluator holds. Finally, we empirically show that AwE visibly boosts the performance of existing single conversational models and also outperforms the other ensemble methods for conversational models.
引用
收藏
页码:4173 / 4179
页数:7
相关论文
共 50 条
  • [21] STUDY OF AN AUTO-EVALUATION SCALE OF DEPRESSION - EXISTENCE SCALE OF 2 FRANCOPHONIC POPULATIONS, BELGIUM AND SWITZERLAND
    HEIMANN, H
    BOBONSCH.H
    SCHMOCKER, AM
    BOBON, DP
    [J]. JOURNAL DE PHARMACOLOGIE, 1974, 5 : 41 - 41
  • [22] Auto-Evaluation of Motion Imitation in a Child-Robot Imitation Game for Upper Arm Rehabilitation
    Guneysu, Arzu
    Siyli, Recep Doga
    Salah, Albert Ali
    [J]. 2014 23RD IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (IEEE RO-MAN), 2014, : 199 - 204
  • [23] A Monte Carlo method for the auto-evaluation of the uncertainties in the analog-to-digital conversion-based measurements
    Nuccio, S
    Spataro, C
    [J]. COMPEL-THE INTERNATIONAL JOURNAL FOR COMPUTATION AND MATHEMATICS IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2004, 23 (01) : 148 - 158
  • [24] Initial evaluation of hidden dynamic models on conversational speech
    Picone, J
    Pike, S
    Regan, R
    Kamm, T
    Bridle, J
    Deng, L
    Ma, Z
    Richards, H
    Schuster, M
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 109 - 112
  • [25] Empirical Auto-Evaluation of Python']Python Code for Performance Analysis of Transformer Network Using T5 Architecture
    Ganguli, Isha
    Bhowmick, Rajat Subhra
    Biswas, Shivam
    Sil, Jaya
    [J]. 2021 8TH INTERNATIONAL CONFERENCE ON SMART COMPUTING AND COMMUNICATIONS (ICSCC), 2021, : 75 - 79
  • [26] The importance of patients' body awareness, auto-evaluation, empowerment, and of allowing self-help when things go wrong
    Fondanesche, C.
    [J]. HAEMOPHILIA, 2012, 18 : 112 - 112
  • [27] Auto-Evaluation Model for the Prediction of Building Energy Consumption That Combines Modified Kalman Filtering and Long Short-Term Memory
    Yang, Fan
    Mao, Qian
    [J]. SUSTAINABILITY, 2023, 15 (22)
  • [28] Specification and Evaluation of a Spanish Conversational System Using Dialogue Models
    Meza, Ivan V.
    Salinas, Lisset
    Venegas, Esther
    Castellanos, Hayde
    Chavarria, Alejandra
    Pineda, Luis A.
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2010, 2010, 6433 : 346 - 355
  • [29] CHBias: Bias Evaluation and Mitigation of Chinese Conversational Language Models
    Zhao, Jiaxu
    Fang, Meng
    Shi, Zijing
    Li, Yitong
    Chen, Ling
    Pechenizkiy, Mykola
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 13538 - 13556
  • [30] Towards Automatic Evaluation of NLG Tasks Using Conversational Large Language Models
    Riyadh, Md
    Shafiq, M. Omair
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2023, PT II, 2023, 676 : 425 - 437