Transfer Learning for User Adaptation in Spoken Dialogue Systems

被引:0
|
作者
Genevay, Aude [1 ]
Laroche, Romain [1 ]
机构
[1] Orange Labs, Issy Les Moulineaux, France
关键词
Reinforcement Learning; Transfer Learning; Markov Decision Processes; Online Learning; User Profiling; Multi-Armed Bandit; Jumpstart; Asymptotic Performance Spoken Dialogue Systems;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper focuses on user adaptation in Spoken Dialogue Systems. It is considered that the system has already been optimised with Reinforcement Learning methods for a set of users. The goal is to use and transfer this prior knowledge to adapt the system to a new user as quickly as possible without impacting asymptotic performance. The first contribution is a source selection method using a multi-armed stochastic bandit algorithm in order to improve the jumpstart, i.e. the average performance at the start of the learning curve. Contrarily to previous source selection methods, there is no need to define a metric between users, and it is parameter free. The second contribution is an innovative method for selecting the most informative transitions within the previously selected source, to improve the target model, in such a way that only transitions that were not observed with the target user are transferred from the selected source. For our experimentation, Reinforcement Learning is performed with the Fitted Q-Iteration algorithm. Both methods are validated on a negotiation game: an appointment scheduling simulator that allows the definition of simulated user models adopting diversified behaviours. Compared to state-of-the-art transfer algorithms, results show significant improvements for both jumpstart and asymptotic performance.
引用
收藏
页码:975 / 983
页数:9
相关论文
共 50 条
  • [21] LEARNING CONCEPTS THROUGH CONVERSATIONS IN SPOKEN DIALOGUE SYSTEMS
    Jia, Robin
    Heck, Larry
    Hakkani-Tur, Dilek
    Nikolov, Georgi
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5725 - 5729
  • [22] Spoken Dialogue Systems
    Foster, Mary Ellen
    COMPUTATIONAL LINGUISTICS, 2010, 36 (04) : 781 - 783
  • [23] Spoken dialogue systems
    Jokinen K.
    McTear M.
    Synthesis Lectures on Human Language Technologies, 2010, 2 (01): : 1 - 167
  • [24] Spoken Dialogue Systems
    Rosset, Sophie
    Vilnat, Anne
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2010, 51 (01): : 151 - 154
  • [25] A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems
    El Asri, Layla
    He, Jing
    Suleman, Kaheer
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1151 - 1155
  • [26] Flexible guidance generation using user model in spoken dialogue systems
    Komatani, K
    Ueno, S
    Kawahara, T
    Okuno, HG
    41ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2003, : 256 - 263
  • [27] Testing the performance of spoken dialogue systems by means of an artificially simulated user
    Lopez-Cozar, Ramon
    Callejas, Zoraida
    McTear, Michael
    ARTIFICIAL INTELLIGENCE REVIEW, 2006, 26 (04) : 291 - 323
  • [28] Testing the performance of spoken dialogue systems by means of an artificially simulated user
    Ramón López-Cózar
    Zoraida Callejas
    Michael McTear
    Artificial Intelligence Review, 2006, 26 : 291 - 323
  • [29] POLICY COMMITTEE FOR ADAPTATION IN MULTI-DOMAIN SPOKEN DIALOGUE SYSTEMS
    Gasic, M.
    Mrksic, N.
    Su, P-H.
    Vandyke, D.
    Wen, T-H.
    Young, S.
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 806 - 812
  • [30] Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogue Systems
    Chen, Yun-Nung
    53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP (ACL-IJCNLP 2015), 2015, : 1 - 7