Transfer Learning for User Adaptation in Spoken Dialogue Systems

被引:0
|
作者
Genevay, Aude [1 ]
Laroche, Romain [1 ]
机构
[1] Orange Labs, Issy Les Moulineaux, France
关键词
Reinforcement Learning; Transfer Learning; Markov Decision Processes; Online Learning; User Profiling; Multi-Armed Bandit; Jumpstart; Asymptotic Performance Spoken Dialogue Systems;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper focuses on user adaptation in Spoken Dialogue Systems. It is considered that the system has already been optimised with Reinforcement Learning methods for a set of users. The goal is to use and transfer this prior knowledge to adapt the system to a new user as quickly as possible without impacting asymptotic performance. The first contribution is a source selection method using a multi-armed stochastic bandit algorithm in order to improve the jumpstart, i.e. the average performance at the start of the learning curve. Contrarily to previous source selection methods, there is no need to define a metric between users, and it is parameter free. The second contribution is an innovative method for selecting the most informative transitions within the previously selected source, to improve the target model, in such a way that only transitions that were not observed with the target user are transferred from the selected source. For our experimentation, Reinforcement Learning is performed with the Fitted Q-Iteration algorithm. Both methods are validated on a negotiation game: an appointment scheduling simulator that allows the definition of simulated user models adopting diversified behaviours. Compared to state-of-the-art transfer algorithms, results show significant improvements for both jumpstart and asymptotic performance.
引用
收藏
页码:975 / 983
页数:9
相关论文
共 50 条
  • [41] TOWARDS FINE-GRAIN USER-SIMULATION FOR SPOKEN DIALOGUE SYSTEMS
    Lopez-Cozar, Ramon
    Griol, David
    Espejo, Gonzalo
    Callejas, Zoraida
    Abalos, Nieves
    SPOKEN DIALOGUE SYSTEMS: TECHNOLOGY AND DESIGN, 2011, : 53 - 81
  • [42] Regularized Neural User Model for Goal-Oriented Spoken Dialogue Systems
    Serras, Manex
    Ines Torres, Maria
    del Pozo, Arantza
    ADVANCED SOCIAL INTERACTION WITH AGENTS, 2019, 510 : 235 - 245
  • [43] Online Learning of Attributed Bi-Automata for Dialogue Management in Spoken Dialogue Systems
    Serras, Manex
    Ines Torres, Maria
    Del Pozo, Arantza
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017), 2017, 10255 : 22 - 31
  • [44] User modeling for spoken dialogue system evaluation
    Eckert, W
    Levin, E
    Pieraccini, R
    1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 80 - 87
  • [45] Review of spoken dialogue systems
    Lopez-Cozar, Ramon
    Callejas, Zoraida
    Griol, David
    Quesada, Jose F.
    LOQUENS, 2014, 1 (02):
  • [46] Spoken language dialogue systems
    Giachin, E
    McGlashan, S
    CORPUS-BASED METHODS IN LANGUAGE AND SPEECH PROCESSING, 1997, 2 : 69 - 117
  • [47] Dialogue-Learning Correlations in Spoken Dialogue Tutoring
    Forbes-Riley, Kate
    Litman, Diane
    Huettner, Alison
    Ward, Arthur
    ARTIFICIAL INTELLIGENCE IN EDUCATION: SUPPORTING LEARNING THROUGH INTELLIGENT AND SOCIALLY INFORMED TECHNOLOGY, 2005, 125 : 225 - 232
  • [48] Learning from Real Users: Rating Dialogue Success with Neural Networks for Reinforcement Learning in Spoken Dialogue Systems
    Su, Pei-Hao
    Vandyke, David
    Gasic, Milica
    Kim, Dongho
    Mrksic, Nikola
    Wen, Tsung-Hsien
    Young, Steve
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2007 - 2011
  • [49] Learning Adaptive Referring Expression Generation Policies for Spoken Dialogue Systems
    Janarthanam, Srinivasan
    Lemon, Oliver
    EMPIRICAL METHODS IN NATURAL LANGUAGE GENERATION: DATA-ORIENTED METHODS AND EMPIRICAL EVALUATION, 2010, 5790 : 67 - +
  • [50] Semi-supervised learning for character expression of spoken dialogue systems
    Yamamoto, Kenta
    Inoue, Koji
    Kawahara, Tatsuya
    INTERSPEECH 2020, 2020, : 4188 - 4192