Transfer Learning for User Adaptation in Spoken Dialogue Systems

被引:0
|
作者
Genevay, Aude [1 ]
Laroche, Romain [1 ]
机构
[1] Orange Labs, Issy Les Moulineaux, France
关键词
Reinforcement Learning; Transfer Learning; Markov Decision Processes; Online Learning; User Profiling; Multi-Armed Bandit; Jumpstart; Asymptotic Performance Spoken Dialogue Systems;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper focuses on user adaptation in Spoken Dialogue Systems. It is considered that the system has already been optimised with Reinforcement Learning methods for a set of users. The goal is to use and transfer this prior knowledge to adapt the system to a new user as quickly as possible without impacting asymptotic performance. The first contribution is a source selection method using a multi-armed stochastic bandit algorithm in order to improve the jumpstart, i.e. the average performance at the start of the learning curve. Contrarily to previous source selection methods, there is no need to define a metric between users, and it is parameter free. The second contribution is an innovative method for selecting the most informative transitions within the previously selected source, to improve the target model, in such a way that only transitions that were not observed with the target user are transferred from the selected source. For our experimentation, Reinforcement Learning is performed with the Fitted Q-Iteration algorithm. Both methods are validated on a negotiation game: an appointment scheduling simulator that allows the definition of simulated user models adopting diversified behaviours. Compared to state-of-the-art transfer algorithms, results show significant improvements for both jumpstart and asymptotic performance.
引用
收藏
页码:975 / 983
页数:9
相关论文
共 50 条
  • [1] LEARNING USER INTENTIONS IN SPOKEN DIALOGUE SYSTEMS
    Chinaei, Hamid R.
    Chaib-draa, Brahim
    Lamontagne, Luc
    ICAART 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, 2009, : 107 - +
  • [2] User Simulation for Spoken Dialogue Systems: Learning and Evaluation
    Georgila, Kallirroi
    Henderson, James
    Lenzon, Oliver
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1065 - 1068
  • [3] Emotion recognition and adaptation in spoken dialogue systems
    Pittermann, Johannes
    Pittermann, Angela
    Minker, Wolfgang
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2010, 13 (01) : 49 - 60
  • [4] Naturalness, Adaptation and Cooperativeness in Spoken Dialogue Systems
    Gnjatovic, Milan
    Pekar, Darko
    Delic, Vlado
    TOWARD AUTONOMOUS, ADAPTIVE, AND CONTEXT-AWARE MULTIMODAL INTERFACES: THEORETICAL AND PRACTICAL ISSUES, 2011, 6456 : 298 - 304
  • [5] Cross-Lingual Transfer Learning for Affective Spoken Dialogue Systems
    Gjoreski, Kristijan
    Gjoreski, Aleksandar
    Kraljevski, Ivan
    Hirschfeld, Diane
    INTERSPEECH 2019, 2019, : 1916 - 1920
  • [6] Reinforcement learning for spoken dialogue systems
    Singh, S
    Kearns, M
    Litman, D
    Walker, M
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 956 - 962
  • [7] Learning to ground in spoken dialogue systems
    Pietquin, Olivier
    2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol IV, Pts 1-3, 2007, : 165 - 168
  • [8] Machine Learning for Spoken Dialogue Systems
    Lemon, Oliver
    Pietquin, Olivier
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1761 - +
  • [9] Predicting user mental states in spoken dialogue systems
    Callejas, Zoraida
    Griol, David
    Lopez-Cozar, Ramon
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011,
  • [10] Identifying user corrections automatically in spoken dialogue systems
    Hirschberg, J
    Litman, D
    Swerts, M
    2ND MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2001, : 208 - 215