Transfer Learning for User Adaptation in Spoken Dialogue Systems

被引：0

作者：

Genevay, Aude ^{[1
]}

Laroche, Romain ^{[1
]}

机构：

[1] Orange Labs, Issy Les Moulineaux, France

来源：

AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS | 2016年

关键词：

Reinforcement Learning; Transfer Learning; Markov Decision Processes; Online Learning; User Profiling; Multi-Armed Bandit; Jumpstart; Asymptotic Performance Spoken Dialogue Systems;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper focuses on user adaptation in Spoken Dialogue Systems. It is considered that the system has already been optimised with Reinforcement Learning methods for a set of users. The goal is to use and transfer this prior knowledge to adapt the system to a new user as quickly as possible without impacting asymptotic performance. The first contribution is a source selection method using a multi-armed stochastic bandit algorithm in order to improve the jumpstart, i.e. the average performance at the start of the learning curve. Contrarily to previous source selection methods, there is no need to define a metric between users, and it is parameter free. The second contribution is an innovative method for selecting the most informative transitions within the previously selected source, to improve the target model, in such a way that only transitions that were not observed with the target user are transferred from the selected source. For our experimentation, Reinforcement Learning is performed with the Fitted Q-Iteration algorithm. Both methods are validated on a negotiation game: an appointment scheduling simulator that allows the definition of simulated user models adopting diversified behaviours. Compared to state-of-the-art transfer algorithms, results show significant improvements for both jumpstart and asymptotic performance.

引用

页码：975 / 983

页数：9

共 50 条

[1] LEARNING USER INTENTIONS IN SPOKEN DIALOGUE SYSTEMS
Chinaei, Hamid R.
Chaib-draa, Brahim
Lamontagne, Luc
ICAART 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, 2009, : 107 - +
[2] User Simulation for Spoken Dialogue Systems: Learning and Evaluation
Georgila, Kallirroi
Henderson, James
Lenzon, Oliver
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1065 - 1068
[3] Emotion recognition and adaptation in spoken dialogue systems
Pittermann, Johannes
Pittermann, Angela
Minker, Wolfgang
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2010, 13 (01) : 49 - 60
[4] Naturalness, Adaptation and Cooperativeness in Spoken Dialogue Systems
Gnjatovic, Milan
Pekar, Darko
Delic, Vlado
TOWARD AUTONOMOUS, ADAPTIVE, AND CONTEXT-AWARE MULTIMODAL INTERFACES: THEORETICAL AND PRACTICAL ISSUES, 2011, 6456 : 298 - 304
[5] Cross-Lingual Transfer Learning for Affective Spoken Dialogue Systems
Gjoreski, Kristijan
Gjoreski, Aleksandar
Kraljevski, Ivan
Hirschfeld, Diane
INTERSPEECH 2019, 2019, : 1916 - 1920
[6] Reinforcement learning for spoken dialogue systems
Singh, S
Kearns, M
Litman, D
Walker, M
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 956 - 962
[7] Learning to ground in spoken dialogue systems
Pietquin, Olivier
2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol IV, Pts 1-3, 2007, : 165 - 168
[8] Machine Learning for Spoken Dialogue Systems
Lemon, Oliver
Pietquin, Olivier
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1761 - +
[9] Predicting user mental states in spoken dialogue systems
Callejas, Zoraida
Griol, David
Lopez-Cozar, Ramon
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011,
[10] Identifying user corrections automatically in spoken dialogue systems
Hirschberg, J
Litman, D
Swerts, M
2ND MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2001, : 208 - 215

← 1 2 3 4 5 →