ON-LINE POLICY OPTIMISATION OF BAYESIAN SPOKEN DIALOGUE SYSTEMS VIA HUMAN INTERACTION

被引：0

作者：

Gasic, M. ^{[1
]}

Breslin, C. ^{[1
]}

Henderson, M. ^{[1
]}

Kim, D. ^{[1
]}

Szummer, M. ^{[1
]}

Thomson, B. ^{[1
]}

Tsiakoulis, P. ^{[1
]}

Young, S. ^{[1
]}

机构：

[1] Univ Cambridge, Dept Engn, Cambridge CB2 1TN, England

来源：

2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年

关键词：

dialogue systems; POMDP; Gaussian process;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

A partially observable Markov decision process has been proposed as a dialogue model that enables robustness to speech recognition errors and automatic policy optimisation using reinforcement learning (RL). However, conventional RL algorithms require a very large number of dialogues, necessitating a user simulator. Recently, Gaussian processes have been shown to substantially speed up the optimisation, making it possible to learn directly from interaction with human users. However, early studies have been limited to very low dimensional spaces and the learning has exhibited convergence problems. Here we investigate learning from human interaction using the Bayesian Update of Dialogue State system. This dynamic Bayesian network based system has an optimisation space covering more than one hundred features, allowing a wide range of behaviours to be learned. Using an improved policy model and a more robust reward function, we show that stable learning can be achieved that significantly outperforms a simulator trained policy.

引用

页码：8367 / 8371

页数：5

共 50 条

[1] On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems
Su, Pei-Hao
Gasic, Milica
Mrksic, Nikola
Rojas-Barahona, Lina
Ultes, Stefan
Vandyke, David
Wen, Tsung-Hsien
Young, Steve
[J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 2431 - 2441
[2] Neural User Simulation for Corpus-based Policy Optimisation for Spoken Dialogue Systems
Kreyssig, Florian L.
Casanueva, Inigo
Budzianowski, Pawel
Gasic, Milica
[J]. 19TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2018), 2018, : 60 - 69
[3] Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system
Daubigney, Lucie
Gasic, Milica
Chandramohan, Senthilkumar
Geist, Matthieu
Pietquin, Olivier
Young, Steve
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1308 - +
[4] Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems
Thomson, Blaise
Young, Steve
[J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 562 - 588
[5] Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue Systems
Lin, Ting-En
Wu, Yuchuan
Huang, Fei
Si, Luo
Sun, Jian
Li, Yongbin
[J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3299 - 3308
[6] Spoken Dialogue Systems for its Interaction in Social Networks
Griol, D.
Patricio, M. A.
Molina, J. M.
Arroyo, A.
Callejas, Z.
Lopez-Cozar, R.
[J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (44): : 107 - 114
[7] Towards human-like spoken dialogue systems
Edlund, Jens
Gustafson, Joakim
Heldner, Mattias
Hjalmarsson, Anna
[J]. SPEECH COMMUNICATION, 2008, 50 (8-9) : 630 - 645
[8] Human-robot interaction through spoken language dialogue
Lopes, LS
Teixeira, A
[J]. 2000 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2000), VOLS 1-3, PROCEEDINGS, 2000, : 528 - 534
[9] POLICY COMMITTEE FOR ADAPTATION IN MULTI-DOMAIN SPOKEN DIALOGUE SYSTEMS
Gasic, M.
Mrksic, N.
Su, P-H.
Vandyke, D.
Wen, T-H.
Young, S.
[J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 806 - 812
[10] On-Line Audio Dilation for Human Interaction
Novak, John S., III
Archer, Jason
Shafiro, Valeriy
Kenyon, Robert
Leigh, Jason
[J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1868 - 1870

← 1 2 3 4 5 →