ON-LINE POLICY OPTIMISATION OF BAYESIAN SPOKEN DIALOGUE SYSTEMS VIA HUMAN INTERACTION

被引:0
|
作者
Gasic, M. [1 ]
Breslin, C. [1 ]
Henderson, M. [1 ]
Kim, D. [1 ]
Szummer, M. [1 ]
Thomson, B. [1 ]
Tsiakoulis, P. [1 ]
Young, S. [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1TN, England
关键词
dialogue systems; POMDP; Gaussian process;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A partially observable Markov decision process has been proposed as a dialogue model that enables robustness to speech recognition errors and automatic policy optimisation using reinforcement learning (RL). However, conventional RL algorithms require a very large number of dialogues, necessitating a user simulator. Recently, Gaussian processes have been shown to substantially speed up the optimisation, making it possible to learn directly from interaction with human users. However, early studies have been limited to very low dimensional spaces and the learning has exhibited convergence problems. Here we investigate learning from human interaction using the Bayesian Update of Dialogue State system. This dynamic Bayesian network based system has an optimisation space covering more than one hundred features, allowing a wide range of behaviours to be learned. Using an improved policy model and a more robust reward function, we show that stable learning can be achieved that significantly outperforms a simulator trained policy.
引用
收藏
页码:8367 / 8371
页数:5
相关论文
共 50 条
  • [1] On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems
    Su, Pei-Hao
    Gasic, Milica
    Mrksic, Nikola
    Rojas-Barahona, Lina
    Ultes, Stefan
    Vandyke, David
    Wen, Tsung-Hsien
    Young, Steve
    [J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 2431 - 2441
  • [2] Neural User Simulation for Corpus-based Policy Optimisation for Spoken Dialogue Systems
    Kreyssig, Florian L.
    Casanueva, Inigo
    Budzianowski, Pawel
    Gasic, Milica
    [J]. 19TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2018), 2018, : 60 - 69
  • [3] Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system
    Daubigney, Lucie
    Gasic, Milica
    Chandramohan, Senthilkumar
    Geist, Matthieu
    Pietquin, Olivier
    Young, Steve
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1308 - +
  • [4] Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems
    Thomson, Blaise
    Young, Steve
    [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 562 - 588
  • [5] Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue Systems
    Lin, Ting-En
    Wu, Yuchuan
    Huang, Fei
    Si, Luo
    Sun, Jian
    Li, Yongbin
    [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3299 - 3308
  • [6] Spoken Dialogue Systems for its Interaction in Social Networks
    Griol, D.
    Patricio, M. A.
    Molina, J. M.
    Arroyo, A.
    Callejas, Z.
    Lopez-Cozar, R.
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (44): : 107 - 114
  • [7] Towards human-like spoken dialogue systems
    Edlund, Jens
    Gustafson, Joakim
    Heldner, Mattias
    Hjalmarsson, Anna
    [J]. SPEECH COMMUNICATION, 2008, 50 (8-9) : 630 - 645
  • [8] Human-robot interaction through spoken language dialogue
    Lopes, LS
    Teixeira, A
    [J]. 2000 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2000), VOLS 1-3, PROCEEDINGS, 2000, : 528 - 534
  • [9] POLICY COMMITTEE FOR ADAPTATION IN MULTI-DOMAIN SPOKEN DIALOGUE SYSTEMS
    Gasic, M.
    Mrksic, N.
    Su, P-H.
    Vandyke, D.
    Wen, T-H.
    Young, S.
    [J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 806 - 812
  • [10] On-Line Audio Dilation for Human Interaction
    Novak, John S., III
    Archer, Jason
    Shafiro, Valeriy
    Kenyon, Robert
    Leigh, Jason
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1868 - 1870