Optimizing Policy via Deep Reinforcement Learning for Dialogue Management

被引:1
|
作者
Xu, Guanghao [1 ]
Lee, Hyunjung [2 ]
Koo, Myoung-Wan [1 ]
Seo, Jungyun [1 ]
机构
[1] Sogang Univ, Dept Comp Sci & Engn, Seoul, South Korea
[2] Univ Leipzig, Inst Linguist, D-04107 Leipzig, Germany
关键词
Deep Reinforcement Learning; Dialogue Management; Dialogue Policy;
D O I
10.1109/BigComp.2018.00101
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we propose a dialogue manager model based on Deep Reinforcement Learning, which automatically optimizes a dialogue policy. The policy is trained within deep Q-learning algorithm, which efficiently approximates value of actions given a large space of dialogue state. Evaluation processes are conducted by comparing the performance of the proposed model to a rule-based one on the dialogue corpora of DSTC2 and 3 under three different levels of error rate in Spoken Language Understanding. Experimental results prove that given certain level of SLU error, the dialogue manager with self-learned policy shows higher completion rate and the robustness to SLU error. Overcoming the drawbacks of rule-based approach such as limited flexibility and high maintenance cost, our model shows the strength of self-learning algorithm in optimizing policy of dialogue manager without any hand-crafted features.
引用
收藏
页码:582 / 589
页数:8
相关论文
共 50 条
  • [31] Bayesian Deep Reinforcement Learning via Deep Kernel Learning
    Junyu Xuan
    Jie Lu
    Zheng Yan
    Guangquan Zhang
    International Journal of Computational Intelligence Systems, 2018, 12 : 164 - 171
  • [32] Bayesian Deep Reinforcement Learning via Deep Kernel Learning
    Xuan, Junyu
    Lu, Jie
    Yan, Zheng
    Zhang, Guangquan
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (01) : 164 - 171
  • [33] CURIOSITY-DRIVEN REINFORCEMENT LEARNING FOR DIALOGUE MANAGEMENT
    Wesselmann, Paula
    Wu, Yen-Chen
    Gasic, Milica
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7210 - 7214
  • [34] A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimization
    Daubigney, Lucie
    Geist, Matthieu
    Chandramohan, Senthilkumar
    Pietquin, Olivier
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2012, 6 (08) : 891 - 902
  • [35] Optimizing Automated Trading Systems with Deep Reinforcement Learning
    Tran, Minh
    Pham-Hi, Duc
    Bui, Marc
    ALGORITHMS, 2023, 16 (01)
  • [36] Optimizing ZX-diagrams with deep reinforcement learning
    Naegele, Maximilian
    Marquardt, Florian
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (03):
  • [37] Optimizing warfarin dosing using deep reinforcement learning
    Anzabi Zadeh, Sadjad
    Street, W. Nick
    Thomas, Barrett W.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 137
  • [38] Optimizing Sequential Experimental Design with Deep Reinforcement Learning
    Blau, Tom
    Bonilla, Edwin V.
    Chades, Iadine
    Dezfouli, Amir
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [39] Reward-free offline reinforcement learning: Optimizing behavior policy via action exploration
    Huang, Zhenbo
    Sun, Shiliang
    Zhao, Jing
    KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [40] Deep Reinforcement Learning of Dialogue Policies with Less Weight Updates
    Cuayahuitl, Heriberto
    Yu, Seunghak
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2511 - 2515