Optimizing Policy via Deep Reinforcement Learning for Dialogue Management

被引：1

作者：

Xu, Guanghao ^{[1
]}

Lee, Hyunjung ^{[2
]}

Koo, Myoung-Wan ^{[1
]}

Seo, Jungyun ^{[1
]}

机构：

[1] Sogang Univ, Dept Comp Sci & Engn, Seoul, South Korea

[2] Univ Leipzig, Inst Linguist, D-04107 Leipzig, Germany

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP) | 2018年

关键词：

Deep Reinforcement Learning; Dialogue Management; Dialogue Policy;

D O I：

10.1109/BigComp.2018.00101

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In this paper, we propose a dialogue manager model based on Deep Reinforcement Learning, which automatically optimizes a dialogue policy. The policy is trained within deep Q-learning algorithm, which efficiently approximates value of actions given a large space of dialogue state. Evaluation processes are conducted by comparing the performance of the proposed model to a rule-based one on the dialogue corpora of DSTC2 and 3 under three different levels of error rate in Spoken Language Understanding. Experimental results prove that given certain level of SLU error, the dialogue manager with self-learned policy shows higher completion rate and the robustness to SLU error. Overcoming the drawbacks of rule-based approach such as limited flexibility and high maintenance cost, our model shows the strength of self-learning algorithm in optimizing policy of dialogue manager without any hand-crafted features.

引用

页码：582 / 589

页数：8

共 50 条

[31] Bayesian Deep Reinforcement Learning via Deep Kernel Learning
Junyu Xuan
Jie Lu
Zheng Yan
Guangquan Zhang
International Journal of Computational Intelligence Systems, 2018, 12 : 164 - 171
[32] Bayesian Deep Reinforcement Learning via Deep Kernel Learning
Xuan, Junyu
Lu, Jie
Yan, Zheng
Zhang, Guangquan
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (01) : 164 - 171
[33] CURIOSITY-DRIVEN REINFORCEMENT LEARNING FOR DIALOGUE MANAGEMENT
Wesselmann, Paula
Wu, Yen-Chen
Gasic, Milica
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7210 - 7214
[34] A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimization
Daubigney, Lucie
Geist, Matthieu
Chandramohan, Senthilkumar
Pietquin, Olivier
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2012, 6 (08) : 891 - 902
[35] Optimizing Automated Trading Systems with Deep Reinforcement Learning
Tran, Minh
Pham-Hi, Duc
Bui, Marc
ALGORITHMS, 2023, 16 (01)
[36] Optimizing ZX-diagrams with deep reinforcement learning
Naegele, Maximilian
Marquardt, Florian
MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (03):
[37] Optimizing warfarin dosing using deep reinforcement learning
Anzabi Zadeh, Sadjad
Street, W. Nick
Thomas, Barrett W.
JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 137
[38] Optimizing Sequential Experimental Design with Deep Reinforcement Learning
Blau, Tom
Bonilla, Edwin V.
Chades, Iadine
Dezfouli, Amir
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[39] Reward-free offline reinforcement learning: Optimizing behavior policy via action exploration
Huang, Zhenbo
Sun, Shiliang
Zhao, Jing
KNOWLEDGE-BASED SYSTEMS, 2024, 299
[40] Deep Reinforcement Learning of Dialogue Policies with Less Weight Updates
Cuayahuitl, Heriberto
Yu, Seunghak
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2511 - 2515

← 1 2 3 4 5 →