Optimizing Policy via Deep Reinforcement Learning for Dialogue Management

被引:1
|
作者
Xu, Guanghao [1 ]
Lee, Hyunjung [2 ]
Koo, Myoung-Wan [1 ]
Seo, Jungyun [1 ]
机构
[1] Sogang Univ, Dept Comp Sci & Engn, Seoul, South Korea
[2] Univ Leipzig, Inst Linguist, D-04107 Leipzig, Germany
关键词
Deep Reinforcement Learning; Dialogue Management; Dialogue Policy;
D O I
10.1109/BigComp.2018.00101
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we propose a dialogue manager model based on Deep Reinforcement Learning, which automatically optimizes a dialogue policy. The policy is trained within deep Q-learning algorithm, which efficiently approximates value of actions given a large space of dialogue state. Evaluation processes are conducted by comparing the performance of the proposed model to a rule-based one on the dialogue corpora of DSTC2 and 3 under three different levels of error rate in Spoken Language Understanding. Experimental results prove that given certain level of SLU error, the dialogue manager with self-learned policy shows higher completion rate and the robustness to SLU error. Overcoming the drawbacks of rule-based approach such as limited flexibility and high maintenance cost, our model shows the strength of self-learning algorithm in optimizing policy of dialogue manager without any hand-crafted features.
引用
收藏
页码:582 / 589
页数:8
相关论文
共 50 条
  • [11] Towards integrated dialogue policy learning for multiple domains and intents using Hierarchical Deep Reinforcement Learning
    Saha, Tulika
    Gupta, Dhawal
    Saha, Sriparna
    Bhattacharyya, Pushpak
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 162 (162)
  • [12] Optimizing Energy Efficiency for Data Center via Parameterized Deep Reinforcement Learning
    Ran, Yongyi
    Hu, Han
    Wen, Yonggang
    Zhou, Xin
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (02) : 1310 - 1323
  • [13] Reinforcement learning in optimizing forest management
    Malo, Pekka
    Tahvonen, Olli
    Suominen, Antti
    Back, Philipp
    Viitasaari, Lauri
    CANADIAN JOURNAL OF FOREST RESEARCH, 2021, 51 (10) : 1393 - 1409
  • [14] Dispatching Policy Optimizing of Cruise Taxi in a Multiagent‑Based Deep Reinforcement Learning Framework
    Ma X.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2023, 48 (12): : 2108
  • [15] Optimizing Communication in Deep Reinforcement Learning with XingTian
    Pan, Lichen
    Qian, Jun
    Xia, Wei
    Mao, Hangyu
    Yao, Jun
    Li, Pengze
    Xiao, Zhen
    PROCEEDINGS OF THE TWENTY-THIRD ACM/IFIP INTERNATIONAL MIDDLEWARE CONFERENCE, MIDDLEWARE 2022, 2022, : 255 - 268
  • [16] Optimizing Traffic at Intersections With Deep Reinforcement Learning
    Boyko, Nataliya
    Mokryk, Yaroslav
    JOURNAL OF ENGINEERING, 2024, 2024
  • [17] Optimizing Chemical Reactions with Deep Reinforcement Learning
    Zhou, Zhenpeng
    Li, Xiaocheng
    Zare, Richard N.
    ACS CENTRAL SCIENCE, 2017, 3 (12) : 1337 - 1344
  • [18] Policy Reuse in Deep Reinforcement Learning
    Glatt, Ruben
    Helena, Anna
    Costa, Reali
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4929 - 4930
  • [19] Optimizing hybrid electric vehicle coupling organic Rankine cycle energy management strategy via deep reinforcement learning
    Zhang, Xuanang
    Wang, Xuan
    Yuan, Ping
    Tian, Hua
    Shu, Gequn
    ENERGY AND AI, 2024, 17
  • [20] Optimizing Crop Management with Reinforcement Learning and Imitation Learning
    Tao, Ran
    Zhao, Pan
    Wu, Jing
    Martin, Nicolas
    Harrison, Matthew T.
    Ferreira, Carla
    Kalantari, Zahra
    Hovakimyan, Naira
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 6228 - 6236