Optimizing Policy via Deep Reinforcement Learning for Dialogue Management

被引:1
|
作者
Xu, Guanghao [1 ]
Lee, Hyunjung [2 ]
Koo, Myoung-Wan [1 ]
Seo, Jungyun [1 ]
机构
[1] Sogang Univ, Dept Comp Sci & Engn, Seoul, South Korea
[2] Univ Leipzig, Inst Linguist, D-04107 Leipzig, Germany
关键词
Deep Reinforcement Learning; Dialogue Management; Dialogue Policy;
D O I
10.1109/BigComp.2018.00101
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we propose a dialogue manager model based on Deep Reinforcement Learning, which automatically optimizes a dialogue policy. The policy is trained within deep Q-learning algorithm, which efficiently approximates value of actions given a large space of dialogue state. Evaluation processes are conducted by comparing the performance of the proposed model to a rule-based one on the dialogue corpora of DSTC2 and 3 under three different levels of error rate in Spoken Language Understanding. Experimental results prove that given certain level of SLU error, the dialogue manager with self-learned policy shows higher completion rate and the robustness to SLU error. Overcoming the drawbacks of rule-based approach such as limited flexibility and high maintenance cost, our model shows the strength of self-learning algorithm in optimizing policy of dialogue manager without any hand-crafted features.
引用
收藏
页码:582 / 589
页数:8
相关论文
共 50 条
  • [1] POLICY ADAPTATION FOR DEEP REINFORCEMENT LEARNING-BASED DIALOGUE MANAGEMENT
    Chen, Lu
    Chang, Cheng
    Chen, Zhi
    Tan, Bowen
    Gasic, Milica
    Yu, Kai
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6074 - 6078
  • [2] Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system
    Singh, S
    Litman, D
    Kearns, M
    Walker, M
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2002, 16 : 105 - 133
  • [3] Deep Reinforcement Learning for Optimizing Finance Portfolio Management
    Hu, Yuh-Jong
    Lin, Shang-Jen
    PROCEEDINGS 2019 AMITY INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AICAI), 2019, : 14 - 20
  • [4] BENCHMARKING UNCERTAINTY ESTIMATES WITH DEEP REINFORCEMENT LEARNING FOR DIALOGUE POLICY OPTIMISATION
    Tegho, Christopher
    Budzianowski, Pawel
    Gasic, Milica
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6069 - 6073
  • [5] Optimizing Nitrogen Management with Deep Reinforcement Learning and Crop Simulations
    Wu, Jing
    Tao, Ran
    Zhao, Pan
    Martin, Nicolas F.
    Hovakimyan, Naira
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 1711 - 1719
  • [6] AgentGraph: Toward Universal Dialogue Management With Structured Deep Reinforcement Learning
    Chen, Lu
    Chen, Zhi
    Tan, Bowen
    Long, Sishan
    Gasic, Milica
    Yu, Kai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (09) : 1378 - 1391
  • [7] On Optimizing Operational Efficiency in Storage Systems via Deep Reinforcement Learning
    Srinivasa, Sunil
    Kathalagiri, Girish
    Varanasi, Julu Subramanyam
    Quintela, Luis Carlos
    Charafeddine, Mohamad
    Lee, Chi-Hoon
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT III, 2019, 11053 : 238 - 253
  • [8] Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
    Yang, Tianpei
    Hao, Jianye
    Meng, Zhaopeng
    Zhang, Zongzhang
    Hu, Yujing
    Chen, Yingfeng
    Fan, Changjie
    Wang, Weixun
    Liu, Wulong
    Wang, Zhaodong
    Peng, Jiajie
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3094 - 3100
  • [9] Reinforcement Learning for Personalized Dialogue Management
    den Hengst, Floris
    Hoogendoorn, Mark
    van Harmelen, Frank
    Bosman, Joost
    2019 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2019), 2019, : 59 - 67
  • [10] Optimizing deep-space DTN congestion control via deep reinforcement learning
    Yang, Lei
    Fraire, Juan A.
    Zhao, Kanglian
    Wang, Ruhai
    Li, Wenfeng
    Yang, Hong
    COMPUTER NETWORKS, 2024, 255