AgentGraph: Toward Universal Dialogue Management With Structured Deep Reinforcement Learning

被引:26
|
作者
Chen, Lu [1 ]
Chen, Zhi [1 ]
Tan, Bowen [1 ]
Long, Sishan [1 ]
Gasic, Milica [2 ]
Yu, Kai [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[2] Heinrich Heine Univ Dusseldorf, D-40225 Dusseldorf, Germany
关键词
Dialogue policy; deep reinforcement learning; graph neural networks; policy adaptation; transfer learning; STATE; SYSTEMS;
D O I
10.1109/TASLP.2019.2919872
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Dialogue policy plays an important role in task-oriented spoken dialogue systems. It determines how to respond to users. The recently proposed deep reinforcement learning (DRL) approaches have been used for policy optimization. However, these deep models are still challenging for two reasons: first, many DRL-based policies are not sample efficient; and second, most models do not have the capability of policy transfer between different domains. In this paper, we propose a universal framework, AgentGraph, to tackle these two problems. The proposed AgentGraph is the combination of graph neural network (GNN) based architecture and DRL-based algorithm. It can be regarded as one of the multi-agent reinforcement learning approaches. Each agent corresponds to a node in a graph, which is defined according to the dialogue domain ontology. When making a decision, each agent can communicate with its neighbors on the graph. Under AgentGraph framework, we further propose dual GNN-based dialogue policy, which implicitly decomposes the decision in each turn into a high-level global decision and a low-level local decision. Experiments show that AgentGraph models significantly outperform traditional reinforcement learning approaches on most of the 18 tasks of the PyDial benchmark. Moreover, when transferred from the source task to a target task, these models not only have acceptable initial performance but also converge much faster on the target task.
引用
收藏
页码:1378 / 1391
页数:14
相关论文
共 50 条
  • [11] Deep Reinforcement Learning for On-line Dialogue State Tracking
    Chen, Zhi
    Chen, Lu
    Zhou, Xiang
    Yu, Kai
    MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2022, 2023, 1765 : 278 - 292
  • [12] Deep reinforcement learning for portfolio management
    Yang, Shantian
    KNOWLEDGE-BASED SYSTEMS, 2023, 278
  • [13] Resource Management with Deep Reinforcement Learning
    Mao, Hongzi
    Alizadeh, Mohammad
    Menache, Ishai
    Kandula, Srikanth
    PROCEEDINGS OF THE 15TH ACM WORKSHOP ON HOT TOPICS IN NETWORKS (HOTNETS '16), 2016, : 50 - 56
  • [14] Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management
    Gupta, Dhawal
    Chow, Yinlam
    Tulepbergenov, Aza
    Ghavamzadeh, Mohammad
    Boutilier, Craig
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [15] Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system
    Singh, S
    Litman, D
    Kearns, M
    Walker, M
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2002, 16 : 105 - 133
  • [16] Model-based Bayesian Reinforcement Learning for Dialogue Management
    Lison, Pierre
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 475 - 479
  • [17] Toward robust and scalable deep spiking reinforcement learning
    Akl, Mahmoud
    Ergene, Deniz
    Walter, Florian
    Knoll, Alois
    FRONTIERS IN NEUROROBOTICS, 2023, 16
  • [18] BENCHMARKING UNCERTAINTY ESTIMATES WITH DEEP REINFORCEMENT LEARNING FOR DIALOGUE POLICY OPTIMISATION
    Tegho, Christopher
    Budzianowski, Pawel
    Gasic, Milica
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6069 - 6073
  • [19] Cryptocurrency Portfolio Management with Deep Reinforcement Learning
    Jiang, Zhengyao
    Liang, Jinjun
    PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2017, : 905 - 913
  • [20] Deep Reinforcement Learning for Quantitative Portfolio Management
    Wei, Ziqiang
    Chen, Deng
    2023 THE 6TH INTERNATIONAL CONFERENCE ON ROBOT SYSTEMS AND APPLICATIONS, ICRSA 2023, 2023, : 237 - 242