A Collaborative Multi-agent Reinforcement Learning Framework for Dialog Action Decomposition

被引:0
|
作者
Wang, Huimin [1 ]
Wong, Kam-Fai [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most reinforcement learning methods for dialog policy learning train a centralized agent that selects a predefined joint action concatenating domain name, intent type, and slot name. The centralized dialog agent suffers from a great many user-agent interaction requirements due to the large action space. Besides, designing the concatenated actions is laborious to engineers and maybe struggled with edge cases. To solve these problems, we model the dialog policy learning problem with a novel multi-agent framework, in which each part of the action is led by a different agent. The framework reduces labor costs for action templates and decreases the size of the action space for each agent. Furthermore, we relieve the non-stationary problem caused by the changing dynamics of the environment as evolving of agents' policies by introducing a joint optimization process that makes agents can exchange their policy information. Concurrently, an independent experience replay buffer mechanism is integrated to reduce the dependence between gradients of samples to improve training efficiency. The effectiveness of the proposed framework is demonstrated in a multi-domain environment with both user simulator evaluation and human evaluation.
引用
收藏
页码:7882 / 7889
页数:8
相关论文
共 50 条
  • [21] Hierarchical Reinforcement Learning Framework towards Multi-agent Navigation
    Ding, Wenhao
    Li, Shuaijun
    Qian, Huihuan
    Chen, Yongquan
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 237 - 242
  • [22] Towards a Distributed Framework for Multi-Agent Reinforcement Learning Research
    Zhou, Yutai
    Manuel, Shawn
    Morales, Peter
    Li, Sheng
    Pena, Jaime
    Allen, Ross
    2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [23] A synchronous multi-agent reinforcement learning framework for UVMS grasping
    Chen, Yanhu
    Tu, Zhangpeng
    Zhang, Suohang
    Zhou, Jifei
    Yang, Canjun
    OCEAN ENGINEERING, 2024, 307
  • [24] DTDE: A new cooperative multi-agent reinforcement learning framework
    Wen, Guanghui
    Fu, Junjie
    Dai, Pengcheng
    Zhou, Jialing
    INNOVATION, 2021, 2 (04):
  • [25] Multi-agent Reinforcement Learning Model for Effective Action Selection
    Youk, Sang Jo
    Lee, Bong Keun
    INFORMATION SECURITY AND ASSURANCE, 2010, 76 : 309 - +
  • [26] PowerGridworld: A Framework for Multi-Agent Reinforcement Learning in Power Systems
    Biagioni, David
    Zhang, Xiangyu
    Wald, Dylan
    Vaidhynathan, Deepthi
    Chintala, Rohit
    King, Jennifer
    Zamzam, Ahmed S.
    PROCEEDINGS OF THE 2022 THE THIRTEENTH ACM INTERNATIONAL CONFERENCE ON FUTURE ENERGY SYSTEMS, E-ENERGY 2022, 2022, : 565 - 570
  • [27] Multi-agent reinforcement learning with bidding for segmenting action sequences
    Sun, R
    Sessions, C
    FROM ANIMALS TO ANIMATS 6, 2000, : 317 - 324
  • [28] Action Prediction for Cooperative Exploration in Multi-agent Reinforcement Learning
    Zhang, Yanqiang
    Feng, Dawei
    Ding, Bo
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II, 2024, 14448 : 358 - 372
  • [29] Multi-Agent Reinforcement Learning Algorithm Based on Action Prediction
    童亮
    陆际联
    Journal of Beijing Institute of Technology, 2006, (02) : 133 - 137
  • [30] Multi-Agent Reinforcement Learning with Optimal Equivalent Action of Neighborhood
    Wang, Haixing
    Yang, Yi
    Lin, Zhiwei
    Wang, Tian
    ACTUATORS, 2022, 11 (04)