Multi-agent adaptive dynamic programming

被引:0
|
作者
Mukhopadhyay, S [1 ]
Varghese, J [1 ]
机构
[1] Indiana Univ Purdue Univ, Dept Comp & Informat Sci, Indianapolis, IN 46202 USA
关键词
adaptive dynamic programming; Markov decision process; reinforcement learning; multiple learning agents; knowledge combining;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dynamic programming offers an exact, general solution method for completely known sequential decision problems, formulated as Markov Decision Processes (MDP), with a finite number of states. Recently, there has been a great amount of interest in the adaptive version of the problem, where the task to be solved is not completely known a priori. In such a case, an agent has to acquire the necessary knowledge through learning, while simultaneously solving the optimal control or decision problem. A large variety of algorithms, variously known as Adaptive Dynamic Programming (ADP) or Reinforcement Learning (RL), has been proposed in the literature. However, almost invariably such algorithms suffer from slow convergence in terms of the number of experiments needed. In this paper Re investigate how the learning speed can be considerably improved by exploiting and combining knowledge accumulated by multiple agents. These agents operate in the same task environment but follow possibly different trajectories. We discuss methods of combining the knowledge structures associated with the multiple agents and different strategies (with varying overheads) for knowledge communication between agents. Results of simulation experiments are also presented to indicate that combining multiple learning agents is a promising direction to improve learning speed. The method also performs significantly better than some of the fastest MDP learning algorithms such as the prioritized sweeping.
引用
收藏
页码:574 / 585
页数:12
相关论文
共 50 条
  • [1] Adaptive Multi-Agent Programming in GTGolog
    Finzi, Alberto
    Lukasiewicz, Thomas
    [J]. ECAI 2006, PROCEEDINGS, 2006, 141 : 753 - +
  • [2] Adaptive multi-agent programming in GTGolog
    Finzi, Alberto
    Lukasiewicz, Thomas
    [J]. KI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4314 : 389 - +
  • [3] Cooperative optimal output regulation of multi-agent systems using adaptive dynamic programming
    Gao, Weinan
    Jiang, Zhong-Ping
    Lewis, Frank L.
    Wang, Yebin
    [J]. 2017 AMERICAN CONTROL CONFERENCE (ACC), 2017, : 2674 - 2679
  • [4] Event-triggered Multi-agent Optimal Regulation Using Adaptive Dynamic Programming
    Zhong, Xiangnan
    He, Haibo
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [5] Modeling city logistics using adaptive dynamic programming based multi-agent simulation
    Firdausiyah, N.
    Taniguchi, E.
    Qureshi, A. G.
    [J]. TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2019, 125 : 74 - 96
  • [6] Dynamic adaptive autonomy in multi-agent systems
    Barber, KS
    Goel, A
    Martin, CE
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2000, 12 (02) : 129 - 147
  • [7] Dynamic Event-Triggered Consensus Control for Multi-Agent Systems Using Adaptive Dynamic Programming
    Zhang, Qi
    Yang, Yang
    Xie, Xiaoran
    Xu, Chunming
    Yang, Han
    [J]. IEEE ACCESS, 2022, 10 : 110285 - 110293
  • [8] Optimized Control for Human-Multi-Robot Collaboration via Multi-Agent Adaptive Dynamic Programming
    Liu, Xing
    Ge, Shuzhi Sam
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 9207 - 9212
  • [9] Adaptive Dynamic Programming and Cooperative Output Regulation of Discrete-time Multi-agent Systems
    Weinan Gao
    Yiyang Liu
    Adedapo Odekunle
    Yunjun Yu
    Pingli Lu
    [J]. International Journal of Control, Automation and Systems, 2018, 16 : 2273 - 2281
  • [10] Pinning consensus control for switched multi-agent systems: A switched adaptive dynamic programming method
    Qi, Yiwen
    Geng, Honglin
    [J]. NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2023, 48