Multi-agent adaptive dynamic programming

被引：0

作者：

Mukhopadhyay, S ^{[1
]}

Varghese, J ^{[1
]}

机构：

[1] Indiana Univ Purdue Univ, Dept Comp & Informat Sci, Indianapolis, IN 46202 USA

来源：

MICAI 2000: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS | 2000年 / 1793卷

关键词：

adaptive dynamic programming; Markov decision process; reinforcement learning; multiple learning agents; knowledge combining;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Dynamic programming offers an exact, general solution method for completely known sequential decision problems, formulated as Markov Decision Processes (MDP), with a finite number of states. Recently, there has been a great amount of interest in the adaptive version of the problem, where the task to be solved is not completely known a priori. In such a case, an agent has to acquire the necessary knowledge through learning, while simultaneously solving the optimal control or decision problem. A large variety of algorithms, variously known as Adaptive Dynamic Programming (ADP) or Reinforcement Learning (RL), has been proposed in the literature. However, almost invariably such algorithms suffer from slow convergence in terms of the number of experiments needed. In this paper Re investigate how the learning speed can be considerably improved by exploiting and combining knowledge accumulated by multiple agents. These agents operate in the same task environment but follow possibly different trajectories. We discuss methods of combining the knowledge structures associated with the multiple agents and different strategies (with varying overheads) for knowledge communication between agents. Results of simulation experiments are also presented to indicate that combining multiple learning agents is a promising direction to improve learning speed. The method also performs significantly better than some of the fastest MDP learning algorithms such as the prioritized sweeping.

引用

页码：574 / 585

页数：12

共 50 条

[1] Adaptive Multi-Agent Programming in GTGolog
Finzi, Alberto
Lukasiewicz, Thomas
[J]. ECAI 2006, PROCEEDINGS, 2006, 141 : 753 - +
[2] Adaptive multi-agent programming in GTGolog
Finzi, Alberto
Lukasiewicz, Thomas
[J]. KI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4314 : 389 - +
[3] Cooperative optimal output regulation of multi-agent systems using adaptive dynamic programming
Gao, Weinan
Jiang, Zhong-Ping
Lewis, Frank L.
Wang, Yebin
[J]. 2017 AMERICAN CONTROL CONFERENCE (ACC), 2017, : 2674 - 2679
[4] Event-triggered Multi-agent Optimal Regulation Using Adaptive Dynamic Programming
Zhong, Xiangnan
He, Haibo
[J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[5] Modeling city logistics using adaptive dynamic programming based multi-agent simulation
Firdausiyah, N.
Taniguchi, E.
Qureshi, A. G.
[J]. TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2019, 125 : 74 - 96
[6] Dynamic adaptive autonomy in multi-agent systems
Barber, KS
Goel, A
Martin, CE
[J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2000, 12 (02) : 129 - 147
[7] Dynamic Event-Triggered Consensus Control for Multi-Agent Systems Using Adaptive Dynamic Programming
Zhang, Qi
Yang, Yang
Xie, Xiaoran
Xu, Chunming
Yang, Han
[J]. IEEE ACCESS, 2022, 10 : 110285 - 110293
[8] Optimized Control for Human-Multi-Robot Collaboration via Multi-Agent Adaptive Dynamic Programming
Liu, Xing
Ge, Shuzhi Sam
[J]. IFAC PAPERSONLINE, 2020, 53 (02): : 9207 - 9212
[9] Adaptive Dynamic Programming and Cooperative Output Regulation of Discrete-time Multi-agent Systems
Weinan Gao
Yiyang Liu
Adedapo Odekunle
Yunjun Yu
Pingli Lu
[J]. International Journal of Control, Automation and Systems, 2018, 16 : 2273 - 2281
[10] Pinning consensus control for switched multi-agent systems: A switched adaptive dynamic programming method
Qi, Yiwen
Geng, Honglin
[J]. NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2023, 48

← 1 2 3 4 5 →