Optimal tracking agent: a new framework of reinforcement learning for multiagent systems

被引：3

作者：

Cao, Weihua ^{[1
]}

Chen, Gang ^{[1
]}

Chen, Xin ^{[1
]}

Wu, Min ^{[1
]}

机构：

[1] Cent South Univ, Inst Adv Control & Intelligent Automat, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China

来源：

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2013年 / 25卷 / 14期

基金：

高等学校博士学科点专项科研基金;

关键词：

estimator; action selection mechanism; curse of dimensionality; optimal tracking agent; multiagent systems;

D O I：

10.1002/cpe.2870

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

SUMMARYThe curse of dimensionality is a ubiquitous problem for multiagent reinforcement learning, which means the learning and storing space grows exponentially with the number of agents and hinders the application of multiagent reinforcement learning. To relieve this problem, we propose a new framework named as optimal tracking agent (OTA). The OTA views the other agents as part of the environment and uses a reduced form to learn the optimal decision. Although merging other agents into the environment may reduce the dimension of action space, the environment characterized by such form is dynamic and does not satisfy the convergence of reinforcement learning (RL). Thus, we develop an estimator to track the dynamics of the environment. The estimator obtains the dynamic model, and then the model-based RL can be used to react to the dynamic environment optimally. Because the Q-function in OTA is also a dynamic process because of other agents' dynamics, different from traditional RL, in which the learning is a stationary process and the usual action selection mechanisms just suit to such stationary process, we improve the greedy action selection mechanism to adapt to such dynamics. Thus, the OTA will have convergence. An experiment illustrates the validity and efficiency of the OTA.Copyright (c) 2012 John Wiley & Sons, Ltd.

引用

页码：2002 / 2015

页数：14

共 50 条

[31] Implementing Traffic Signal Optimal Control by Multiagent Reinforcement Learning
Song, Jiong
Jin, Zhao
Zhu, WenJun
2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 2578 - 2582
[32] Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning
Yang, Xindi
Zhang, Hao
Wang, Zhuping
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3872 - 3883
[33] Adaptive Optimal Consensus Control of Multiagent Systems With Unknown Dynamics and Disturbances via Reinforcement Learning
Chen L.
Dong C.
Dai S.-L.
IEEE Transactions on Artificial Intelligence, 2024, 5 (05): : 2193 - 2203
[34] Prescribed-Time Optimal Consensus for Switched Stochastic Multiagent Systems: Reinforcement Learning Strategy
Guang, Weiwei
Wang, Xin
Tan, Lihua
Sun, Jian
Huang, Tingwen
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
[35] DTDE: A new cooperative multi-agent reinforcement learning framework
Wen, Guanghui
Fu, Junjie
Dai, Pengcheng
Zhou, Jialing
INNOVATION, 2021, 2 (04):
[36] Beyond Reinforcement Learning and Local View in Multiagent Systems
Bazzan, Ana L. C.
KUNSTLICHE INTELLIGENZ, 2014, 28 (03): : 179 - 189
[37] Reinforcement Learning With Task Decomposition for Cooperative Multiagent Systems
Sun, Changyin
Liu, Wenzhang
Dong, Lu
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) : 2054 - 2065
[38] Optimal Tracking Control of a Nonlinear Multiagent System Using Q-Learning via Event-Triggered Reinforcement Learning
Wang, Ziwei
Wang, Xin
Tang, Yijie
Liu, Ying
Hu, Jun
ENTROPY, 2023, 25 (02)
[39] Adversarial Search and Tracking with Multiagent Reinforcement Learning in Sparsely Observable Environment
Wu, Zixuan
Ye, Sean
Natarajan, Manisha
Chen, Letian
Paleja, Rohan
Gombolay, Matthew C.
2023 INTERNATIONAL SYMPOSIUM ON MULTI-ROBOT AND MULTI-AGENT SYSTEMS, MRS, 2023, : 43 - 49
[40] Deep Reinforcement Learning for the Optimal Angle Control of Tracking Bifacial Photovoltaic Systems
Tsuchida, Shuto
Nonaka, Hirofumi
Yamada, Noboru
ENERGIES, 2022, 15 (21)

← 1 2 3 4 5 →