Optimal tracking agent: a new framework of reinforcement learning for multiagent systems

被引:3
|
作者
Cao, Weihua [1 ]
Chen, Gang [1 ]
Chen, Xin [1 ]
Wu, Min [1 ]
机构
[1] Cent South Univ, Inst Adv Control & Intelligent Automat, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
来源
基金
高等学校博士学科点专项科研基金;
关键词
estimator; action selection mechanism; curse of dimensionality; optimal tracking agent; multiagent systems;
D O I
10.1002/cpe.2870
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
SUMMARYThe curse of dimensionality is a ubiquitous problem for multiagent reinforcement learning, which means the learning and storing space grows exponentially with the number of agents and hinders the application of multiagent reinforcement learning. To relieve this problem, we propose a new framework named as optimal tracking agent (OTA). The OTA views the other agents as part of the environment and uses a reduced form to learn the optimal decision. Although merging other agents into the environment may reduce the dimension of action space, the environment characterized by such form is dynamic and does not satisfy the convergence of reinforcement learning (RL). Thus, we develop an estimator to track the dynamics of the environment. The estimator obtains the dynamic model, and then the model-based RL can be used to react to the dynamic environment optimally. Because the Q-function in OTA is also a dynamic process because of other agents' dynamics, different from traditional RL, in which the learning is a stationary process and the usual action selection mechanisms just suit to such stationary process, we improve the greedy action selection mechanism to adapt to such dynamics. Thus, the OTA will have convergence. An experiment illustrates the validity and efficiency of the OTA.Copyright (c) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:2002 / 2015
页数:14
相关论文
共 50 条
  • [1] Optimal Tracking Agent: A New Framework for Multi-Agent Reinforcement Learning
    Cao, Weihua
    Chen, Gang
    Chen, Xin
    Wu, Min
    TRUSTCOM 2011: 2011 INTERNATIONAL JOINT CONFERENCE OF IEEE TRUSTCOM-11/IEEE ICESS-11/FCST-11, 2011, : 1328 - 1334
  • [2] An Advising Framework for Multiagent Reinforcement Learning Systems
    da Silva, Felipe Leno
    Glatt, Ruben
    Reali Costa, Anna Helena
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4913 - 4914
  • [3] An Evolutionary Transfer Reinforcement Learning Framework for Multiagent Systems
    Hou, Yaqing
    Ong, Yew-Soon
    Feng, Liang
    Zurada, Jacek M.
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2017, 21 (04) : 601 - 615
  • [4] Timesharing-Tracking: a new framework for Decentralized Reinforcement Learning in Cooperative Multi-Agent Systems
    Fu Bo
    Chen Xin
    He Yong
    Wu Min
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 7054 - 7059
  • [5] Reinforcement Learning for Optimal Tracking and Regulation: A Unified Framework
    Lewis, F. L.
    Modares, H.
    Kiumarsi, B.
    2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 5082 - 5082
  • [6] iEnsemble: A Framework for Committee Machine Based on Multiagent Systems with Reinforcement Learning
    Uber Junior, Arnoldo
    de Freitas Filho, Paulo Jose
    Silveira, Ricardo Azambuja
    Costa e Lima, Mariana Dehon
    Reitz, Rodolfo Wilvert
    ADVANCES IN SOFT COMPUTING, MICAI 2016, PT II, 2017, 10062 : 65 - 80
  • [7] Achieving Socially Optimal Outcomes in Multiagent Systems with Reinforcement Social Learning
    Hao, Jianye
    Leung, Ho-Fung
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2013, 8 (03)
  • [8] Temporal and Agent Abstractions in Multiagent Reinforcement Learning
    Clement, Danielle M.
    Huber, Manfred
    2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 2190 - 2195
  • [9] An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning
    Yang, Tianpei
    Wang, Weixun
    Tang, Hongyao
    Hao, Jianye
    Meng, Zhaopeng
    Mao, Hangyu
    Li, Dong
    Liu, Wulong
    Zhang, Chengwei
    Hu, Yujing
    Chen, Yingfeng
    Fan, Changjie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] Blockchain-Based Distributed Multiagent Reinforcement Learning for Collaborative Multiobject Tracking Framework
    Shen, Jiahao
    Sheng, Hao
    Wang, Shuai
    Cong, Ruixuan
    Yang, Da
    Zhang, Yang
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (03) : 778 - 788