An approach to the pursuit problem on a heterogeneous multiagent system using reinforcement learning

被引:49
|
作者
Ishiwaka, Y
Sato, T
Kakazu, Y
机构
[1] Hakodate Natl Coll Technol, Hakodate, Hokkaido, Japan
[2] Future Univ Hakodate, Hakodate, Hokkaido, Japan
[3] Hokkaido Univ, Sapporo, Hokkaido, Japan
关键词
pursuit problem; prediction; Q-learning; emergence; heterogeneous multiagent system;
D O I
10.1016/S0921-8890(03)00040-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cooperation among agents is important for multiagent systems having a shared goal. In this paper, an example of the pursuit problem is studied, in which four hunters collaborate to catch a target. A reinforcement learning algorithm is employed to model how the hunters acquire this cooperative behavior to achieve the task. In order to apply Q-learning, which is one way of reinforcement learning, two kinds of prediction are needed for each hunter agent. One is the location of the other hunter agents and target agent, and the other is the movement direction of the target agent at next time step t. In our treatment we extend the standard problem to systems with heterogeneous agents. One motivation for this is that the target agent and hunter agents have differing abilities. In addition, even though those hunter agents are homogeneous at the beginning of the problem, their abilities become heterogeneous in the learning process. Simulations of this pursuit problem were performed on a continuous action state space, the results of which are displayed, accompanied by a discussion of their outcomes' dependence upon the initial locations of the hunters and the speeds of the hunters and a target. (C) 2003 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:245 / 256
页数:12
相关论文
共 50 条
  • [41] Reinforcement Learning for Synchronization of Heterogeneous Multiagent Systems by Improved Q-Functions
    Li, Jinna
    Yuan, Lin
    Cheng, Weiran
    Chai, Tianyou
    Lewis, Frank L.
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (11) : 6545 - 6558
  • [42] Evolving Equilibrium Policies for a Multiagent Reinforcement Learning Problem with State Attractors
    Leon, Florin
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT II: THIRD INTERNATIONAL CONFERENCE, ICCCI 2011, 2011, 6923 : 201 - 210
  • [43] Signal learning with messages by reinforcement learning in multi-agent pursuit problem
    Noro, Kozue
    Tenmoto, Hiroshi
    Kamiya, Akimoto
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 18TH ANNUAL CONFERENCE, KES-2014, 2014, 35 : 233 - 240
  • [44] Asymmetric multiagent reinforcement learning
    Könönen, V
    IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 336 - 342
  • [45] Adaptive Learning: A New Decentralized Reinforcement Learning Approach for Cooperative Multiagent Systems
    Li, Meng-Lin
    Chen, Shaofei
    Chen, Jing
    IEEE ACCESS, 2020, 8 : 99404 - 99421
  • [46] A novel approach to multiagent reinforcement learning: Utilizing OLAP mining in the learning process
    Kaya, M
    Alhajj, R
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2005, 35 (04): : 582 - 590
  • [47] Learning Multiagent Options for Tabular Reinforcement Learning using Factor Graphs
    Chen J.
    Chen J.
    Lan T.
    Aggarwal V.
    IEEE Transactions on Artificial Intelligence, 2023, 4 (05): : 1141 - 1153
  • [48] Design Environment of Reinforcement Learning Agents for Intelligent Multiagent System
    Itazuro, Syo
    Uchiya, Takahiro
    Takumi, Ichi
    Kinoshita, Tetsuo
    2012 SEVENTH INTERNATIONAL CONFERENCE ON BROADBAND, WIRELESS COMPUTING, COMMUNICATION AND APPLICATIONS (BWCCA 2012), 2012, : 679 - 683
  • [49] Multiagent reinforcement learning method with an improved ant colony system
    Sun, RY
    Shoji, T
    Zhao, G
    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 1612 - 1617
  • [50] Reinforcement Learning Algorithms For Navigating Multiagent In Auto Storage System
    Hieu The Pham
    Thang Quoc Nguyen
    Thai-Minh Truong
    Thanh-Binh Tran
    Thinh Ba Vuong
    PROCEEDINGS OF THE 2024 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION TECHNOLOGY, ICIIT 2024, 2024, : 264 - 271