An approach to the pursuit problem on a heterogeneous multiagent system using reinforcement learning

被引:48
|
作者
Ishiwaka, Y
Sato, T
Kakazu, Y
机构
[1] Hakodate Natl Coll Technol, Hakodate, Hokkaido, Japan
[2] Future Univ Hakodate, Hakodate, Hokkaido, Japan
[3] Hokkaido Univ, Sapporo, Hokkaido, Japan
关键词
pursuit problem; prediction; Q-learning; emergence; heterogeneous multiagent system;
D O I
10.1016/S0921-8890(03)00040-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cooperation among agents is important for multiagent systems having a shared goal. In this paper, an example of the pursuit problem is studied, in which four hunters collaborate to catch a target. A reinforcement learning algorithm is employed to model how the hunters acquire this cooperative behavior to achieve the task. In order to apply Q-learning, which is one way of reinforcement learning, two kinds of prediction are needed for each hunter agent. One is the location of the other hunter agents and target agent, and the other is the movement direction of the target agent at next time step t. In our treatment we extend the standard problem to systems with heterogeneous agents. One motivation for this is that the target agent and hunter agents have differing abilities. In addition, even though those hunter agents are homogeneous at the beginning of the problem, their abilities become heterogeneous in the learning process. Simulations of this pursuit problem were performed on a continuous action state space, the results of which are displayed, accompanied by a discussion of their outcomes' dependence upon the initial locations of the hunters and the speeds of the hunters and a target. (C) 2003 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:245 / 256
页数:12
相关论文
共 50 条
  • [1] A REINFORCEMENT LEARNING APPROACH FOR MULTIAGENT NAVIGATION
    Martinez-Gil, Francisco
    Barber, Fernando
    Lozano, Miguel
    Grimaldo, Francisco
    Fernandez, Fernando
    [J]. ICAART 2010: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1: ARTIFICIAL INTELLIGENCE, 2010, : 607 - 610
  • [2] A HYBRID MULTIAGENT REINFORCEMENT LEARNING APPROACH USING STRATEGIES AND FUSION
    Partalas, Ioannis
    Feneris, Ioannis
    Vlahavas, Ioannis
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2008, 17 (05) : 945 - 962
  • [3] Multiagent Reinforcement Learning With Heterogeneous Graph Attention Network
    Du, Wei
    Ding, Shifei
    Zhang, Chenglong
    Shi, Zhongzhi
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 6851 - 6860
  • [4] A Policy Resonance Approach to Solve the Problem of Responsibility Diffusion in Multiagent Reinforcement Learning
    Fu, Qingxu
    Qiu, Tenghai
    Yi, Jianqiang
    Pu, Zhiqiang
    Ai, Xiaolin
    Yuan, Wanmai
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [5] Genetic reinforcement learning approach to the heterogeneous machine scheduling problem
    Kim, GH
    Lee, GSG
    [J]. IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1998, 14 (06): : 879 - 893
  • [6] Optimal Robust Output Containment of Unknown Heterogeneous Multiagent System Using Off-Policy Reinforcement Learning
    Zuo, Shan
    Song, Yongduan
    Lewis, Frank L.
    Davoudi, Ali
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (11) : 3197 - 3207
  • [7] Reinforcement learning for encouraging cooperation in a multiagent system
    Jiang, Wei-Cheng
    Huang, Hong-Hao
    Wang, Yu-Teng
    [J]. INFORMATION SCIENCES, 2024, 680
  • [8] UAV Pursuit using Reinforcement Learning
    Bonnet, Alexandre
    Akhloufi, Moulay A.
    [J]. UNMANNED SYSTEMS TECHNOLOGY XXI, 2019, 11021
  • [9] Asymmetric Self-Play-Enabled Intelligent Heterogeneous Multirobot Catching System Using Deep Multiagent Reinforcement Learning
    Gao, Yuan
    Chen, Junfeng
    Chen, Xi
    Wang, Chongyang
    Hu, Junjie
    Deng, Fuqin
    Lam, Tin Lun
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (04) : 2603 - 2622
  • [10] Multiagent reinforcement learning using function approximation
    Abul, O
    Polat, F
    Alhajj, R
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2000, 30 (04): : 485 - 497