Acquisition of coordinated behavior by modular Q-learning agents

被引:0
|
作者
Ono, N
Ikeda, O
Fukumoto, K
机构
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent attempts to let monolithic reinforcement-learning agents synthesize coordinated behavior scale poorly to more complicated multi-agent learning problems where multiple learning agents play different roles and work together for the accomplishment of their common goal. These learning agents have to receive and respond to various sensory information from their partners as well as that from the physical environment itself Hence, their state spaces are subject to grow exponentially in the number of the partners. As an illustrative problem suffered from this kind of combinatorial explosion, we consider a modified version of the pursuit problem, and show how successfully a collection of modular Q-learning hunter agents synthesize coordinated decision policies needed to capture a randomly-fleeing prey agent effectively, by specializing their functionality and acquiring herding behavior.
引用
收藏
页码:1525 / 1529
页数:5
相关论文
共 50 条
  • [1] An adaptive architecture for modular Q-learning
    Kohri, T
    Matsubayashi, K
    Tokoro, M
    IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 820 - 825
  • [2] Cooperative behavior acquisition for multi-agent systems by Q-learning
    Xie, M. C.
    Tachibana, A.
    2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 424 - +
  • [3] Training and delayed reinforcements in Q-learning agents
    Caironi, PVC
    Dorigo, M
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1997, 12 (10) : 695 - 724
  • [4] Q-learning agents in a Cournot oligopoly model
    Waltman, Ludo
    Kaymak, Uzay
    JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2008, 32 (10): : 3275 - 3293
  • [5] Q-Learning Transformation for Training on JADE Agents
    Cepero-Perez, Nayma
    Moreno-Espino, Mailyn
    REVISTA DIGITAL LAMPSAKOS, 2015, (14): : 25 - 32
  • [6] Representation of the Perceived Environment and Acquisition of Behavior Rule for Multi-Agent Systems by Q-Learning
    Xie, Mengchun
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOTS AND AGENTS, 2009, : 398 - 402
  • [7] Reinforcement distribution in a team of cooperative Q-learning agents
    Abbasi, Zahra
    Abbasi, Mohammad Ali
    PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, 2008, : 154 - +
  • [8] Cooperative behavior acquisition based modular Q learning in multi-agent system
    Zhou, T
    Hong, BR
    Shi, CX
    Zhou, HY
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 205 - 210
  • [9] Acquisition of Movement Pattern by Q-Learning in Peristaltic Crawling Robot
    Saga, Norihiko
    Ikeda, Atsumasa
    INTELLIGENT ROBOTICS AND APPLICATIONS, PROCEEDINGS, 2009, 5928 : 1163 - 1169
  • [10] Q-LEARNING
    WATKINS, CJCH
    DAYAN, P
    MACHINE LEARNING, 1992, 8 (3-4) : 279 - 292