Acquisition of coordinated behavior by modular Q-learning agents

被引：0

作者：

Ono, N

Ikeda, O

Fukumoto, K

机构：

来源：

IROS 96 - PROCEEDINGS OF THE 1996 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - ROBOTIC INTELLIGENCE INTERACTING WITH DYNAMIC WORLDS, VOLS 1-3 | 1996年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recent attempts to let monolithic reinforcement-learning agents synthesize coordinated behavior scale poorly to more complicated multi-agent learning problems where multiple learning agents play different roles and work together for the accomplishment of their common goal. These learning agents have to receive and respond to various sensory information from their partners as well as that from the physical environment itself Hence, their state spaces are subject to grow exponentially in the number of the partners. As an illustrative problem suffered from this kind of combinatorial explosion, we consider a modified version of the pursuit problem, and show how successfully a collection of modular Q-learning hunter agents synthesize coordinated decision policies needed to capture a randomly-fleeing prey agent effectively, by specializing their functionality and acquiring herding behavior.

引用

页码：1525 / 1529

页数：5

共 50 条

[1] An adaptive architecture for modular Q-learning
Kohri, T
Matsubayashi, K
Tokoro, M
IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 820 - 825
[2] Cooperative behavior acquisition for multi-agent systems by Q-learning
Xie, M. C.
Tachibana, A.
2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 424 - +
[3] Training and delayed reinforcements in Q-learning agents
Caironi, PVC
Dorigo, M
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1997, 12 (10) : 695 - 724
[4] Q-learning agents in a Cournot oligopoly model
Waltman, Ludo
Kaymak, Uzay
JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2008, 32 (10): : 3275 - 3293
[5] Q-Learning Transformation for Training on JADE Agents
Cepero-Perez, Nayma
Moreno-Espino, Mailyn
REVISTA DIGITAL LAMPSAKOS, 2015, (14): : 25 - 32
[6] Representation of the Perceived Environment and Acquisition of Behavior Rule for Multi-Agent Systems by Q-Learning
Xie, Mengchun
PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOTS AND AGENTS, 2009, : 398 - 402
[7] Reinforcement distribution in a team of cooperative Q-learning agents
Abbasi, Zahra
Abbasi, Mohammad Ali
PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, 2008, : 154 - +
[8] Cooperative behavior acquisition based modular Q learning in multi-agent system
Zhou, T
Hong, BR
Shi, CX
Zhou, HY
PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 205 - 210
[9] Acquisition of Movement Pattern by Q-Learning in Peristaltic Crawling Robot
Saga, Norihiko
Ikeda, Atsumasa
INTELLIGENT ROBOTICS AND APPLICATIONS, PROCEEDINGS, 2009, 5928 : 1163 - 1169
[10] Q-LEARNING
WATKINS, CJCH
DAYAN, P
MACHINE LEARNING, 1992, 8 (3-4) : 279 - 292

← 1 2 3 4 5 →