Acquisition of coordinated behavior by modular Q-learning agents

被引：0

作者：

Ono, N

Ikeda, O

Fukumoto, K

机构：

来源：

IROS 96 - PROCEEDINGS OF THE 1996 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - ROBOTIC INTELLIGENCE INTERACTING WITH DYNAMIC WORLDS, VOLS 1-3 | 1996年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recent attempts to let monolithic reinforcement-learning agents synthesize coordinated behavior scale poorly to more complicated multi-agent learning problems where multiple learning agents play different roles and work together for the accomplishment of their common goal. These learning agents have to receive and respond to various sensory information from their partners as well as that from the physical environment itself Hence, their state spaces are subject to grow exponentially in the number of the partners. As an illustrative problem suffered from this kind of combinatorial explosion, we consider a modified version of the pursuit problem, and show how successfully a collection of modular Q-learning hunter agents synthesize coordinated decision policies needed to capture a randomly-fleeing prey agent effectively, by specializing their functionality and acquiring herding behavior.

引用

页码：1525 / 1529

页数：5

共 50 条

[31] Convex Q-Learning
Lu, Fan
Mehta, Prashant G.
Meyn, Sean P.
Neu, Gergely
2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 4749 - 4756
[32] Fuzzy Q-learning
Glorennec, PY
Jouffe, L
PROCEEDINGS OF THE SIXTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS I - III, 1997, : 659 - 662
[33] Q-learning and robotics
Touzet, CF
Santos, JM
SIMULATION IN INDUSTRY 2001, 2001, : 685 - 689
[34] Periodic Q-Learning
Lee, Donghwan
He, Niao
LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 582 - 598
[35] Q-learning automaton
Qian, F
Hirata, H
IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 432 - 437
[36] Mutual Q-learning
Reid, Cameron
Mukhopadhyay, Snehasis
2020 3RD INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTS (ICCR 2020), 2020, : 128 - 133
[37] Neural Q-learning
Stephan ten Hagen
Ben Kröse
Neural Computing & Applications, 2003, 12 : 81 - 88
[38] Robust Q-Learning
Ertefaie, Ashkan
McKay, James R.
Oslin, David
Strawderman, Robert L.
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (533) : 368 - 381
[39] Modular Q-learning based multi-agent cooperation for robot soccer
Park, KH
Kim, YJ
Kim, JH
ROBOTICS AND AUTONOMOUS SYSTEMS, 2001, 35 (02) : 109 - 122
[40] Neural Q-learning
ten Hagen, S
Kröse, B
NEURAL COMPUTING & APPLICATIONS, 2003, 12 (02): : 81 - 88

← 1 2 3 4 5 →