Acquisition of coordinated behavior by modular Q-learning agents

被引：0

作者：

Ono, N

Ikeda, O

Fukumoto, K

机构：

来源：

IROS 96 - PROCEEDINGS OF THE 1996 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - ROBOTIC INTELLIGENCE INTERACTING WITH DYNAMIC WORLDS, VOLS 1-3 | 1996年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recent attempts to let monolithic reinforcement-learning agents synthesize coordinated behavior scale poorly to more complicated multi-agent learning problems where multiple learning agents play different roles and work together for the accomplishment of their common goal. These learning agents have to receive and respond to various sensory information from their partners as well as that from the physical environment itself Hence, their state spaces are subject to grow exponentially in the number of the partners. As an illustrative problem suffered from this kind of combinatorial explosion, we consider a modified version of the pursuit problem, and show how successfully a collection of modular Q-learning hunter agents synthesize coordinated decision policies needed to capture a randomly-fleeing prey agent effectively, by specializing their functionality and acquiring herding behavior.

引用

页码：1525 / 1529

页数：5

共 50 条

[11] Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning
Tan, Fuxiao
Yan, Pengfei
Guan, Xinping
NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 475 - 483
[12] Backward Q-learning: The combination of Sarsa algorithm and Q-learning
Wang, Yin-Hao
Li, Tzuu-Hseng S.
Lin, Chih-Jui
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (09) : 2184 - 2193
[13] Modular Production Control with Multi-Agent Deep Q-Learning
Gankin, Dennis
Mayer, Sebastian
Zinn, Jonas
Vogel-Heuser, Birgit
Endisch, Christian
2021 26TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2021,
[14] A study on expertise of agents and its effects on cooperative Q-learning
Araabi, Babak Nadjar
Mastoureshgh, Sahar
Ahmadabadi, Majid Nili
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02): : 398 - 409
[15] Successful cooperation between heterogeneous fuzzy Q-learning agents
Bitaghsir, AA
Moghimi, A
Lesani, M
Keramati, MM
Ahmadabadi, MN
Arabi, BN
2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 5579 - 5583
[16] Q-Learning of Spatial Actions for Hierarchical Planner of Cognitive Agents
Kiselev, Gleb
Panov, Aleksandr
INTERACTIVE COLLABORATIVE ROBOTICS, ICR 2020, 2020, 12336 : 160 - 169
[17] EAQR: A Multiagent Q-Learning Algorithm for Coordination of Multiple Agents
Zhang, Zhen
Wang, Dongqing
COMPLEXITY, 2018,
[18] Q-Learning based supplier-agents for electricity markets
Rahimi-Kian, A
Sadeghi, B
Thomas, RJ
2005 IEEE POWER ENGINEERING SOCIETY GENERAL MEETING, VOLS, 1-3, 2005, : 420 - 427
[19] Implementation of Fuzzy Q-Learning Based on Modular Fuzzy Model and Parallel Structured Learning
Watanabe, Toshihiko
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 1338 - 1344
[20] The acquisition of sociality by using Q-learning in a multi-agent environment
Nagayuki, Yasuo
PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 16TH '11), 2011, : 820 - 823

← 1 2 3 4 5 →