Acquisition of coordinated behavior by modular Q-learning agents

Authors
Ono, N
Ikeda, O
Fukumoto, K
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Recent attempts to let monolithic reinforcement-learning agents synthesize coordinated behavior scale poorly to more complicated multi-agent learning problems, where multiple learning agents play different roles and work together to accomplish a common goal. These learning agents have to receive and respond to sensory information from their partners as well as from the physical environment itself. Hence, their state spaces tend to grow exponentially in the number of partners. As an illustrative problem that suffers from this kind of combinatorial explosion, we consider a modified version of the pursuit problem and show how successfully a collection of modular Q-learning hunter agents synthesizes the coordinated decision policies needed to capture a randomly fleeing prey agent effectively, by specializing their functionality and acquiring herding behavior.
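The scaling argument in the abstract can be illustrated with a rough sketch. Nothing below is taken from the record itself: it assumes that each hunter senses the prey and each partner as one of a fixed number of relative cells, that each learning module attends to the prey plus a single partner, and that the modules' Q-values are simply summed to choose an action. The names `monolithic_states`, `modular_states`, and `ModularQAgent`, and all parameter values, are hypothetical.

```python
import random
from collections import defaultdict

ACTIONS = ["stay", "up", "down", "left", "right"]


def monolithic_states(cells, partners):
    """State count when one table sees the prey and every partner at once."""
    return cells ** (partners + 1)


def modular_states(cells, partners):
    """Total state count when each module sees the prey and one partner."""
    return partners * cells ** 2


class ModularQAgent:
    """Hypothetical hunter: one tabular Q-learning module per partner."""

    def __init__(self, partners, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.tables = [defaultdict(float) for _ in range(partners)]
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.rng = random.Random(seed)

    def merged_q(self, obs, action):
        # Assumed merging rule: sum the modules' Q-values for this action.
        prey, partner_cells = obs
        return sum(t[((prey, p), action)]
                   for t, p in zip(self.tables, partner_cells))

    def act(self, obs):
        # Epsilon-greedy choice over the merged Q-values.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(ACTIONS)
        return max(ACTIONS, key=lambda a: self.merged_q(obs, a))

    def update(self, obs, action, reward, next_obs):
        # Each module runs an ordinary Q-learning update on its own sub-state.
        prey, partner_cells = obs
        next_prey, next_partner_cells = next_obs
        for t, p, q in zip(self.tables, partner_cells, next_partner_cells):
            best_next = max(t[((next_prey, q), a)] for a in ACTIONS)
            key = ((prey, p), action)
            t[key] += self.alpha * (reward + self.gamma * best_next - t[key])


if __name__ == "__main__":
    # With an assumed 7 x 7 sensing window (49 cells) and 3 partners:
    print(monolithic_states(49, 3), modular_states(49, 3))  # 5764801 7203
```

Under these assumptions the single-table state count grows as a power of the number of partners, while the modular total grows only linearly, which is the kind of saving the abstract attributes to the modular architecture.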
Pages: 1525-1529
Page count: 5