Learning Complementary Multiagent Behaviors: A Case Study

被引:0
|
作者
Kalyanakrishnan, Shivaram [1 ]
Stone, Peter [1 ]
机构
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As machine learning is applied to increasingly complex tasks, it is likely that the diverse challenges encountered can only be addressed by combining the strengths of different learning algorithms. We examine this aspect of learning through a case study grounded in the robot soccer context. The task we consider is Keepaway, a popular benchmark for multiagent reinforcement learning from the simulation soccer domain. Whereas previous successful results in Keepaway have limited learning to an isolated, infrequent decision that amounts to a turn-taking behavior (passing), we expand the agents' learning capability to include a much more ubiquitous action (moving without the ball, or getting open), such that at any given time, multiple agents are executing learned behaviors simultaneously. We introduce a policy search method for learning "GETOPEN" to complement the temporal difference learning approach employed for learning "PASS". Empirical results indicate that the learned GETOPEN policy matches the best hand-coded policy for this task, and outperforms the best policy found when PASS is learned. We demonstrate that PASS and GETOPEN can be learned simultaneously to realize tightly-coupled soccer team behavior.
引用
收藏
页码:153 / 165
页数:13
相关论文
共 50 条
  • [31] Multiagent Incremental Learning in Networks
    Bourgne, Gauvain
    Seghrouchni, Amal El Fallah
    Maudet, Nicolas
    Soldano, Henry
    INTELLIGENT AGENTS AND MULTI-AGENT SYSTEMS, PROCEEDINGS, 2008, 5357 : 109 - +
  • [32] Learning Multiagent Communication with Backpropagation
    Sukhbaatar, Sainbayar
    Szlam, Arthur
    Fergus, Rob
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [33] Multiagent Learning Model in Grid
    Chen, QingKui
    Na, Lichun
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (8B): : 54 - 59
  • [34] Multiagent learning towards RoboCup
    Minoru Asada
    Eiji Uchibe
    New Generation Computing, 2001, 19 : 103 - 120
  • [35] Simultaneously Learning and Advising in Multiagent Reinforcement Learning
    da Silva, Felipe Leno
    Glatt, Ruben
    Reali Costa, Anna Helena
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1100 - 1108
  • [36] A multiagent cooperative learning algorithm
    Liu, Fei
    Zeng, Guangzhou
    COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN III, 2007, 4402 : 739 - +
  • [37] Conjectural Equilibrium in Multiagent Learning
    Michael P. Wellman
    Junling Hu
    Machine Learning, 1998, 33 : 179 - 200
  • [38] Asymmetric multiagent reinforcement learning
    Könönen, V
    IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 336 - 342
  • [39] Collective learning in multiagent systems
    Calderoni, S
    ECAI 1998: 13TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 465 - 466
  • [40] Partition Learning for Multiagent Planning
    Wood, Jared
    Hedrick, J. Karl
    JOURNAL OF ROBOTICS, 2012, 2012