Learning Complementary Multiagent Behaviors: A Case Study

Authors
Kalyanakrishnan, Shivaram [1]
Stone, Peter [1]
Affiliations
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As machine learning is applied to increasingly complex tasks, it is likely that the diverse challenges encountered can only be addressed by combining the strengths of different learning algorithms. We examine this aspect of learning through a case study grounded in the robot soccer context. The task we consider is Keepaway, a popular benchmark for multiagent reinforcement learning from the simulation soccer domain. Whereas previous successful results in Keepaway have limited learning to an isolated, infrequent decision that amounts to a turn-taking behavior (passing), we expand the agents' learning capability to include a much more ubiquitous action (moving without the ball, or getting open), such that at any given time, multiple agents are executing learned behaviors simultaneously. We introduce a policy search method for learning "GETOPEN" to complement the temporal difference learning approach employed for learning "PASS". Empirical results indicate that the learned GETOPEN policy matches the best hand-coded policy for this task, and outperforms the best policy found when PASS is learned. We demonstrate that PASS and GETOPEN can be learned simultaneously to realize tightly-coupled soccer team behavior.
Pages: 153-165
Page count: 13