Learning Complementary Multiagent Behaviors: A Case Study

Cited by: 0
Authors
Kalyanakrishnan, Shivaram [1]
Stone, Peter [1]
Affiliation
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As machine learning is applied to increasingly complex tasks, it is likely that the diverse challenges encountered can only be addressed by combining the strengths of different learning algorithms. We examine this aspect of learning through a case study grounded in the robot soccer context. The task we consider is Keepaway, a popular benchmark for multiagent reinforcement learning from the simulation soccer domain. Whereas previous successful results in Keepaway have limited learning to an isolated, infrequent decision that amounts to a turn-taking behavior (passing), we expand the agents' learning capability to include a much more ubiquitous action (moving without the ball, or getting open), such that at any given time, multiple agents are executing learned behaviors simultaneously. We introduce a policy search method for learning "GETOPEN" to complement the temporal difference learning approach employed for learning "PASS". Empirical results indicate that the learned GETOPEN policy matches the best hand-coded policy for this task, and outperforms the best policy found when PASS is learned. We demonstrate that PASS and GETOPEN can be learned simultaneously to realize tightly-coupled soccer team behavior.
Pages: 153-165
Page count: 13