Reinforcement learning for RoboCup soccer keepaway

被引:184
|
作者
Stone, P
Sutton, RS
Kuhlmann, G
机构
[1] Univ Texas, Dept Comp Sci, Austin, TX 78712 USA
[2] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2M7, Canada
关键词
multiagent systems; machine learning; multiagent learning; reinforcement learning; robot soccer;
D O I
10.1177/105971230501300301
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RoboCup simulated soccer presents many challenges to reinforcement learning methods, including a large state space, hidden and uncertain state, multiple independent agents learning simultaneously, and long and variable delays in the effects of actions. We describe our application of episodic SMDP Sarsa(lambda) with linear tile-coding function approximation and variable; to learning higher-level decisions in a keepaway subtask of RoboCup soccer. In keepaway, one team, "the keepers," tries to keep control of the ball for as long as possible despite the efforts of "the takers." The keepers learn individually when to hold the ball and when to pass to a teammate. Our agents learned policies that significantly outperform a range of benchmark policies. We demonstrate the generality of our approach by applying it to a number of task variations including different field sizes and different numbers of players on each team.
引用
收藏
页码:165 / 188
页数:24
相关论文
共 50 条
  • [1] Argumentation-Based Reinforcement Learning for RoboCup Soccer Keepaway
    Gao, Yang
    Toni, Francesca
    Craven, Robert
    [J]. 20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 342 - 347
  • [2] Reinforcement Learning in RoboCup KeepAway with Partial Observability
    Devlin, Sam
    Grzes, Marek
    Kudenko, Daniel
    [J]. 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 2, 2009, : 201 - 208
  • [3] Concurrent Hierarchical Reinforcement Learning for RoboCup Keepaway
    Bai, Aijun
    Russell, Stuart
    Chen, Xiaoping
    [J]. ROBOCUP 2017: ROBOT WORLD CUP XXI, 2018, 11175 : 190 - 203
  • [4] Argumentation-Based Reinforcement Learning for RoboCup Keepaway
    Gao, Yang
    Toni, Francesca
    Craven, Robert
    [J]. COMPUTATIONAL MODELS OF ARGUMENT, 2012, 245 : 519 - +
  • [5] Learning of Keepaway Task for RoboCup Soccer Agent Based on Fuzzy Q-Learning
    Sawa, Toru
    Watanabe, Toshihiko
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 250 - 256
  • [6] Reinforcement learning in simulation robocup soccer
    Cheng, XY
    Yuan, XH
    Pan, LH
    Xia, DS
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 244 - 248
  • [7] Teamwork formation for Keepaway in robotics soccer (reinforcement learning approach)
    Tanaka, Nobuyuki
    Arai, Sachiyo
    [J]. AGENT COMPUTING AND MULTI-AGENT SYSTEMS, 2006, 4088 : 279 - 292
  • [8] Reinforcement learning of player agents in RoboCup soccer simulation
    Sarje, A
    Chawre, A
    Nair, SB
    [J]. HIS'04: Fourth International Conference on Hybrid Intelligent Systems, Proceedings, 2005, : 480 - 481
  • [9] Argumentation-Based Reinforcement Learning for RoboCup Soccer Takeaway
    Gao, Yang
    Toni, Francesca
    [J]. AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1411 - 1412
  • [10] Half field offense in RoboCup soccer: A multiagent reinforcement learning case study
    Kalyanakrishnan, Shivaram
    Liu, Yaxin
    Stone, Peter
    [J]. ROBOCUP 2006: ROBOT SOCCER WORLD CUP X, 2007, 4434 : 72 - +