Reinforcement learning for RoboCup soccer keepaway

被引：184

作者：

Stone, P

Sutton, RS

Kuhlmann, G

机构：

[1] Univ Texas, Dept Comp Sci, Austin, TX 78712 USA

[2] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2M7, Canada

来源：

ADAPTIVE BEHAVIOR | 2005年 / 13卷 / 03期

关键词：

multiagent systems; machine learning; multiagent learning; reinforcement learning; robot soccer;

D O I：

10.1177/105971230501300301

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

RoboCup simulated soccer presents many challenges to reinforcement learning methods, including a large state space, hidden and uncertain state, multiple independent agents learning simultaneously, and long and variable delays in the effects of actions. We describe our application of episodic SMDP Sarsa(lambda) with linear tile-coding function approximation and variable; to learning higher-level decisions in a keepaway subtask of RoboCup soccer. In keepaway, one team, "the keepers," tries to keep control of the ball for as long as possible despite the efforts of "the takers." The keepers learn individually when to hold the ball and when to pass to a teammate. Our agents learned policies that significantly outperform a range of benchmark policies. We demonstrate the generality of our approach by applying it to a number of task variations including different field sizes and different numbers of players on each team.

引用

页码：165 / 188

页数：24

共 50 条

[1] Argumentation-Based Reinforcement Learning for RoboCup Soccer Keepaway
Gao, Yang
Toni, Francesca
Craven, Robert
[J]. 20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 342 - 347
[2] Reinforcement Learning in RoboCup KeepAway with Partial Observability
Devlin, Sam
Grzes, Marek
Kudenko, Daniel
[J]. 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 2, 2009, : 201 - 208
[3] Concurrent Hierarchical Reinforcement Learning for RoboCup Keepaway
Bai, Aijun
Russell, Stuart
Chen, Xiaoping
[J]. ROBOCUP 2017: ROBOT WORLD CUP XXI, 2018, 11175 : 190 - 203
[4] Argumentation-Based Reinforcement Learning for RoboCup Keepaway
Gao, Yang
Toni, Francesca
Craven, Robert
[J]. COMPUTATIONAL MODELS OF ARGUMENT, 2012, 245 : 519 - +
[5] Learning of Keepaway Task for RoboCup Soccer Agent Based on Fuzzy Q-Learning
Sawa, Toru
Watanabe, Toshihiko
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 250 - 256
[6] Reinforcement learning in simulation robocup soccer
Cheng, XY
Yuan, XH
Pan, LH
Xia, DS
[J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 244 - 248
[7] Teamwork formation for Keepaway in robotics soccer (reinforcement learning approach)
Tanaka, Nobuyuki
Arai, Sachiyo
[J]. AGENT COMPUTING AND MULTI-AGENT SYSTEMS, 2006, 4088 : 279 - 292
[8] Reinforcement learning of player agents in RoboCup soccer simulation
Sarje, A
Chawre, A
Nair, SB
[J]. HIS'04: Fourth International Conference on Hybrid Intelligent Systems, Proceedings, 2005, : 480 - 481
[9] Argumentation-Based Reinforcement Learning for RoboCup Soccer Takeaway
Gao, Yang
Toni, Francesca
[J]. AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1411 - 1412
[10] Half field offense in RoboCup soccer: A multiagent reinforcement learning case study
Kalyanakrishnan, Shivaram
Liu, Yaxin
Stone, Peter
[J]. ROBOCUP 2006: ROBOT SOCCER WORLD CUP X, 2007, 4434 : 72 - +

← 1 2 3 4 5 →