Multi-robot Collaboration Based on Markov Decision Process in Robocup3D Soccer Simulation Game

Cited by: 0

Authors:

Cui Xuanyu [1]

Liang Zhiwei [1]

Yang Yongyi [1]

Shen Ping [1]

Wang Jiawen [1]

Liu Haoran [1]

Fan Kai [1]

Affiliations:

[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210046, Jiangsu, Peoples R China

Source:

2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC) | 2015

Keywords:

Markov Decision Process; Sarsa Algorithm; Reinforcement learning; Dynamic role assignment; RoboCup;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Close collaboration and a suitable strategy are indispensable for humanoid robots in the RoboCup soccer competition. To address the slow convergence encountered when training local strategies, this paper proposes a reinforcement-learning-based method for optimizing the decision and positioning parameters of soccer robots. First, a Markov decision process is adopted as the framework for reinforcement learning. Then, an improved method, the Sarsa algorithm, is proposed to overcome the low convergence rate of average-reward reinforcement learning. Meanwhile, to handle the large state space that arises in training and to improve generalization ability, the method is applied to Keepaway local training. The training results show that this algorithm converges faster than other ordinary learning algorithms.
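For readers unfamiliar with the Sarsa algorithm named in the abstract, the following is a minimal tabular sketch of the standard on-policy update Q(s, a) ← Q(s, a) + α [r + γ Q(s', a') − Q(s, a)]. It is not the authors' implementation: the toy chain environment, the function names (`step`, `epsilon_greedy`, `sarsa`), and all hyperparameter values are assumptions chosen only for illustration, and the Keepaway task described in the abstract has a much larger state space that a lookup table like this would likely not cover.

```python
import random


def epsilon_greedy(q, state, n_actions, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one (ties broken at random)."""
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    values = [q[(state, a)] for a in range(n_actions)]
    best = max(values)
    return rng.choice([a for a, v in enumerate(values) if v == best])


def step(state, action, n_states):
    """Hypothetical toy chain: action 1 moves right, action 0 moves left.

    Reaching the last state ends the episode with reward 1; all other steps give 0.
    """
    nxt = min(state + 1, n_states - 1) if action == 1 else max(state - 1, 0)
    done = nxt == n_states - 1
    return nxt, (1.0 if done else 0.0), done


def sarsa(n_states=5, n_actions=2, episodes=500,
          alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Sarsa on the toy chain; returns the learned Q-table as a dict."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(n_states) for a in range(n_actions)}
    for _ in range(episodes):
        state = 0
        action = epsilon_greedy(q, state, n_actions, epsilon, rng)
        done = False
        while not done:
            nxt, reward, done = step(state, action, n_states)
            # On-policy: the next action is drawn from the same policy being learned.
            nxt_action = epsilon_greedy(q, nxt, n_actions, epsilon, rng)
            target = reward if done else reward + gamma * q[(nxt, nxt_action)]
            q[(state, action)] += alpha * (target - q[(state, action)])
            state, action = nxt, nxt_action
    return q


if __name__ == "__main__":
    table = sarsa()
    for s in range(4):
        print(s, round(table[(s, 0)], 3), round(table[(s, 1)], 3))
```

The defining feature of Sarsa, as opposed to Q-learning, is that the update target uses the action actually selected in the next state rather than the maximizing action.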


Pages: 4345 - 4349

Page count: 5

30 items in total

[21] Orthogonal Vector Field-based Control for a Multi-Robot System Circumnavigating a Moving Target in 3D
Miao, Zhiqiang
Thakur, Divya
Erwin, R. Scott
Pierre, Jean
Wang, Yaonan
Fierro, Rafael
2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 6004 - 6009
[22] Cooperative multi-robot systems - A study of vision-based 3-D mapping using information theory
Rocha, R
Dias, J
Carvalho, A
2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 384 - 389
[23] Cooperative multi-robot systems: A study of vision-based 3-D mapping using information theory
Rocha, R
Dias, J
Carvalho, A
ROBOTICS AND AUTONOMOUS SYSTEMS, 2005, 53 (3-4) : 282 - 311
[24] LiDAR Based Multi-Robot Cooperation for the 3D Printing of Continuous Carbon Fiber Reinforced Composite Structures
Li, Nanya
Link, Guido
Ma, Junhui
Jelonnek, John
ADVANCES IN MANUFACTURING TECHNOLOGY XXXIV, 2021, 15 : 125 - 132
[25] Multi Agent Based Approach to Assist the Design Process of 3D Game Environments
Maddegoda, Ramesh
Karunananda, Asoka S.
INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER2012), 2012, : 36 - 44
[26] Multi 3D-Sensor Based Human-Robot Collaboration with Cloud Solution for Object Handover
Bajrami, Aulon
INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 4, INTELLISYS 2023, 2024, 825 : 139 - 155
[27] Particulate matter source localization in dynamic indoor environments: Bridging simulation-experimentation gaps with a 3D multi-robot system
Mao, Hongyi
Guo, Xun
Qiu, Jiamin
Zeng, Lingjie
Li, Fei
Cai, Hao
JOURNAL OF HAZARDOUS MATERIALS, 2025, 488
[28] An optimal maintenance strategy for multi-state deterioration systems based on a semi-Markov decision process coupled with simulation technique
Jin, Haibo
Han, Fangwei
Sang, Yu
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2020, 139
[29] Development of A Robot-Based Multi-Directional Dynamic Fiber Winding Process for Additive Manufacturing Using Shotcrete 3D Printing
Hack, Norman
Bahar, Mohammad
Huehne, Christian
Lopez, William
Gantner, Stefan
Khader, Noor
Rothe, Tom
FIBERS, 2021, 9 (06)
[30] A review on the Representative Volume Element-based multi-scale simulation of 3D woven high performance thermoset composites manufactured using resin transfer molding process
Trofimov, Anton
Ravey, Christophe
Droz, Nicolas
Therriault, Daniel
Levesque, Martin
COMPOSITES PART A-APPLIED SCIENCE AND MANUFACTURING, 2023, 169
