Multi-robot Collaboration Based on Markov Decision Process in Robocup3D Soccer Simulation Game

被引:0
|
作者
Cui Xuanyu [1 ]
Liang Zhiwei [1 ]
Yang Yongyi [1 ]
Shen Ping [1 ]
Wang Jiawen [1 ]
Liu Haoran [1 ]
Fan Kai [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210046, Jiangsu, Peoples R China
关键词
Markov Decision Process; Sarsa Algorithm; Reinforcement learning; Dynamic role assignment; RoboCup;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Close collaboration and desired strategy is indispensable for humanoid robots in the RoboCup soccer competition. In order to solve the problem that the convergence rate is too low in training local strategies,this paper mainly proposed a method to optimize the parameters in decision and positioning based on reinforcement learning for soccer robots. First, Markov decision process is applied to the framework for reinforcement learning. Then,we propose a relative improved method, which is known as a Sarsa Algorithm to overcome the drawback of the low convergence rate of the average reward reinforcement learning. Meanwhile, in order to deal with the large state space problems arising in the training and improve the generalization ability, this method is applied to the Keepaway local training. The training results show that, this algorithm has a faster convergent speed than other ordinary learning algorithm.
引用
收藏
页码:4345 / 4349
页数:5
相关论文
共 30 条
  • [21] Orthogonal Vector Field-based Control for a Multi-Robot System Circumnavigating a Moving Target in 3D
    Miao, Zhiqiang
    Thakur, Divya
    Erwin, R. Scott
    Pierre, Jean
    Wang, Yaonan
    Fierro, Rafael
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 6004 - 6009
  • [22] Cooperative multi-robot systems - A study of vision-based 3-D mapping using information theory
    Rocha, R
    Dias, J
    Carvalho, A
    2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 384 - 389
  • [23] Cooperative multi-robot systems: A study of vision-based 3-D mapping using information theory
    Rocha, R
    Dias, J
    Carvalho, A
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2005, 53 (3-4) : 282 - 311
  • [24] LiDAR Based Multi-Robot Cooperation for the 3D Printing of Continuous Carbon Fiber Reinforced Composite Structures
    Li, Nanya
    Link, Guido
    Ma, Junhui
    Jelonnek, John
    ADVANCES IN MANUFACTURING TECHNOLOGY XXXIV, 2021, 15 : 125 - 132
  • [25] Multi Agent Based Approach to Assist the Design Process of 3D Game Environments
    Maddegoda, Ramesh
    Karunananda, Asoka S.
    INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER2012), 2012, : 36 - 44
  • [26] Multi 3D-Sensor Based Human-Robot Collaboration with Cloud Solution for Object Handover
    Bajrami, Aulon
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 4, INTELLISYS 2023, 2024, 825 : 139 - 155
  • [27] Particulate matter source localization in dynamic indoor environments: Bridging simulation-experimentation gaps with a 3D multi-robot system
    Mao, Hongyi
    Guo, Xun
    Qiu, Jiamin
    Zeng, Lingjie
    Li, Fei
    Cai, Hao
    JOURNAL OF HAZARDOUS MATERIALS, 2025, 488
  • [28] An optimal maintenance strategy for multi-state deterioration systems based on a semi-Markov decision process coupled with simulation technique
    Jin, Haibo
    Han, Fangwei
    Sang, Yu
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2020, 139
  • [29] Development of A Robot-Based Multi-Directional Dynamic Fiber Winding Process for Additive Manufacturing Using Shotcrete 3D Printing
    Hack, Norman
    Bahar, Mohammad
    Huehne, Christian
    Lopez, William
    Gantner, Stefan
    Khader, Noor
    Rothe, Tom
    FIBERS, 2021, 9 (06)
  • [30] A review on the Representative Volume Element-based multi-scale simulation of 3D woven high performance thermoset composites manufactured using resin transfer molding process
    Trofimov, Anton
    Ravey, Christophe
    Droz, Nicolas
    Therriault, Daniel
    Levesque, Martin
    COMPOSITES PART A-APPLIED SCIENCE AND MANUFACTURING, 2023, 169