Research on Game-Playing Agents Based on Deep Reinforcement Learning

被引:5
|
作者
Zhao, Kai [1 ]
Song, Jia [1 ]
Luo, Yuxie [1 ]
Liu, Yang [2 ]
机构
[1] Beihang Univ BUAA, Sch Astronaut, Beijing 100191, Peoples R China
[2] Beihang Univ BUAA, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
deep reinforcement learning (DRL); deep deterministic policy gradient (DDPG); dynamic path planning; confrontation environment;
D O I
10.3390/robotics11020035
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Path planning is a key technology for the autonomous mobility of intelligent robots. However, there are few studies on how to carry out path planning in real time under the confrontation environment. Therefore, based on the deep deterministic policy gradient (DDPG) algorithm, this paper designs the reward function and adopts the incremental training and reward compensation method to improve the training efficiency and obtain the penetration strategy. The Monte Carlo experiment results show that the algorithm can effectively avoid static obstacles, break through the interception, and finally reach the target area. Moreover, the algorithm is also validated in the Webots simulator.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] General game-playing and reinforcement learning
    Levinson, R
    [J]. COMPUTATIONAL INTELLIGENCE, 1996, 12 (01) : 155 - 176
  • [2] Synthetic learning agents in game-playing social environments
    Kiourt, Chairi
    Kalles, Dimitris
    [J]. ADAPTIVE BEHAVIOR, 2016, 24 (06) : 411 - 427
  • [3] Learning Simulation Control in General Game-Playing Agents
    Finnsson, Hilmar
    Bjornsson, Yngvi
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 954 - 959
  • [4] A unifying learning framework for building artificial game-playing agents
    Chen, Wenlin
    Chen, Yixin
    Levine, David K.
    [J]. ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2015, 73 (3-4) : 335 - 358
  • [5] A unifying learning framework for building artificial game-playing agents
    Wenlin Chen
    Yixin Chen
    David K. Levine
    [J]. Annals of Mathematics and Artificial Intelligence, 2015, 73 : 335 - 358
  • [6] GAME-PLAYING AGENTS - UNOBSERVABLE CONTRACTS AS PRECOMMITMENTS
    KATZ, ML
    [J]. RAND JOURNAL OF ECONOMICS, 1991, 22 (03): : 307 - 328
  • [7] GAME-PLAYING
    SIASSI, I
    WESNER, DO
    [J]. COLLEGE AND UNIVERSITY, 1983, 58 (03): : 225 - 245
  • [8] Deep Reinforcement Learning for General Game Playing
    Goldwaser, Adrian
    Thielscher, Michael
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 1701 - 1708
  • [9] Understanding Game-Playing Agents with Natural Language Annotations
    Tomlin, Nicholas
    He, Andre
    Klein, Dan
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 797 - 807
  • [10] Evaluating Interpretability Methods for DNNs in Game-Playing Agents
    Palsson, Aoalsteinn
    Bjornsson, Yngvi
    [J]. ADVANCES IN COMPUTER GAMES, ACG 2021, 2022, 13262 : 71 - 81