Research on Action Strategies and Simulations of DRL and MCTS-based Intelligent Round Game

被引:4
|
作者
Sun, Yuxiang [1 ]
Yuan, Bo [2 ]
Zhang, Yongliang [4 ]
Zheng, Wanwen [3 ]
Xia, Qingfeng [3 ]
Tang, Bojian [3 ]
Zhou, Xianzhong [3 ]
机构
[1] Nanjing Univ, Coll Engn Management, 22 Hankou Rd, Nanjing, Jiangsu, Peoples R China
[2] Derby Univ, Sch Comp & Engn, Derby, England
[3] Nanjing Univ, Sch Engn Management, 22 Hankou Rd, Nanjing, Jiangsu, Peoples R China
[4] Army Engn Univ Nanjing, Nanjing, Jiangsu, Peoples R China
关键词
DDQN; deep reinforcement learning; MCTS; round game; CARLO TREE-SEARCH; ARCADE LEARNING-ENVIRONMENT; GO;
D O I
10.1007/s12555-020-0277-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The reinforcement learning problem of complex action control in multiplayer online battlefield games has brought considerable interest in the deep learning field. This problem involves more complex states and action spaces than traditional confrontation games, making it difficult to search for any strategy with human-level performance. This paper presents a deep reinforcement learning model to solve this problem from the perspective of game simulations and algorithm implementation. A reverse reinforcement-learning model based on high-level player training data is established to support downstream algorithms. With less training data, the proposed model is converged quicker, and more consistent with the action strategies of high-level players' decision-making. Then an intelligent deduction algorithm based on DDQN is developed to achieve a better generalization ability under the guidance of a given reward function. At the game simulation level, this paper constructs Monte Carlo Tree Search Intelligent Decision Model for turn-based antagonistic deduction games to generate next-step actions. Furthermore, a prototype game simulator that combines offline with online functions is implemented to verify the performance of proposed model and algorithm. The experiments show that our proposed approach not only has a better reference value to the antagonistic environment using incomplete information, but also accurate and effective in predicting the return value. Moreover, our work provides a theoretical validation platform and testbed for related research on game AI for deductive games.
引用
下载
收藏
页码:2984 / 2998
页数:15
相关论文
共 43 条
  • [1] Research on Action Strategies and Simulations of DRL and MCTS-based Intelligent Round Game
    Yuxiang Sun
    Bo Yuan
    Yongliang Zhang
    Wanwen Zheng
    Qingfeng Xia
    Bojian Tang
    Xianzhong Zhou
    International Journal of Control, Automation and Systems, 2021, 19 : 2984 - 2998
  • [2] Strength Adjustment and Assessment for MCTS-Based Programs [Research Frontier]
    Liu, An-Jen
    Wu, Ti-Rong
    Wu, I-Chen
    Guei, Hung
    Wei, Ting-Han
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2020, 15 (03) : 60 - 73
  • [3] Intelligent Game Countermeasures Algorithm Based on Opponent Action Prediction
    Han, Runhai
    Chen, Hao
    Liu, Quan
    Huang, Jian
    Computer Engineering and Applications, 2023, 59 (07) : 190 - 197
  • [4] Research on an Identification Method of Intelligent vehicle Based on Game Theory
    Zhang, KaiSheng
    MODERN COMPUTER SCIENCE AND APPLICATIONS II (MCSA 2017), 2017, : 70 - 75
  • [5] Research on an intelligent game simulation system based on road network
    Wang, Xin
    Liu, Laikai
    Zhou, Zhenghao
    Qi, Xiangwei
    APPLIED MATHEMATICS AND NONLINEAR SCIENCES, 2022, 7 (02) : 1181 - 1192
  • [6] Game-based Sprint retrospectives: multiple action research
    Adam Przybyłek
    Marta Albecka
    Olga Springer
    Wojciech Kowalski
    Empirical Software Engineering, 2022, 27
  • [7] Game-based Sprint retrospectives: multiple action research
    Przybylek, Adam
    Albecka, Marta
    Springer, Olga
    Kowalski, Wojciech
    EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (01)
  • [8] Research on Guiding Strategies of VMS and Their Effects Based on Intelligent Materials
    Pei, YuLong
    Li, Xue
    ADVANCED RESEARCH ON INTELLIGENT MATERIALS AND MECHANICAL ENGINEERING, 2011, 321 : 50 - 54
  • [9] Research on Strategies of Networked Manufacturing Resources Configuration Based on Evolutionary Game
    Zhan, Yan
    Lu, Jiansha
    Ji, Xuehong
    FRONTIERS OF ADVANCED MATERIALS AND ENGINEERING TECHNOLOGY, PTS 1-3, 2012, 430-432 : 1330 - +
  • [10] Research on Renewable Energy Trading Strategies Based on Evolutionary Game Theory
    Huang, Fei
    Fan, Hua
    Shang, Yunlong
    Wei, Yuankang
    Almutairi, Sulaiman Z.
    Alharbi, Abdullah M.
    Ma, Hengrui
    Wang, Hongxia
    SUSTAINABILITY, 2024, 16 (07)