Research on Action Strategies and Simulations of DRL and MCTS-based Intelligent Round Game

被引:4
|
作者
Sun, Yuxiang [1 ]
Yuan, Bo [2 ]
Zhang, Yongliang [4 ]
Zheng, Wanwen [3 ]
Xia, Qingfeng [3 ]
Tang, Bojian [3 ]
Zhou, Xianzhong [3 ]
机构
[1] Nanjing Univ, Coll Engn Management, 22 Hankou Rd, Nanjing, Jiangsu, Peoples R China
[2] Derby Univ, Sch Comp & Engn, Derby, England
[3] Nanjing Univ, Sch Engn Management, 22 Hankou Rd, Nanjing, Jiangsu, Peoples R China
[4] Army Engn Univ Nanjing, Nanjing, Jiangsu, Peoples R China
关键词
DDQN; deep reinforcement learning; MCTS; round game; CARLO TREE-SEARCH; ARCADE LEARNING-ENVIRONMENT; GO;
D O I
10.1007/s12555-020-0277-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The reinforcement learning problem of complex action control in multiplayer online battlefield games has brought considerable interest in the deep learning field. This problem involves more complex states and action spaces than traditional confrontation games, making it difficult to search for any strategy with human-level performance. This paper presents a deep reinforcement learning model to solve this problem from the perspective of game simulations and algorithm implementation. A reverse reinforcement-learning model based on high-level player training data is established to support downstream algorithms. With less training data, the proposed model is converged quicker, and more consistent with the action strategies of high-level players' decision-making. Then an intelligent deduction algorithm based on DDQN is developed to achieve a better generalization ability under the guidance of a given reward function. At the game simulation level, this paper constructs Monte Carlo Tree Search Intelligent Decision Model for turn-based antagonistic deduction games to generate next-step actions. Furthermore, a prototype game simulator that combines offline with online functions is implemented to verify the performance of proposed model and algorithm. The experiments show that our proposed approach not only has a better reference value to the antagonistic environment using incomplete information, but also accurate and effective in predicting the return value. Moreover, our work provides a theoretical validation platform and testbed for related research on game AI for deductive games.
引用
下载
收藏
页码:2984 / 2998
页数:15
相关论文
共 43 条
  • [21] Designing Game-based Learning Artefacts for Cybersecurity Processes Using Action Design Research
    Rajendran, Dixon Prem Daniel
    Sundarraj, Rangaraja P.
    BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2024,
  • [22] Research on Block-Chain-Based Intelligent Transaction and Collaborative Scheduling Strategies for Large Grid
    Fu, Xiaolin
    Wang, Hong
    Wang, Zhijie
    IEEE ACCESS, 2020, 8 : 151866 - 151877
  • [23] Research on multimedia intelligent algorithm based on recognition and detection of table tennis ball striking action
    Dong X.
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [24] Intelligent Game Strategies in Target-Missile-Defender Engagement Using Curriculum-Based Deep Reinforcement Learning
    Gong, Xiaopeng
    Chen, Wanchun
    Chen, Zhongyuan
    AEROSPACE, 2023, 10 (02)
  • [25] Designing Biodiversity Management Strategies at the Community Level: Approaches Based on Participatory Action Research
    Rafael Hernández Maqueda
    Sandra Paste
    María del Consuelo Chango
    Bianca F. Serrano
    Fernando del Moral
    Human Ecology, 2022, 50 : 665 - 679
  • [26] Designing Biodiversity Management Strategies at the Community Level: Approaches Based on Participatory Action Research
    Hernandez Maqueda, Rafael
    Paste, Sandra
    Maria del Consuelo, Chango
    Serrano, Bianca F.
    del Moral, Fernando
    HUMAN ECOLOGY, 2022, 50 (04) : 665 - 679
  • [27] Based on Action-Personality Data Mining, Research of Gamification Emission Reduction Mechanism and Intelligent Personalized Action Recommendation Model
    Xu, Yangbo
    Tang, Yi
    CROSS-CULTURAL DESIGN: METHODS, PRACTICE AND IMPACT, CCD 2015, PT I, 2015, 9180 : 241 - 252
  • [28] Research on Value Co-Creation Strategies for Stakeholders of Takeaway Platforms Based on Tripartite Evolutionary Game
    Li, Jianjun
    Xu, Xiaodi
    Yang, Yu
    SUSTAINABILITY, 2023, 15 (17)
  • [29] A Research on Attack-Defense Strategies for Electric Vehicle Charging Pile System Based on Bayesian Game
    Zhang, Mimi
    Ma, Linyue
    Jiao, Xinyuan
    Lv, Ruiguang
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON POWER ELECTRONICS AND ARTIFICIAL INTELLIGENCE, PEAI 2024, 2024, : 775 - 780
  • [30] Research on task offloading optimization strategies for vehicular networks based on game theory and deep reinforcement learning
    Wang, Lei
    Zhou, Wenjiang
    Xu, Haitao
    Li, Liang
    Cai, Lei
    Zhou, Xianwei
    FRONTIERS IN PHYSICS, 2023, 11