Imitation Learning for Playing Shogi Based on Generative Adversarial Networks

被引:0
|
作者
Wan, Shanchuan [1 ]
Kaneko, Tomoyuki [2 ,3 ]
机构
[1] Univ Tokyo, Grad Sch Interdisciplinary Informat Studies, Tokyo, Japan
[2] Univ Tokyo, Interfac Initiat Informat Studies, Tokyo, Japan
[3] JST, PRESTO, Kawaguchi, Saitama, Japan
关键词
imitation learning; board games; computer shogi; neural networks;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For imitation learning in games, AI programs commonly learn thinking and evaluating methods from professional players' game records. However, compared to the total number of all possible game states, top players' records are extremely insufficient. The limited amount of high -quality learning materials may become the bottleneck of training artificial intelligence. We proposed to introduce the idea of Generative Adversarial Networks into game programming, and validated its effectiveness in playing Shogi, a Japanese Chess game. The proposed method is experimentally proved to be capable to alleviate the data insufficiency problem and build more competitive AI programs than conventional supervised training methods.
引用
收藏
页码:92 / 95
页数:4
相关论文
共 50 条
  • [21] Survey Paper:Introduction to Generative Adversarial Networks in Speech Imitation
    Guha, Rajashree
    Sharma, Vaibhav
    Yadav, Yatin
    Bhat, Aruna
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 839 - 843
  • [22] Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning
    Fernando, Tharindu
    Denman, Simon
    Sridharan, Sridha
    Fookes, Clinton
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 113 - 121
  • [23] Generative Adversarial Network for Imitation Learning from Single Demonstration
    Tho Nguyen Duc
    Chanh Minh Tran
    Phan Xuan Tan
    Kamioka, Eiji
    BAGHDAD SCIENCE JOURNAL, 2021, 18 (04) : 1350 - 1355
  • [24] Joint Entity and Event Extraction with Generative Adversarial Imitation Learning
    Tongtao Zhang
    Heng Ji
    Avirup Sil
    Data Intelligence, 2019, 1 (02) : 99 - 120
  • [25] Saliency Prediction on Omnidirectional Image With Generative Adversarial Imitation Learning
    Xu, Mai
    Yang, Li
    Tao, Xiaoming
    Duan, Yiping
    Wang, Zulin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2087 - 2102
  • [26] Generalization and Computation for Policy Classes of Generative Adversarial Imitation Learning
    Zhou, Yirui
    Zhang, Yangchun
    Liu, Xiaowei
    Wang, Wanying
    Che, Zhengping
    Xu, Zhiyuan
    Tang, Jian
    Peng, Yaxin
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XVII, PPSN 2022, PT I, 2022, 13398 : 385 - 399
  • [27] Imitating Agents in A Complex Environment by Generative Adversarial Imitation Learning
    Li, Wanxiang
    Hsueh, Chu-Hsuan
    Ikeda, Kokolo
    2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020), 2020, : 702 - 705
  • [28] Domain Adaptation for Imitation Learning Using Generative Adversarial Network
    Duc, Tho Nguyen
    Tran, Chanh Minh
    Tan, Phan Xuan
    Kamioka, Eiji
    SENSORS, 2021, 21 (14)
  • [29] Joint Entity and Event Extraction with Generative Adversarial Imitation Learning
    Zhang, Tongtao
    Ji, Heng
    Sil, Avirup
    DATA INTELLIGENCE, 2019, 1 (02) : 99 - 120
  • [30] Distributional generative adversarial imitation learning with reproducing kernel generalization
    Zhou, Yirui
    Lu, Mengxiao
    Liu, Xiaowei
    Che, Zhengping
    Xu, Zhiyuan
    Tang, Jian
    Zhang, Yangchun
    Peng, Yan
    Peng, Yaxin
    NEURAL NETWORKS, 2023, 165 : 43 - 59