Imitation Learning for Playing Shogi Based on Generative Adversarial Networks

被引:0
|
作者
Wan, Shanchuan [1 ]
Kaneko, Tomoyuki [2 ,3 ]
机构
[1] Univ Tokyo, Grad Sch Interdisciplinary Informat Studies, Tokyo, Japan
[2] Univ Tokyo, Interfac Initiat Informat Studies, Tokyo, Japan
[3] JST, PRESTO, Kawaguchi, Saitama, Japan
关键词
imitation learning; board games; computer shogi; neural networks;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For imitation learning in games, AI programs commonly learn thinking and evaluating methods from professional players' game records. However, compared to the total number of all possible game states, top players' records are extremely insufficient. The limited amount of high -quality learning materials may become the bottleneck of training artificial intelligence. We proposed to introduce the idea of Generative Adversarial Networks into game programming, and validated its effectiveness in playing Shogi, a Japanese Chess game. The proposed method is experimentally proved to be capable to alleviate the data insufficiency problem and build more competitive AI programs than conventional supervised training methods.
引用
收藏
页码:92 / 95
页数:4
相关论文
共 50 条
  • [41] Generative Adversarial Networks Based on Contrastive Learning for Sequential Recommendation
    Li Jianhong
    Wang Yue
    Yan Taotao
    Sun Chengyuan
    Li Dequan
    WEB AND BIG DATA, PT II, APWEB-WAIM 2023, 2024, 14332 : 439 - 453
  • [42] Research on imbalanced learning based on conditional generative adversarial networks
    Zhao H.-X.
    Shi H.-B.
    Wu J.
    Chen X.
    Kongzhi yu Juece/Control and Decision, 2021, 36 (03): : 619 - 628
  • [43] Data Augment in Imbalanced Learning Based on Generative Adversarial Networks
    Zhou, Zhuocheng
    Zhang, Bofeng
    Lv, Ying
    Shi, Tian
    Chang, Furong
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT IV, 2019, 1142 : 21 - 30
  • [44] Crack Detection Based on Generative Adversarial Networks and Deep Learning
    Gongfa Chen
    Shuai Teng
    Mansheng Lin
    Xiaomei Yang
    Xiaoli Sun
    KSCE Journal of Civil Engineering, 2022, 26 : 1803 - 1816
  • [45] Volumetric Imitation Generative Adversarial Networks for Anatomical Human Body Modeling
    Kim, Jion
    Li, Yan
    Shin, Byeong-Seok
    BIOENGINEERING-BASEL, 2024, 11 (02):
  • [46] Sample-Efficient Imitation Learning via Generative Adversarial Nets
    Blonde, Lionel
    Kalousis, Alexandros
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [47] Unmanned surface vehicle navigation through generative adversarial imitation learning
    Chaysri, Piyabhum
    Spatharis, Christos
    Blekas, Konstantinos
    Vlachos, Kostas
    OCEAN ENGINEERING, 2023, 282
  • [48] Modeling Human Driving Behavior Through Generative Adversarial Imitation Learning
    Bhattacharyya, Raunak
    Wulfe, Blake
    Phillips, Derek J.
    Kuefler, Alex
    Morton, Jeremy
    Senanayake, Ransalu
    Kochenderfer, Mykel J.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (03) : 2874 - 2887
  • [49] Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective
    Wang, Wanying
    Zhu, Yichen
    Zhou, Yirui
    Shen, Chaomin
    Tang, Jian
    Xu, Zhiyuan
    Peng, Yaxin
    Zhang, Yangchun
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 15625 - 15633
  • [50] DIVINE: A Generative Adversarial Imitation Learning Framework for Knowledge Graph Reasoning
    Li, Ruiping
    Cheng, Xiang
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2642 - 2651