Imitation Learning for Playing Shogi Based on Generative Adversarial Networks

被引：0

作者：

Wan, Shanchuan ^{[1
]}

Kaneko, Tomoyuki ^{[2
,3
]}

机构：

[1] Univ Tokyo, Grad Sch Interdisciplinary Informat Studies, Tokyo, Japan

[2] Univ Tokyo, Interfac Initiat Informat Studies, Tokyo, Japan

[3] JST, PRESTO, Kawaguchi, Saitama, Japan

来源：

2017 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI) | 2017年

关键词：

imitation learning; board games; computer shogi; neural networks;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For imitation learning in games, AI programs commonly learn thinking and evaluating methods from professional players' game records. However, compared to the total number of all possible game states, top players' records are extremely insufficient. The limited amount of high -quality learning materials may become the bottleneck of training artificial intelligence. We proposed to introduce the idea of Generative Adversarial Networks into game programming, and validated its effectiveness in playing Shogi, a Japanese Chess game. The proposed method is experimentally proved to be capable to alleviate the data insufficiency problem and build more competitive AI programs than conventional supervised training methods.

引用

页码：92 / 95

页数：4

共 50 条

[21] Survey Paper:Introduction to Generative Adversarial Networks in Speech Imitation
Guha, Rajashree
Sharma, Vaibhav
Yadav, Yatin
Bhat, Aruna
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 839 - 843
[22] Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning
Fernando, Tharindu
Denman, Simon
Sridharan, Sridha
Fookes, Clinton
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 113 - 121
[23] Generative Adversarial Network for Imitation Learning from Single Demonstration
Tho Nguyen Duc
Chanh Minh Tran
Phan Xuan Tan
Kamioka, Eiji
BAGHDAD SCIENCE JOURNAL, 2021, 18 (04) : 1350 - 1355
[24] Joint Entity and Event Extraction with Generative Adversarial Imitation Learning
Tongtao Zhang
Heng Ji
Avirup Sil
Data Intelligence, 2019, 1 (02) : 99 - 120
[25] Saliency Prediction on Omnidirectional Image With Generative Adversarial Imitation Learning
Xu, Mai
Yang, Li
Tao, Xiaoming
Duan, Yiping
Wang, Zulin
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2087 - 2102
[26] Generalization and Computation for Policy Classes of Generative Adversarial Imitation Learning
Zhou, Yirui
Zhang, Yangchun
Liu, Xiaowei
Wang, Wanying
Che, Zhengping
Xu, Zhiyuan
Tang, Jian
Peng, Yaxin
PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XVII, PPSN 2022, PT I, 2022, 13398 : 385 - 399
[27] Imitating Agents in A Complex Environment by Generative Adversarial Imitation Learning
Li, Wanxiang
Hsueh, Chu-Hsuan
Ikeda, Kokolo
2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020), 2020, : 702 - 705
[28] Domain Adaptation for Imitation Learning Using Generative Adversarial Network
Duc, Tho Nguyen
Tran, Chanh Minh
Tan, Phan Xuan
Kamioka, Eiji
SENSORS, 2021, 21 (14)
[29] Joint Entity and Event Extraction with Generative Adversarial Imitation Learning
Zhang, Tongtao
Ji, Heng
Sil, Avirup
DATA INTELLIGENCE, 2019, 1 (02) : 99 - 120
[30] Distributional generative adversarial imitation learning with reproducing kernel generalization
Zhou, Yirui
Lu, Mengxiao
Liu, Xiaowei
Che, Zhengping
Xu, Zhiyuan
Tang, Jian
Zhang, Yangchun
Peng, Yan
Peng, Yaxin
NEURAL NETWORKS, 2023, 165 : 43 - 59

← 1 2 3 4 5 →