Deep or Wide? Learning Policy and Value Neural Networks for Combinatorial Games

Cited by: 0

Authors:

Edelkamp, Stefan [1]

Affiliations:

[1] Univ Bremen, Fac Math & Comp Sci, Bremen, Germany

Source:

COMPUTER GAMES: 5TH WORKSHOP ON COMPUTER GAMES, CGW 2016, AND 5TH WORKSHOP ON GENERAL INTELLIGENCE IN GAME-PLAYING AGENTS, GIGA 2016, HELD IN CONJUNCTION WITH THE 25TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2016, NEW YORK, USA, JULY 9-10, 2016 | 2017, vol. 705

DOI:

10.1007/978-3-319-57969-6_2

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The success in learning how to play Go at a professional level is based on training a deep neural network on a wider selection of human expert games. It raises the question of the availability, the limits, and the possibilities of this technique for other combinatorial games, especially when there is no access to a larger body of additional expert knowledge. As a step in this direction, we trained a value network for Tic-Tac-Toe, providing perfect winning information obtained by retrograde analysis. Next, we trained a policy network for the SameGame, a challenging combinatorial puzzle. Here, we discuss the interplay of deep learning with nested rollout policy adaptation (NRPA), a randomized algorithm for optimizing the outcome of single-player games. In both cases we observed that ordinary feed-forward neural networks can perform better than convolutional ones both in accuracy and efficiency.
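
As an illustration of the "perfect winning information" mentioned in the abstract, the following is a minimal sketch of how exact Tic-Tac-Toe values could be generated as training labels for a value network. It is not the paper's code: it uses a memoized forward minimax search rather than retrograde analysis (for a game this small both yield the same game-theoretic values), and the board encoding (a 9-character string of `X`, `O`, `.`) and the function names `winner`, `value`, and `labelled_positions` are assumptions made for this example.

```python
from functools import lru_cache

# The eight winning lines on a 3x3 board indexed 0..8 in row-major order.
LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),
         (0, 3, 6), (1, 4, 7), (2, 5, 8),
         (0, 4, 8), (2, 4, 6)]


def winner(board):
    """Return 'X' or 'O' if that player has three in a row, else None."""
    for a, b, c in LINES:
        if board[a] != '.' and board[a] == board[b] == board[c]:
            return board[a]
    return None


@lru_cache(maxsize=None)
def value(board, player):
    """Exact value from X's point of view: +1 X wins, 0 draw, -1 O wins.

    `player` is the side to move. Memoized minimax is used here as a
    stand-in for retrograde analysis (an assumption of this sketch).
    """
    w = winner(board)
    if w is not None:
        return 1 if w == 'X' else -1
    if '.' not in board:
        return 0
    nxt = 'O' if player == 'X' else 'X'
    children = [value(board[:i] + player + board[i + 1:], nxt)
                for i, cell in enumerate(board) if cell == '.']
    return max(children) if player == 'X' else min(children)


def labelled_positions():
    """Map every position reachable from the empty board to its exact value."""
    seen = {}
    stack = [('.' * 9, 'X')]
    while stack:
        board, player = stack.pop()
        if (board, player) in seen:
            continue
        seen[(board, player)] = value(board, player)
        # Expand only non-terminal positions.
        if winner(board) is None and '.' in board:
            nxt = 'O' if player == 'X' else 'X'
            for i, cell in enumerate(board):
                if cell == '.':
                    stack.append((board[:i] + player + board[i + 1:], nxt))
    return seen
```

Calling `labelled_positions()` returns a dictionary keyed by `(board, player)` whose values could serve as regression or classification targets; how the paper actually encoded positions for its networks is not stated in this record.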

Pages: 19-33

Page count: 15
