Deep or Wide? Learning Policy and Value Neural Networks for Combinatorial Games

被引：0

作者：

Edelkamp, Stefan ^{[1
]}

机构：

[1] Univ Bremen, Fac Math & Comp Sci, Bremen, Germany

来源：

COMPUTER GAMES: 5TH WORKSHOP ON COMPUTER GAMES, CGW 2016, AND 5TH WORKSHOP ON GENERAL INTELLIGENCE IN GAME-PLAYING AGENTS, GIGA 2016, HELD IN CONJUNCTION WITH THE 25TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2016, NEW YORK, USA, JULY 9-10, 2016 | 2017年 / 705卷

关键词：

D O I：

10.1007/978-3-319-57969-6_2

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The success in learning how to play Go at a professional level is based on training a deep neural network on a wider selection of human expert games and raises the question on the availability, the limits, and the possibilities of this technique for other combinatorial games, especially when there is a lack of access to a larger body of additional expert knowledge. As a step towards this direction, we trained a value network for Tic-TacToe, providing perfect winning information obtained by retrograde analysis. Next, we trained a policy network for the SameGame, a challenging combinatorial puzzle. Here, we discuss the interplay of deep learning with nested rollout policy adaptation (NRPA), a randomized algorithm for optimizing the outcome of single-player games. In both cases we observed that ordinary feed-forward neural networks can perform better than convolutional ones both in accuracy and efficiency.

引用

页码：19 / 33

页数：15

共 50 条

[21] Deep associative learning for neural networks
Liu, Jia
Zhang, Wenhua
Liu, Fang
Xiao, Liang
NEUROCOMPUTING, 2021, 443 (443) : 222 - 234
[22] Collaborative Learning for Deep Neural Networks
Song, Guocong
Chai, Wei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[23] Big learning and deep neural networks
Montavon, Grégoire
Müller, Klaus-Robert
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2012, 7700 LECTURE NO : 419 - 420
[24] Multiplierless Neural Networks for Deep Learning
Banduka, Maja Lutovac
Lutovac, Miroslav
2024 13TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING, MECO 2024, 2024, : 262 - 265
[25] Shortcut learning in deep neural networks
Geirhos, Robert
Jacobsen, Joern-Henrik
Michaelis, Claudio
Zemel, Richard
Brendel, Wieland
Bethge, Matthias
Wichmann, Felix A.
NATURE MACHINE INTELLIGENCE, 2020, 2 (11) : 665 - 673
[26] Neural Networks as a Learning Component for Designing Board Games
Nikolakakis, Alexandros
Kalles, Dimitris
ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2017, 2017, 744 : 291 - 302
[27] Deep Learning Neural Networks and Bayesian Neural Networks in Data Analysis
Chernoded, Andrey
Dudko, Lev
Myagkov, Igor
Volkov, Petr
XXIII INTERNATIONAL WORKSHOP HIGH ENERGY PHYSICS AND QUANTUM FIELD THEORY (QFTHEP 2017), 2017, 158
[28] Wide neural networks with bottlenecks are deep gaussian processes
Agrawal, Devanshu
Papamarkou, Theodore
Hinkle, Jacob
Journal of Machine Learning Research, 2020, 21
[29] Wide and deep neural networks achieve consistency for classification
Radhakrishnan, Adityanarayanan
Belkin, Mikhail
Uhler, Caroline
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (14)
[30] Stable behaviour of infinitely wide deep neural networks
Favaro, Stefano
Fortini, Sandra
Peluchetti, Stefano
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1137 - 1145

← 1 2 3 4 5 →