Deep or Wide? Learning Policy and Value Neural Networks for Combinatorial Games

Cited by: 0
Authors
Edelkamp, Stefan [1]
Affiliations
[1] Univ Bremen, Fac Math & Comp Sci, Bremen, Germany
DOI
10.1007/978-3-319-57969-6_2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
The success in learning how to play Go at a professional level is based on training a deep neural network on a wide selection of human expert games. It raises the question of the availability, the limits, and the possibilities of this technique for other combinatorial games, especially when there is no access to a larger body of additional expert knowledge. As a step in this direction, we trained a value network for Tic-Tac-Toe, providing perfect winning information obtained by retrograde analysis. Next, we trained a policy network for the SameGame, a challenging combinatorial puzzle. Here, we discuss the interplay of deep learning with nested rollout policy adaptation (NRPA), a randomized algorithm for optimizing the outcome of single-player games. In both cases we observed that ordinary feed-forward neural networks can perform better than convolutional ones in both accuracy and efficiency.
Pages: 19-33
Number of pages: 15
Related papers
50 in total
  • [21] Deep associative learning for neural networks
    Liu, Jia
    Zhang, Wenhua
    Liu, Fang
    Xiao, Liang
    NEUROCOMPUTING, 2021, 443 (443) : 222 - 234
  • [22] Collaborative Learning for Deep Neural Networks
    Song, Guocong
    Chai, Wei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [23] Big learning and deep neural networks
    Montavon, Grégoire
    Müller, Klaus-Robert
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2012, 7700 : 419 - 420
  • [24] Multiplierless Neural Networks for Deep Learning
    Banduka, Maja Lutovac
    Lutovac, Miroslav
    2024 13TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING, MECO 2024, 2024, : 262 - 265
  • [25] Shortcut learning in deep neural networks
    Geirhos, Robert
    Jacobsen, Joern-Henrik
    Michaelis, Claudio
    Zemel, Richard
    Brendel, Wieland
    Bethge, Matthias
    Wichmann, Felix A.
    NATURE MACHINE INTELLIGENCE, 2020, 2 (11) : 665 - 673
  • [26] Neural Networks as a Learning Component for Designing Board Games
    Nikolakakis, Alexandros
    Kalles, Dimitris
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2017, 2017, 744 : 291 - 302
  • [27] Deep Learning Neural Networks and Bayesian Neural Networks in Data Analysis
    Chernoded, Andrey
    Dudko, Lev
    Myagkov, Igor
    Volkov, Petr
    XXIII INTERNATIONAL WORKSHOP HIGH ENERGY PHYSICS AND QUANTUM FIELD THEORY (QFTHEP 2017), 2017, 158
  • [28] Wide neural networks with bottlenecks are deep gaussian processes
    Agrawal, Devanshu
    Papamarkou, Theodore
    Hinkle, Jacob
    Journal of Machine Learning Research, 2020, 21
  • [29] Wide and deep neural networks achieve consistency for classification
    Radhakrishnan, Adityanarayanan
    Belkin, Mikhail
    Uhler, Caroline
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (14)
  • [30] Stable behaviour of infinitely wide deep neural networks
    Favaro, Stefano
    Fortini, Sandra
    Peluchetti, Stefano
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1137 - 1145