Deep or Wide? Learning Policy and Value Neural Networks for Combinatorial Games

Citations: 0
Authors
Edelkamp, Stefan [1]
Affiliations
[1] Univ Bremen, Fac Math & Comp Sci, Bremen, Germany
DOI
10.1007/978-3-319-57969-6_2
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The success in learning how to play Go at a professional level is based on training a deep neural network on a wide selection of human expert games and raises the question of the availability, the limits, and the possibilities of this technique for other combinatorial games, especially when there is a lack of access to a larger body of additional expert knowledge. As a step in this direction, we trained a value network for Tic-Tac-Toe, providing perfect winning information obtained by retrograde analysis. Next, we trained a policy network for SameGame, a challenging combinatorial puzzle. Here, we discuss the interplay of deep learning with nested rollout policy adaptation (NRPA), a randomized algorithm for optimizing the outcome of single-player games. In both cases we observed that ordinary feed-forward neural networks can perform better than convolutional ones in both accuracy and efficiency.
Pages: 19-33
Page count: 15