Deep or Wide? Learning Policy and Value Neural Networks for Combinatorial Games

被引:0
|
作者
Edelkamp, Stefan [1 ]
机构
[1] Univ Bremen, Fac Math & Comp Sci, Bremen, Germany
关键词
D O I
10.1007/978-3-319-57969-6_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The success in learning how to play Go at a professional level is based on training a deep neural network on a wider selection of human expert games and raises the question on the availability, the limits, and the possibilities of this technique for other combinatorial games, especially when there is a lack of access to a larger body of additional expert knowledge. As a step towards this direction, we trained a value network for Tic-TacToe, providing perfect winning information obtained by retrograde analysis. Next, we trained a policy network for the SameGame, a challenging combinatorial puzzle. Here, we discuss the interplay of deep learning with nested rollout policy adaptation (NRPA), a randomized algorithm for optimizing the outcome of single-player games. In both cases we observed that ordinary feed-forward neural networks can perform better than convolutional ones both in accuracy and efficiency.
引用
收藏
页码:19 / 33
页数:15
相关论文
共 50 条
  • [1] Policy Learning for Continuous Space Security Games Using Neural Networks
    Kamra, Nitin
    Gupta, Umang
    Fang, Fei
    Liu, Yan
    Tambe, Milind
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 1103 - 1112
  • [2] WIDE AND DEEP GRAPH NEURAL NETWORKS WITH DISTRIBUTED ONLINE LEARNING
    Gao, Zhan
    Ribeiro, Alejandro
    Gama, Fernando
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5270 - 5274
  • [3] Learning a Better Negative Sampling Policy with Deep Neural Networks for Search
    Cohen, Daniel
    Jordan, Scott M.
    Croft, W. Bruce
    PROCEEDINGS OF THE 2019 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'19), 2019, : 19 - 26
  • [4] Learning Coagulation Processes With Combinatorial Neural Networks
    Wang, Justin L.
    Curtis, Jeffrey H.
    Riemer, Nicole
    West, Matthew
    JOURNAL OF ADVANCES IN MODELING EARTH SYSTEMS, 2022, 14 (12)
  • [5] Fast Learning of Deep Neural Networks via Singular Value Decomposition
    Cai, Chenghao
    Ke, Dengfeng
    Xu, Yanyan
    Su, Kaile
    PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 820 - 826
  • [6] Variable Strength Combinatorial Testing for Deep Neural Networks
    Chen, Yanshan
    Wang, Ziyuan
    Wang, Dong
    Fang, Chunrong
    Chen, Zhenyu
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS (ICSTW 2019), 2019, : 281 - 284
  • [7] Learning Deep and Wide: A Spectral Method for Learning Deep Networks
    Shao, Ling
    Wu, Di
    Li, Xuelong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (12) : 2303 - 2308
  • [8] Propagation Mechanism for Deep and Wide Neural Networks
    Xu, Dejiang
    Lee, Mong Li
    Hsu, Wynne
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9212 - 9220
  • [9] Deep and Wide Neural Networks Covariance Estimation
    Arratia, Argimiro
    Cabana, Alejandra
    Rafael Leon, Jose
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT I, 2020, 12396 : 195 - 206
  • [10] The Loss Surface of Deep and Wide Neural Networks
    Quynh Nguyen
    Hein, Matthias
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70