Deep or Wide? Learning Policy and Value Neural Networks for Combinatorial Games

被引：0

作者：

Edelkamp, Stefan ^{[1
]}

机构：

[1] Univ Bremen, Fac Math & Comp Sci, Bremen, Germany

来源：

COMPUTER GAMES: 5TH WORKSHOP ON COMPUTER GAMES, CGW 2016, AND 5TH WORKSHOP ON GENERAL INTELLIGENCE IN GAME-PLAYING AGENTS, GIGA 2016, HELD IN CONJUNCTION WITH THE 25TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2016, NEW YORK, USA, JULY 9-10, 2016 | 2017年 / 705卷

关键词：

D O I：

10.1007/978-3-319-57969-6_2

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The success in learning how to play Go at a professional level is based on training a deep neural network on a wider selection of human expert games and raises the question on the availability, the limits, and the possibilities of this technique for other combinatorial games, especially when there is a lack of access to a larger body of additional expert knowledge. As a step towards this direction, we trained a value network for Tic-TacToe, providing perfect winning information obtained by retrograde analysis. Next, we trained a policy network for the SameGame, a challenging combinatorial puzzle. Here, we discuss the interplay of deep learning with nested rollout policy adaptation (NRPA), a randomized algorithm for optimizing the outcome of single-player games. In both cases we observed that ordinary feed-forward neural networks can perform better than convolutional ones both in accuracy and efficiency.

引用

页码：19 / 33

页数：15

共 50 条

[1] Policy Learning for Continuous Space Security Games Using Neural Networks
Kamra, Nitin
Gupta, Umang
Fang, Fei
Liu, Yan
Tambe, Milind
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 1103 - 1112
[2] WIDE AND DEEP GRAPH NEURAL NETWORKS WITH DISTRIBUTED ONLINE LEARNING
Gao, Zhan
Ribeiro, Alejandro
Gama, Fernando
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5270 - 5274
[3] Learning a Better Negative Sampling Policy with Deep Neural Networks for Search
Cohen, Daniel
Jordan, Scott M.
Croft, W. Bruce
PROCEEDINGS OF THE 2019 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'19), 2019, : 19 - 26
[4] Learning Coagulation Processes With Combinatorial Neural Networks
Wang, Justin L.
Curtis, Jeffrey H.
Riemer, Nicole
West, Matthew
JOURNAL OF ADVANCES IN MODELING EARTH SYSTEMS, 2022, 14 (12)
[5] Fast Learning of Deep Neural Networks via Singular Value Decomposition
Cai, Chenghao
Ke, Dengfeng
Xu, Yanyan
Su, Kaile
PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 820 - 826
[6] Variable Strength Combinatorial Testing for Deep Neural Networks
Chen, Yanshan
Wang, Ziyuan
Wang, Dong
Fang, Chunrong
Chen, Zhenyu
2019 IEEE 12TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS (ICSTW 2019), 2019, : 281 - 284
[7] Learning Deep and Wide: A Spectral Method for Learning Deep Networks
Shao, Ling
Wu, Di
Li, Xuelong
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (12) : 2303 - 2308
[8] Propagation Mechanism for Deep and Wide Neural Networks
Xu, Dejiang
Lee, Mong Li
Hsu, Wynne
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9212 - 9220
[9] Deep and Wide Neural Networks Covariance Estimation
Arratia, Argimiro
Cabana, Alejandra
Rafael Leon, Jose
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT I, 2020, 12396 : 195 - 206
[10] The Loss Surface of Deep and Wide Neural Networks
Quynh Nguyen
Hein, Matthias
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70

← 1 2 3 4 5 →