Accelerating Deep Neuroevolution on Distributed FPGAs for Reinforcement Learning Problems

被引:2
|
作者
Asseman, Alexis [1 ]
Antoine, Nicolas [1 ]
Ozcan, Ahmet S. [1 ]
机构
[1] IBM Almaden Res Ctr, 650 Harry Rd, San Jose, CA 95120 USA
关键词
Genetic algorithm; field programmable gate array; neuroevolution; reinforcement learning; artificial neural network; ENVIRONMENT;
D O I
10.1145/3425500
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning, augmented by the representational power of deep neural networks, has shown promising results on high-dimensional problems, such as game playing and robotic control. However, the sequential nature of these problems poses a fundamental challenge for computational efficiency. Recently, alternative approaches such as evolutionary strategies and deep neuroevolution demonstrated competitive results with faster training time on distributed CPU cores. Here we report record training times (running at about 1 million frames per second) for Atari 2600 games using deep neuroevolution implemented on distributed FPGAs. Combined hardware implementation of the game console, image preprocessing and the neural network in an optimized pipeline, multiplied with the system level parallelism enabled the acceleration. These results are the first application demonstration on the IBM Neural Computer, which is a custom designed system that consists of 432 Xilinx FPGAs interconnected in a 3D mesh network topology. In addition to high performance, experiments also showed improvement in accuracy for all games compared to the CPU implementation of the same algorithm.
引用
下载
收藏
页数:17
相关论文
共 50 条
  • [1] Neuroevolution for Deep Reinforcement Learning Problems
    Ha, David
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 550 - 593
  • [2] Accelerating Distributed Deep Reinforcement Learning by In-Network Experience Sampling
    Furukawa, Masaki
    Matsutani, Hiroki
    30TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2022), 2022, : 75 - 82
  • [3] Design Exploration of Multi-FPGAs for Accelerating Deep Learning
    Wang, Teng
    Gong, Lei
    Wang, Chao
    Zhou, Xuehai
    Chen, Huaping
    2019 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2019, : 464 - 465
  • [4] Accelerating Distributed Reinforcement Learning with In-Switch Computing
    Li, Youjie
    Liu, Iou-Jen
    Yuan, Yifan
    Chen, Deming
    Schwing, Alexander
    Huang, Jian
    PROCEEDINGS OF THE 2019 46TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '19), 2019, : 279 - 291
  • [5] Neuroevolution strategies for episodic reinforcement learning
    Heidrich-Meisner, Verena
    Igel, Christian
    JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2009, 64 (04): : 152 - 168
  • [6] Accelerating deep reinforcement learning model for game strategy
    Li, Yifan
    Fang, Yuchun
    Akhtar, Zahid
    NEUROCOMPUTING, 2020, 408 : 157 - 168
  • [7] Accelerating the Deep Reinforcement Learning with Neural Network Compression
    Zhang, Hongjie
    He, Zhuocheng
    Li, Jing
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [8] Neuroevolution for reinforcement learning using evolution strategies
    Igel, C
    CEC: 2003 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-4, PROCEEDINGS, 2003, : 2588 - 2595
  • [9] Reinforcement Learning and Neuroevolution in Flappy Bird Game
    Brandao, Andre
    Pires, Pedro
    Georgieva, Petia
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT I, 2020, 11867 : 225 - 236
  • [10] Neuroevolution-Based Inverse Reinforcement Learning
    Budhraja, Karan K.
    Oates, Tim
    2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2017, : 67 - 76