DARL: Distributed Reconfigurable Accelerator for Hyperdimensional Reinforcement Learning

被引:13
|
作者
Chen, Hanning [1 ]
Issa, Mariam [1 ]
Ni, Yang [1 ]
Imani, Mohsen [1 ]
机构
[1] Univ Calif Irvine, Irvine, CA 92717 USA
基金
美国国家科学基金会;
关键词
D O I
10.1145/3508352.3549437
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Reinforcement Learning (RL) is a powerful technology to solve decisionmaking problems such as robotics control. Modern RL algorithms, i.e., Deep Q-Learning, are based on costly and resource hungry deep neural networks. This motivates us to deploy alternative models for powering RL agents on edge devices. Recently, brain-inspired HyperDimensional Computing (HDC) has been introduced as a promising solution for lightweight and efficient machine learning, particularly for classification. In this work, we develop a novel platform capable of real-time hyper dimensional reinforcement learning. Our heterogeneous CPU-FPGA platform, called DARL, maximizes FPGA's computing capabilities by applying hardware optimizations to hyperdimensional computing's critical operations, including hardware -friendly encoder IP, the hypervector chunk fragmentation, and the delayed model update. Aside from hardware innovation, we also extend the platform to basic single agent RL to support multi-agents distributed learning. We evaluate the effectiveness of our approach on OpenAl Gym tasks. Our results show that the FPGA platform provides on average 20x speedup compared to current state-of-the-art hyperdimensional RL methods running on Intel Xeon 6226 CPU. In addition, DARL provides around 4.8x faster and 4.2x higher energy efficiency compared to the state-of-the-art RL accelerator While ensuring a better or comparable quality of learning.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Sample-efficient reinforcement learning for CERN accelerator control
    Kain, Verena
    Hirlander, Simon
    Goddard, Brennan
    Velotti, Francesco Maria
    Porta, Giovanni Zevi Della
    Bruchon, Niky
    Valentino, Gianluca
    PHYSICAL REVIEW ACCELERATORS AND BEAMS, 2020, 23 (12)
  • [22] Fast DSE of reconfigurable accelerator systems via ensemble machine learning
    Lopes, Alba
    Pereira, Monica
    ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2021, 108 (03) : 495 - 509
  • [23] Universal Reconfigurable Hardware Accelerator for Sparse Machine Learning Predictive Models
    Vranjkovic, Vuk
    Teodorovic, Predrag
    Struharik, Rastislav
    ELECTRONICS, 2022, 11 (08)
  • [24] Coarse-grained Reconfigurable Hardware Accelerator of Machine Learning Classifiers
    Vranjkovic, Vuk
    Struharik, Rastislav
    PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, (IWSSIP 2016), 2016, : 193 - 196
  • [25] HygHD: Hyperdimensional Hypergraph Learning
    Kang, Jaeyoung
    Lee, You Hak
    Zhou, Minxuan
    Xu, Weihong
    Rosing, Tajana
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [26] Memory-Centric Reconfigurable Accelerator for Classification and Machine Learning Applications
    Karam, Robert
    Paul, Somnath
    Puri, Ruchir
    Bhunia, Swarup
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2017, 13 (03)
  • [27] Distributed reinforcement learning for sequential decision making
    Rogova, G
    Scott, P
    Lolett, C
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOL II, 2002, : 1263 - 1268
  • [28] An Exact Distributed Newton Method for Reinforcement Learning
    Tutunov, Rasul
    Ammar, Haitham Bou
    Jadbabaie, Ali
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 1003 - 1008
  • [29] Distributed Scheduling for Autonomous Vehicles by Reinforcement Learning
    Unoki, T.
    Suetake, N.
    Denki Gakkai Ronbunshi. C, Erekutoronikusu Joho Kogaku, Shisutemu, 117 (10):
  • [30] Distributed Reinforcement Learning for Networked Dynamical Systems
    Sadamoto, Tomonori
    Kikuya, Ayafumi
    Chakrabortty, Aranya
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2024, 11 (02): : 1103 - 1115