DARL: Distributed Reconfigurable Accelerator for Hyperdimensional Reinforcement Learning

被引：13

作者：

Chen, Hanning ^{[1
]}

Issa, Mariam ^{[1
]}

Ni, Yang ^{[1
]}

Imani, Mohsen ^{[1
]}

机构：

[1] Univ Calif Irvine, Irvine, CA 92717 USA

来源：

2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD | 2022年

基金：

美国国家科学基金会;

关键词：

D O I：

10.1145/3508352.3549437

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Reinforcement Learning (RL) is a powerful technology to solve decisionmaking problems such as robotics control. Modern RL algorithms, i.e., Deep Q-Learning, are based on costly and resource hungry deep neural networks. This motivates us to deploy alternative models for powering RL agents on edge devices. Recently, brain-inspired HyperDimensional Computing (HDC) has been introduced as a promising solution for lightweight and efficient machine learning, particularly for classification. In this work, we develop a novel platform capable of real-time hyper dimensional reinforcement learning. Our heterogeneous CPU-FPGA platform, called DARL, maximizes FPGA's computing capabilities by applying hardware optimizations to hyperdimensional computing's critical operations, including hardware -friendly encoder IP, the hypervector chunk fragmentation, and the delayed model update. Aside from hardware innovation, we also extend the platform to basic single agent RL to support multi-agents distributed learning. We evaluate the effectiveness of our approach on OpenAl Gym tasks. Our results show that the FPGA platform provides on average 20x speedup compared to current state-of-the-art hyperdimensional RL methods running on Intel Xeon 6226 CPU. In addition, DARL provides around 4.8x faster and 4.2x higher energy efficiency compared to the state-of-the-art RL accelerator While ensuring a better or comparable quality of learning.

引用

页数：9

共 50 条

[21] Sample-efficient reinforcement learning for CERN accelerator control
Kain, Verena
Hirlander, Simon
Goddard, Brennan
Velotti, Francesco Maria
Porta, Giovanni Zevi Della
Bruchon, Niky
Valentino, Gianluca
PHYSICAL REVIEW ACCELERATORS AND BEAMS, 2020, 23 (12)
[22] Fast DSE of reconfigurable accelerator systems via ensemble machine learning
Lopes, Alba
Pereira, Monica
ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2021, 108 (03) : 495 - 509
[23] Universal Reconfigurable Hardware Accelerator for Sparse Machine Learning Predictive Models
Vranjkovic, Vuk
Teodorovic, Predrag
Struharik, Rastislav
ELECTRONICS, 2022, 11 (08)
[24] Coarse-grained Reconfigurable Hardware Accelerator of Machine Learning Classifiers
Vranjkovic, Vuk
Struharik, Rastislav
PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, (IWSSIP 2016), 2016, : 193 - 196
[25] HygHD: Hyperdimensional Hypergraph Learning
Kang, Jaeyoung
Lee, You Hak
Zhou, Minxuan
Xu, Weihong
Rosing, Tajana
2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
[26] Memory-Centric Reconfigurable Accelerator for Classification and Machine Learning Applications
Karam, Robert
Paul, Somnath
Puri, Ruchir
Bhunia, Swarup
ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2017, 13 (03)
[27] Distributed reinforcement learning for sequential decision making
Rogova, G
Scott, P
Lolett, C
PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOL II, 2002, : 1263 - 1268
[28] An Exact Distributed Newton Method for Reinforcement Learning
Tutunov, Rasul
Ammar, Haitham Bou
Jadbabaie, Ali
2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 1003 - 1008
[29] Distributed Scheduling for Autonomous Vehicles by Reinforcement Learning
Unoki, T.
Suetake, N.
Denki Gakkai Ronbunshi. C, Erekutoronikusu Joho Kogaku, Shisutemu, 117 (10):
[30] Distributed Reinforcement Learning for Networked Dynamical Systems
Sadamoto, Tomonori
Kikuya, Ayafumi
Chakrabortty, Aranya
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2024, 11 (02): : 1103 - 1115

← 1 2 3 4 5 →