An Energy Efficient EdgeAI Autoencoder Accelerator for Reinforcement Learning

Cited by: 9
Authors
Manjunath N.K. [1 ]
Shiri A. [1 ]
Hosseini M. [1 ]
Prakash B. [1 ]
Waytowich N.R. [2 ]
Mohsenin T. [1 ]
Affiliations
[1] Department of Computer Science and Electrical Engineering, University of Maryland, Baltimore, MD 21250
[2] U.S. Army Research Laboratory, Aberdeen, MD
Keywords
ASIC; autoencoder; autonomous systems; binary neural networks (BNNs); EdgeAI; energy efficiency; FPGA; reinforcement learning; ternary neural networks (TNNs)
DOI
10.1109/OJCAS.2020.3043737
Abstract
In EdgeAI embedded devices that exploit reinforcement learning (RL), it is essential to reduce the number of actions the agent takes in the real world and to minimize the compute-intensive policy learning process. Convolutional autoencoders (AEs) attached to an RL agent have been shown to substantially speed up policy learning by compressing the high-dimensional input data into a small latent representation that is fed to the agent. Despite reducing the policy learning time, the AE adds significant computational and memory complexity, increasing both the total computation and the model size. In this article, we propose a model for speeding up the policy learning process of an RL agent using AE neural networks, which employs binary and ternary precision to address this complexity overhead without deteriorating the policy the agent learns. Binary neural networks (BNNs) and ternary neural networks (TNNs) compress weights into 1-bit and 2-bit representations, which significantly reduces the model size and memory footprint and simplifies multiply-accumulate (MAC) operations. We evaluate our model in three RL environments, DonkeyCar, Miniworld Sidewalk, and Miniworld Object Pickup, which emulate real-world applications with different levels of complexity. With proper hyperparameter optimization and architecture exploration, TNN models achieve nearly the same average reward, peak signal-to-noise ratio (PSNR), and mean squared error (MSE) as the full-precision model while reducing the model size by 10× compared to full precision and 3× compared to BNNs. In BNN models, however, the average reward drops by 12%-25% compared to full precision even after increasing the model size by 4×. We design and implement a scalable hardware accelerator that is configurable in the number of processing elements (PEs) and the memory data width to achieve the best power, performance, and energy-efficiency trade-off for EdgeAI embedded devices. The proposed hardware implemented on an Artix-7 FPGA dissipates 250 μJ of energy while meeting the 30 frames per second (FPS) throughput requirement, and can be configured to reach an efficiency of over 1 TOP/J. The accelerator is also synthesized and placed-and-routed in 14 nm FinFET ASIC technology, which brings the energy dissipation down to 3.9 μJ and raises the maximum throughput to 1,250 FPS. Compared to state-of-the-art TNN implementations on the same target platforms, our hardware is 5× and 4.4× (2.2× when technology-scaled) more energy efficient on FPGA and ASIC, respectively. © 2020 IEEE.
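As a rough illustration of the weight compression described in the abstract, the sketch below shows a threshold-based ternarization of a weight tensor to {-1, 0, +1} and how a multiply-accumulate then collapses into signed additions. This is a minimal sketch under common TNN assumptions; the threshold heuristic, scale factor, and function names are illustrative and are not taken from the paper.

```python
import numpy as np

def ternarize(weights, threshold_factor=0.7):
    """Quantize a float weight tensor to {-1, 0, +1} (assumed heuristic).

    Values whose magnitude falls below a per-tensor threshold become 0;
    the rest keep only their sign. A single scale factor is retained to
    recover dynamic range at inference time.
    """
    threshold = threshold_factor * np.mean(np.abs(weights))
    ternary = np.sign(weights) * (np.abs(weights) > threshold)
    scale = np.abs(weights[ternary != 0]).mean() if np.any(ternary) else 1.0
    return ternary.astype(np.int8), scale

def ternary_mac(inputs, ternary_weights, scale):
    """MAC with ternary weights reduces to signed additions:
    add inputs where w = +1, subtract where w = -1, skip where w = 0."""
    acc = inputs[ternary_weights == 1].sum() - inputs[ternary_weights == -1].sum()
    return scale * acc

# Example: one output neuron of a (hypothetical) AE encoder layer
rng = np.random.default_rng(0)
w = rng.normal(size=256).astype(np.float32)
x = rng.normal(size=256).astype(np.float32)
w_t, s = ternarize(w)
print(ternary_mac(x, w_t, s), float(x @ w))  # quantized vs. full-precision dot product
```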
Pages: 182-195
Page count: 13
Related papers
50 records in total
  • [31] Energy-Efficient IoT Sensor Calibration With Deep Reinforcement Learning
    Ashiquzzaman, Akm
    Lee, Hyunmin
    Um, Tai-Won
    Kim, Jinsul
    IEEE ACCESS, 2020, 8 : 97045 - 97055
  • [32] Multiple agent based reinforcement learning for energy efficient routing in WSN
    D. Prabhu
    R. Alageswaran
    S. Miruna Joe Amali
    Wireless Networks, 2023, 29 : 1787 - 1797
  • [33] Personalized and Energy-Efficient Health Monitoring: A Reinforcement Learning Approach
    Eden, Batchen
    Bistritz, Ilai
    Bambos, Nicholas
    Ben-Gal, Irad
    Khmelnitsky, Evgeni
    IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 955 - 960
  • [34] Lightweight Reinforcement Learning for Energy Efficient Communications in Wireless Sensor Networks
    Savaglio, Claudio
    Pace, Pasquale
    Aloi, Gianluca
    Liotta, Antonio
    Fortino, Giancarlo
    IEEE ACCESS, 2019, 7 : 29355 - 29364
  • [35] Balanced energy-efficient routing in MANETs using reinforcement learning
    Naruephiphat, W.
    Usaha, W.
    2008 THE INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, 2008, : 54 - 58
  • [36] Multiple agent based reinforcement learning for energy efficient routing in WSN
    Prabhu, D.
    Alageswaran, R.
    Miruna Joe Amali, S.
    WIRELESS NETWORKS, 2023, 29 (04) : 1787 - 1797
  • [37] Energy-efficient VM scheduling based on deep reinforcement learning
    Wang, Bin
    Liu, Fagui
    Lin, Weiwei
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 125 : 616 - 628
  • [38] Deep Reinforcement Learning Based Energy Efficient Underwater Acoustic Communications
    Zhu, Zewen
    Ye, Xiaowen
    Fu, Liqun
    GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
  • [39] Safe Reinforcement Learning for an Energy-Efficient Driver Assistance System
    Hailemichael, Habtamu
    Ayalew, Beshah
    Kerbel, Lindsey
    Ivanco, Andrej
    Loiselle, Keith
    IFAC PAPERSONLINE, 2022, 55 (37): 615 - 620
  • [40] Reinforcement Learning Based Energy-Efficient Fast Routing for FANETs
    Li, Jieling
    Xiao, Liang
    Qi, Xuchen
    Lv, Zefang
    Chen, Qiaoxin
    Liu, Yong-Jin
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (11) : 7063 - 7076