Hardware-Based Real-Time Deep Neural Network Lossless Weights Compression

被引:3
|
作者
Malach, Tomer [1 ]
Greenberg, Shlomo [1 ]
Haiut, Moshe [2 ]
机构
[1] Ben Gurion Univ Negev, Sch Elect & Comp Engn, IL-84105 Beer Sheva, Israel
[2] DSP Grp Inc, IL-4659071 Herzliyya, Israel
关键词
Deep neural network; entropy compression; hardware decoder; real-time;
D O I
10.1109/ACCESS.2020.3037254
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Neural Networks (DNN) are widely applied to many mobile applications demanding real-time implementation and large memory space. Therefore, it presents a new challenge for low-power and efficient implementation of a diversity of applications, such as speech recognition and image classification, for embedded edge devices. This work presents a hardware-based DNN compression approach to address the limited memory resources in edge devices. We propose a new entropy-based compression algorithm for encoding DNN weights, as well as a real-time decoding method and efficient dedicated hardware implementation. The proposed approach enables a significant reduction of the required DNN weights memory (approximately 70% and 63% for AlexNet and VGG19, respectively), while allowing the decoding of one weight per clock cycle. Results show a high compression ratio compared to well-known lossless compression algorithms. The proposed hardware decoder enables an efficient implementation of large DNN networks in low-power edge devices with limited memory resources.
引用
收藏
页码:205051 / 205060
页数:10
相关论文
共 50 条
  • [31] Lossless Compression Technique for Real-Time Photoplethysmographic Measurements
    Gupta, Rajarshi
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2015, 64 (04) : 975 - 983
  • [32] A hybrid system for real-time lossless image compression
    Drost, GW
    Bourbakis, NG
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 2001, 25 (01) : 19 - 31
  • [33] Architecture for dynamically reconfigurable real-time lossless compression
    Carter, AJ
    Audsley, NC
    [J]. REAL-TIME IMAGING VIII, 2004, 5297 : 231 - 241
  • [34] FIR filters for hardware-based real-time multi-band image blending
    Popovic, Vladan
    Leblebici, Yusuf
    [J]. REAL-TIME IMAGE AND VIDEO PROCESSING 2015, 2015, 9400
  • [35] Deep Learning for Hardware-Based Real-Time Fault Detection and Localization of All Electric Ship MVDC Power System
    Liu, Qin
    Liang, Tian
    Dinavahi, Venkata
    [J]. IEEE OPEN JOURNAL OF INDUSTRY APPLICATIONS, 2020, 1 : 194 - 204
  • [36] A Hardware-Based Architecture-Neutral Framework for Real-Time IoT Workload Forensics
    Zhou, Liwei
    Hu, Yang
    Makris, Yiorgos
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2020, 69 (11) : 1668 - 1680
  • [37] A real-time hardware-based scheduler for next-generation optical burst switches
    Anan, M. T.
    Chaudhry, G. M.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-14, 2007, : 2289 - 2293
  • [38] A Hardware-based Preemptive Dynamic Bandwidth Allocation Mechanism for Real-time Applications in EPONs
    Jia, Wen-Kang
    Yang, Yuan-Rung
    Chen, Yaw-Chung
    [J]. IV INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS 2012 (ICUMT), 2012, : 570 - 576
  • [39] Design of a pipelined hardware architecture for real-time neural network computations
    Ayala, JL
    Lomeña, AG
    López-Vallejo, M
    Fernández, A
    [J]. 2002 45TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL I, CONFERENCE PROCEEDINGS, 2002, : 419 - 422
  • [40] Stream-Based Lossless Data Compression Applying Adaptive Entropy Coding for Hardware-Based Implementation
    Yamagiwa, Shinichi
    Hayakawa, Eisaku
    Marumo, Koichi
    [J]. ALGORITHMS, 2020, 13 (07)