Hardware-Based Real-Time Deep Neural Network Lossless Weights Compression

被引:3
|
作者
Malach, Tomer [1 ]
Greenberg, Shlomo [1 ]
Haiut, Moshe [2 ]
机构
[1] Ben Gurion Univ Negev, Sch Elect & Comp Engn, IL-84105 Beer Sheva, Israel
[2] DSP Grp Inc, IL-4659071 Herzliyya, Israel
关键词
Deep neural network; entropy compression; hardware decoder; real-time;
D O I
10.1109/ACCESS.2020.3037254
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Neural Networks (DNN) are widely applied to many mobile applications demanding real-time implementation and large memory space. Therefore, it presents a new challenge for low-power and efficient implementation of a diversity of applications, such as speech recognition and image classification, for embedded edge devices. This work presents a hardware-based DNN compression approach to address the limited memory resources in edge devices. We propose a new entropy-based compression algorithm for encoding DNN weights, as well as a real-time decoding method and efficient dedicated hardware implementation. The proposed approach enables a significant reduction of the required DNN weights memory (approximately 70% and 63% for AlexNet and VGG19, respectively), while allowing the decoding of one weight per clock cycle. Results show a high compression ratio compared to well-known lossless compression algorithms. The proposed hardware decoder enables an efficient implementation of large DNN networks in low-power edge devices with limited memory resources.
引用
收藏
页码:205051 / 205060
页数:10
相关论文
共 50 条
  • [1] Real-Time Anomaly Detection Using Hardware-based Unsupervised Spiking Neural Network (TinySNN)
    Mehrabi, Ali
    Dennler, Nik
    Bethi, Yeshwanth
    van Schaik, Andre
    Afshar, Saeed
    [J]. 2024 33RD INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, ISIE 2024, 2024,
  • [2] Hardware-Based Real-Time Workload Forensics
    Zhang, Yunjie
    Zhou, Liwei
    Makris, Yiorgos
    [J]. IEEE DESIGN & TEST, 2020, 37 (04) : 52 - 58
  • [3] A hardware architecture of the real-time and lossless data compression based on LZW algorithm
    Zhang, JY
    Pei, DX
    Zhu, J
    [J]. ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, : 6753 - 6756
  • [4] Research on realization of real-time and lossless hardware compression algorithm
    Li, JM
    Mao, HY
    Zhang, WD
    Lin, J
    Ma, YC
    [J]. ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, : 6808 - 6811
  • [5] Hardware-based power management for real-time applications
    Uhrig, S
    Ungerer, T
    [J]. SECOND INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING, PROCEEDINGS, 2003, : 258 - 265
  • [6] Towards a Hardware-based System for Real-Time Vehicle Tracking
    Zheng, Yi
    Tang, Hua
    [J]. 2008 IEEE INTELLIGENT VEHICLES SYMPOSIUM, VOLS 1-3, 2008, : 56 - 61
  • [8] Real-Time Capable Hardware-based Parser for Efficient XML Interchange
    Altmann, Vlado
    Skodzik, Jan
    Danielis, Peter
    Nam Pham Van
    Golatowski, Frank
    Timmermann, Dirk
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON COMMUNICATION SYSTEMS, NETWORKS & DIGITAL SIGNAL PROCESSING (CSNDSP), 2014, : 395 - 400
  • [9] A graphics hardware-based accessibility analysis for real-time robotic manipulation
    Jang, Han-Young
    Moradi, Hadi
    Lee, Sukhan
    Jang, Daesik
    Kim, Eunyoung
    Han, JungHyun
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2007, 14 : 97 - 106
  • [10] Hardware-based Fast Real-time Image Classification with Stochastic Computing
    Muthappa, Ponnanna Kelettira
    Neugebauer, Florian
    Polian, Ilia
    Hayes, John P.
    [J]. 2020 IEEE 38TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2020), 2020, : 340 - 347