A High-Throughput Processor for GDN-Based Deep Learning Image Compression

被引:2
|
作者
Shao, Hu [1 ]
Liu, Bingtao [1 ]
Li, Zongpeng [1 ]
Yan, Chenggang [1 ]
Sun, Yaoqi [1 ]
Wang, Tingyu [1 ]
机构
[1] Hangzhou Dianzi Univ, Inst Informat & Control, Hangzhou 310000, Peoples R China
关键词
deep learning image compression; FPGA-based accelerator; generalized divisive normalization; high-throughput; CNN;
D O I
10.3390/electronics12102289
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning-based image compression techniques can take advantage of the autoencoder's benefits to achieve greater compression quality at the same bit rate as traditional image compression, which is more in line with user desires. Designing a high-performance processor that can increase the inference speed and efficiency of the deep learning image compression (DIC) network is important to make this technology more extensively employed in mobile devices. To the best of our knowledge, there is no dedicated processor that can accelerate DIC with low power consumption, and general-purpose network accelerators based on field programmable gate arrays (FPGA) cannot directly process compressed networks, so we propose a processor suitable for DIC in this paper. First, we analyze the image compression algorithm and quantize the data of the network into 16-bit fixed points using a dynamic hierarchical quantization. Then, we design an operation module, which is the core computational part for processing. It is composed of convolution, sampling, and normalization units, which pipeline the inference calculation for each layer of the network. To achieve high-throughput inference computing, the processing elements group (PEG) array with local buffers is developed for convolutional computation. Based on the common components in encoding and decoding, the sampling and normalization units are compatible with codec computation and utilized for image compression with time-sharing multiplexing. According to the control signal, the operation module could change the order of data flow through the three units so that they perform encoding and decoding operations, respectively. Based on these design methods and schemes, DIC is deployed into the Xilinx Zynq ZCU104 development board to achieve high-throughput image compression at 6 different bit rates. The experimental results show that the processor can run at 200 MHz and achieve 283.4 GOPS for the 16 bits fixed-point DIC network.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] High-Throughput Lossy-to-Lossless 3D Image Compression
    Rossinelli, Diego
    Fourestey, Gilles
    Schmidt, Felix
    Busse, Bjoern
    Kurtcuoglu, Vartan
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (02) : 607 - 620
  • [32] The effect of image compression on classification and storage requirements in a high-throughput crystallization system
    Berry, I
    Wilson, J
    Mayo, C
    Diprose, J
    Esnouf, R
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 117 - 124
  • [33] High-Throughput Area-Efficient Processor for Cryptography
    HUO Yuanhong
    LIU Dake
    ChineseJournalofElectronics, 2017, 26 (03) : 514 - 521
  • [34] A high-throughput low-cost AES processor
    Su, CP
    Lin, TF
    Huang, CT
    Wu, CW
    IEEE COMMUNICATIONS MAGAZINE, 2003, 41 (12) : 86 - 91
  • [35] High-Throughput Trellis Processor for Multistandard FEC Decoding
    Wu, Zhenzhi
    Liu, Dake
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2015, 23 (12) : 2757 - 2767
  • [36] High-Throughput Area-Efficient Processor for Cryptography
    Huo Yuanhong
    Liu Dake
    CHINESE JOURNAL OF ELECTRONICS, 2017, 26 (03) : 514 - 521
  • [37] A High-Throughput Low-Energy Arithmetic Processor
    Hong-Thu Nguyen
    Xuan-Thuan Nguyen
    Cong-Kha Pham
    IEICE TRANSACTIONS ON ELECTRONICS, 2018, E101C (04): : 281 - 284
  • [38] A High-Throughput Memory-Based FFT/IFFT Processor for OFDM Systems
    Lin, Kuang-Hao
    Shen, Che-Ying
    Huang, Shi-Yan
    Chen, Hou-Ming
    Wang, Liang-Hung
    Lee, Shuenn-Yuh
    2015 INTERNATIONAL SYMPOSIUM ON BIOELECTRONICS AND BIOINFORMATICS (ISBB), 2015, : 152 - 155
  • [39] High-throughput deep learning variant effect prediction with Sequence UNET
    Dunham, Alistair S.
    Beltrao, Pedro
    AlQuraishi, Mohammed
    GENOME BIOLOGY, 2023, 24 (01)
  • [40] Virtual high-throughput screening: A combined deep-learning approach
    Morris, Paul
    St Clair, Rachel
    Teti, Mike
    Clark, Evan
    Hahn, William
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257