FPGA-Based Hybrid-Type Implementation of Quantized Neural Networks for Remote Sensing Applications

Cited by: 26
Authors
Wei, Xin [1 ]
Liu, Wenchao [1 ]
Chen, Lei [1 ]
Ma, Long [2 ]
Chen, He [1 ]
Zhuang, Yin [3 ]
Affiliations
[1] Beijing Inst Technol, Beijing Key Lab Embedded Real Time Informat Proc, Beijing 100081, Peoples R China
[2] Zhengzhou Univ, Sch Informat Engn, Zhengzhou 450001, Henan, Peoples R China
[3] Peking Univ, Sch Elect Engn & Comp Sci, Beijing 100087, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
remote sensing; convolutional neural network; hybrid-type inference; symmetric quantization; FPGA; CLASSIFICATION;
DOI
10.3390/s19040924
Chinese Library Classification
O65 [Analytical Chemistry];
Subject Classification Codes
070302; 081704;
Abstract
Recently, convolutional neural network (CNN)-based methods have been widely used in remote sensing applications, such as object detection and classification, and have achieved significant improvements in performance. Furthermore, there is substantial demand for hardware implementations in real-time remote sensing processing. However, the arithmetic and storage requirements of floating-point models hinder the deployment of networks on hardware platforms with limited resource and power budgets, such as field-programmable gate arrays (FPGAs) and application-specific integrated circuits (ASICs). To solve this problem, this paper focuses on optimizing the hardware design of CNNs with low bit-width integers obtained by quantization. First, a hybrid-type inference method based on a symmetric quantization scheme is proposed, which replaces floating-point arithmetic with low bit-width integer operations. Then, a training approach for the quantized network is introduced to reduce accuracy degradation. Finally, a low bit-width processing engine (PE) is proposed to optimize the FPGA hardware design for remote sensing image classification. In addition, a fused-layer PE is presented for state-of-the-art CNNs equipped with batch normalization and LeakyReLU. Experiments performed on the Moving and Stationary Target Acquisition and Recognition (MSTAR) dataset using a graphics processing unit (GPU) demonstrate that the accuracy of the 8-bit quantized model drops by only about 1%, which is an acceptable accuracy loss. The accuracy results obtained on the FPGA are consistent with those on the GPU. As for FPGA resource consumption, the Look-Up Table (LUT), Flip-Flop (FF), Digital Signal Processor (DSP), and Block Random Access Memory (BRAM) usage is reduced by 46.21%, 43.84%, 45%, and 51%, respectively, compared with the floating-point implementation.
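The abstract describes replacing floating-point arithmetic with low bit-width integers via a symmetric quantization scheme. The paper's exact formulation is not reproduced in this record, but a minimal sketch of per-tensor symmetric quantization (zero-point fixed at 0, scale chosen so the largest absolute value maps to the top of the signed integer range) illustrates the general idea; the function names and the per-tensor granularity here are illustrative assumptions, not the authors' API:

```python
import numpy as np

def symmetric_quantize(x, num_bits=8):
    """Symmetrically quantize a float tensor to signed integers.

    The scale maps max(|x|) onto [-(2^(b-1) - 1), 2^(b-1) - 1];
    the zero-point is fixed at 0, so 0.0 maps exactly to integer 0.
    Assumes num_bits <= 8 (values are stored as int8 below).
    """
    qmax = 2 ** (num_bits - 1) - 1          # e.g. 127 for 8-bit
    scale = np.max(np.abs(x)) / qmax        # per-tensor scale factor
    q = np.clip(np.round(x / scale), -qmax, qmax).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Map the integers back to approximate float values."""
    return q.astype(np.float32) * scale

# Example: quantize a small weight vector, then reconstruct it.
w = np.array([-1.27, 0.5, 0.02, 1.27], dtype=np.float32)
qw, s = symmetric_quantize(w)
w_hat = dequantize(qw, s)   # close to w, up to rounding error
```

With a symmetric scheme the integer multiply-accumulate needs no zero-point correction terms, which is what makes low bit-width PEs on an FPGA cheap in LUT/DSP terms relative to floating-point units.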
Pages: 21
Related Papers
50 records total
  • [1] An Efficient FPGA-Based Implementation for Quantized Remote Sensing Image Scene Classification Network
    Zhang, Xiaoli
    Wei, Xin
    Sang, Qianbo
    Chen, He
    Xie, Yizhuang
    [J]. ELECTRONICS, 2020, 9 (09) : 1 - 20
  • [2] FPGA-based Accelerator for Losslessly Quantized Convolutional Neural Networks
    Sit, Mankit
    Kazami, Ryosuke
    Amano, Hideharu
    [J]. 2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), 2017, : 295 - 298
  • [3] Implementation of FPGA-based Accelerator for Deep Neural Networks
    Tsai, Tsung-Han
    Ho, Yuan-Chen
    Sheu, Ming-Hwa
    [J]. 2019 IEEE 22ND INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2019,
  • [4] FPGA-based Implementation of Sorting Networks in MMC applications
    Ricco, Mattia
    Mathe, Laszlo
    Teodorescu, Remus
    [J]. 2016 18TH EUROPEAN CONFERENCE ON POWER ELECTRONICS AND APPLICATIONS (EPE'16 ECCE EUROPE), 2016,
  • [5] Development and Implementation of Parameterized FPGA-Based General Purpose Neural Networks for Online Applications
    Gomperts, Alexander
    Ukil, Abhisek
    Zurfluh, Franz
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2011, 7 (01) : 78 - 89
  • [6] An FPGA-based Accelerator Implementation for Deep Convolutional Neural Networks
    Zhou, Yongmei
    Jiang, Jingfei
    [J]. PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 829 - 832
  • [7] Development, Implementation and Prospect of FPGA-Based Deep Neural Networks
    Jiao, Li-Cheng
    Sun, Qi-Gong
    Yang, Yu-Ting
    Feng, Yu-Xin
    Li, Xiu-Fang
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (03): : 441 - 471
  • [8] A Hybrid Architecture for Efficient FPGA-based Implementation of Multilayer Neural Network
    Lin, Zhen
    Dong, Yiping
    Li, Yan
    Watanabe, Takahiro
    [J]. PROCEEDINGS OF THE 2010 IEEE ASIA PACIFIC CONFERENCE ON CIRCUIT AND SYSTEM (APCCAS), 2010, : 616 - 619
  • [9] FPGA-based design and implementation of the location attention mechanism in neural networks
    Qiao, Ruixiu
    Guo, Xiaozhou
    Mao, Wenyu
    Li, Jixing
    Lu, Huaxiang
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (04) : 5309 - 5323
  • [10] Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform
    Baskin, Chaim
    Zheltonozhskii, Evgenii
    Bronstein, Alex M.
    Mendelson, Avi
    Liss, Natan
    [J]. 2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 162 - 169