An FPGA Realization of OpenPose based on a Sparse Weight Convolutional Neural Network

被引：7

作者：

Jinguji, Akira ^{[1
]}

Fujii, Tomoya ^{[1
]}

Sato, Shimpei ^{[1
]}

Nakahara, Hiroki ^{[1
]}

机构：

[1] Tokyo Inst Technol, Tokyo, Japan

来源：

2018 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT 2018) | 2018年

关键词：

D O I：

10.1109/FPT.2018.00061

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The OpenPose is a kind of a deep learning based pose estimator which achieved a top accuracy for multiple person pose estimations. Even if using the OpenPose, it is necessary to used high-performance GPU since it requires massive parameters access with high-bandwidth off-chip GDDR5 memories and a higher operation clock frequency. Thus, the power consumption becomes a critical issue to realization. Also, its computation time is slower than the current video standard frame speed (29.97 FPS). In the paper, we introduce a sparse weight CNN to reduce the amount of memory size for weights, which is Then, we offer the indirect memory access architecture to realize the sparse CNN convolutional operation efficiently. Also, to increase throughput further, we applied the six stages of pipeline architecture with a pipeline buffer memory realization. Our implementation satisfied the timing constraint for real-time applications. Since our architecture computed an image with 42.6 msec, the number of frames per second (FPS) was 23.43. We measured the total board power consumption: It was 55 Watt. Thus, the performance per power efficiency was 0.444 (FPS/W). Compared with the NVidia Titan X Pascal architecture GPU, it was 3.49 times faster, it dissipated 3.54 times lower power, and its performance per power efficiency was 13.05 times better. As far as we know, this work is the first FPGA implementation of the OpenPose.

引用

页码：313 / 316

页数：4

共 50 条

[1] FPGA-Based Reconfigurable Convolutional Neural Network Accelerator Using Sparse and Convolutional Optimization
Gowda, Kavitha Malali Vishveshwarappa
Madhavan, Sowmya
Rinaldi, Stefano
Divakarachari, Parameshachari Bidare
Atmakur, Anitha
[J]. ELECTRONICS, 2022, 11 (10)
[2] Design of Convolutional Neural Network Based on FPGA
Zhai, Sheping
Qiu, Cheng
Yang, Yuanyuan
Li, Jing
Cui, Yiming
[J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SCIENCE AND APPLICATION TECHNOLOGY, 2019, 1168
[3] FPGA Accelerator for Homomorphic Encrypted Sparse Convolutional Neural Network Inference
Yang, Yang
Kuppannagari, Sanmukh R.
Kannan, Rajgopal
Prasanna, Viktor K.
[J]. 2022 IEEE 30TH INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2022), 2022, : 81 - 89
[4] FPGA Realization of a Neural Network based Motor Controller
Diodati, Francesco
Jeppesen, Ben
Jervis, Mark
Saletti, Roberto
[J]. 2022 IEEE 27TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2022,
[5] An FPGA Realization of a Deep Convolutional Neural Network Using a Threshold Neuron Pruning
Fujii, Tomoya
Sato, Simpei
Nakahara, Hiroki
Motomura, Masato
[J]. APPLIED RECONFIGURABLE COMPUTING, 2017, 10216 : 268 - 280
[6] An FPGA-Based Convolutional Neural Network Coprocessor
Qiu, Changpei
Wang, Xin'an
Zhao, Tianxia
Li, Qiuping
Wang, Bo
Wang, Hu
[J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
[7] Acceleration and Implementation of Convolutional Neural Network Based on FPGA
Wang, Enyi
Qiu, Dehui
[J]. PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 321 - 325
[8] WinoNN: Optimizing FPGA-Based Convolutional Neural Network Accelerators Using Sparse Winograd Algorithm
Wang, Xuan
Wang, Chao
Cao, Jing
Gong, Lei
Zhou, Xuehai
[J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (11) : 4290 - 4302
[9] Sparse Realization in Unreliable Spin-Transfer-Torque RAM for Convolutional Neural Network
Cai, Hao
Chen, Juntong
Zhou, Yongliang
Hong, Xiaofeng
Liu, Bo
Naviner, Lirida Alves de Barros
[J]. IEEE TRANSACTIONS ON MAGNETICS, 2021, 57 (02)
[10] A Ternary Weight Binary Input Convolutional Neural Network: Realization on the Embedded Processor
Yonekawa, Haruyoshi
Sato, Shimpei
Nakahara, Hiroki
[J]. 2018 IEEE 48TH INTERNATIONAL SYMPOSIUM ON MULTIPLE-VALUED LOGIC (ISMVL 2018), 2018, : 174 - 179

← 1 2 3 4 5 →