An FPGA Realization of OpenPose based on a Sparse Weight Convolutional Neural Network

被引:7
|
作者
Jinguji, Akira [1 ]
Fujii, Tomoya [1 ]
Sato, Shimpei [1 ]
Nakahara, Hiroki [1 ]
机构
[1] Tokyo Inst Technol, Tokyo, Japan
关键词
D O I
10.1109/FPT.2018.00061
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The OpenPose is a kind of a deep learning based pose estimator which achieved a top accuracy for multiple person pose estimations. Even if using the OpenPose, it is necessary to used high-performance GPU since it requires massive parameters access with high-bandwidth off-chip GDDR5 memories and a higher operation clock frequency. Thus, the power consumption becomes a critical issue to realization. Also, its computation time is slower than the current video standard frame speed (29.97 FPS). In the paper, we introduce a sparse weight CNN to reduce the amount of memory size for weights, which is Then, we offer the indirect memory access architecture to realize the sparse CNN convolutional operation efficiently. Also, to increase throughput further, we applied the six stages of pipeline architecture with a pipeline buffer memory realization. Our implementation satisfied the timing constraint for real-time applications. Since our architecture computed an image with 42.6 msec, the number of frames per second (FPS) was 23.43. We measured the total board power consumption: It was 55 Watt. Thus, the performance per power efficiency was 0.444 (FPS/W). Compared with the NVidia Titan X Pascal architecture GPU, it was 3.49 times faster, it dissipated 3.54 times lower power, and its performance per power efficiency was 13.05 times better. As far as we know, this work is the first FPGA implementation of the OpenPose.
引用
收藏
页码:313 / 316
页数:4
相关论文
共 50 条
  • [1] FPGA-Based Reconfigurable Convolutional Neural Network Accelerator Using Sparse and Convolutional Optimization
    Gowda, Kavitha Malali Vishveshwarappa
    Madhavan, Sowmya
    Rinaldi, Stefano
    Divakarachari, Parameshachari Bidare
    Atmakur, Anitha
    [J]. ELECTRONICS, 2022, 11 (10)
  • [2] Design of Convolutional Neural Network Based on FPGA
    Zhai, Sheping
    Qiu, Cheng
    Yang, Yuanyuan
    Li, Jing
    Cui, Yiming
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SCIENCE AND APPLICATION TECHNOLOGY, 2019, 1168
  • [3] FPGA Accelerator for Homomorphic Encrypted Sparse Convolutional Neural Network Inference
    Yang, Yang
    Kuppannagari, Sanmukh R.
    Kannan, Rajgopal
    Prasanna, Viktor K.
    [J]. 2022 IEEE 30TH INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2022), 2022, : 81 - 89
  • [4] FPGA Realization of a Neural Network based Motor Controller
    Diodati, Francesco
    Jeppesen, Ben
    Jervis, Mark
    Saletti, Roberto
    [J]. 2022 IEEE 27TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2022,
  • [5] An FPGA Realization of a Deep Convolutional Neural Network Using a Threshold Neuron Pruning
    Fujii, Tomoya
    Sato, Simpei
    Nakahara, Hiroki
    Motomura, Masato
    [J]. APPLIED RECONFIGURABLE COMPUTING, 2017, 10216 : 268 - 280
  • [6] An FPGA-Based Convolutional Neural Network Coprocessor
    Qiu, Changpei
    Wang, Xin'an
    Zhao, Tianxia
    Li, Qiuping
    Wang, Bo
    Wang, Hu
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [7] Acceleration and Implementation of Convolutional Neural Network Based on FPGA
    Wang, Enyi
    Qiu, Dehui
    [J]. PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 321 - 325
  • [8] WinoNN: Optimizing FPGA-Based Convolutional Neural Network Accelerators Using Sparse Winograd Algorithm
    Wang, Xuan
    Wang, Chao
    Cao, Jing
    Gong, Lei
    Zhou, Xuehai
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (11) : 4290 - 4302
  • [9] Sparse Realization in Unreliable Spin-Transfer-Torque RAM for Convolutional Neural Network
    Cai, Hao
    Chen, Juntong
    Zhou, Yongliang
    Hong, Xiaofeng
    Liu, Bo
    Naviner, Lirida Alves de Barros
    [J]. IEEE TRANSACTIONS ON MAGNETICS, 2021, 57 (02)
  • [10] A Ternary Weight Binary Input Convolutional Neural Network: Realization on the Embedded Processor
    Yonekawa, Haruyoshi
    Sato, Shimpei
    Nakahara, Hiroki
    [J]. 2018 IEEE 48TH INTERNATIONAL SYMPOSIUM ON MULTIPLE-VALUED LOGIC (ISMVL 2018), 2018, : 174 - 179