An Energy-Efficient Deep Belief Network Processor Based on Heterogeneous Multi-Core Architecture With Transposable Memory and On-Chip Learning

被引:2
|
作者
Wu, Jiajun [1 ]
Huang, Xuan [1 ]
Yang, Le [1 ]
Wang, Jipeng [1 ]
Liu, Bingqiang [1 ]
Wen, Ziyuan [1 ]
Li, Juhui [2 ]
Yu, Guoyi [1 ]
Chong, Kwen-Siong [3 ]
Wang, Chao [1 ,4 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Opt & Elect Informat, Wuhan 430074, Hubei, Peoples R China
[2] Nations Innovat Technol Pte Ltd, Singapore 117674, Singapore
[3] Nanyang Technol Univ Singapore, Temasek Labs, Singapore 637553, Singapore
[4] Wuhan Natl Lab Optoelect, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Neurons; Energy efficiency; Computational modeling; Unsupervised learning; System-on-chip; Integrated circuit modeling; Computer architecture; Edge computing; Deep Belief Network (DBN); on-chip learning; algorithm-architecture-circuit co-design; data reuse; data sparsity; heterogeneous multi-core architecture; transposable memory;
D O I
10.1109/JETCAS.2021.3114396
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the growing interest of edge computing in the Internet of Things (IoT), Deep Neural Network (DNN) hardware processors/accelerators face challenges of low energy consumption, low latency, and data privacy issues. This paper proposes an energy-efficient processor design based on Deep Belief Network (DBN), which is one of the most suitable DNN models for on- chip learning. In this study, a thorough algorithm-architecture-circuit design optimization method is used for efficient design. The characteristics of data reuse and data sparsity in the DBN learning algorithm inspires this study to propose a heterogeneous multi-core architecture with local learning. In addition, novel circuits of transposable weight memory and sparse address generator are proposed to reduce weight memory access and exploit neuron state sparsity, respectively, for maximizing the energy efficiency. The DBN processor is implemented and thoroughly evaluated on Xilinx Zynq FPGA. Implementation results confirm that the proposed DBN processor has excellent energy efficiency of 45.0 pJ per neuron-weight update, which has been improved by 74% against the conventional design.
引用
收藏
页码:725 / 738
页数:14
相关论文
共 50 条
  • [1] DNPU: An Energy-Efficient Deep-Learning Processor with Heterogeneous Multi-Core Architecture
    Shin, Dongjoo
    Lee, Jinmook
    Lee, Jinsu
    Lee, Juhyoung
    Yoo, Hoi-Jun
    [J]. IEEE MICRO, 2018, 38 (05) : 85 - 93
  • [2] An Energy-Efficient Deep Learning Processor with Heterogeneous Multi-Core Architecture for Convolutional Neural Networks and Recurrent Neural Networks
    Shin, Dongjoo
    Lee, Jinmook
    Lee, Jinsu
    Lee, Juhyoung
    Yoo, Hoi-Jun
    [J]. 2017 IEEE SYMPOSIUM IN LOW-POWER AND HIGH-SPEED CHIPS (COOL CHIPS), 2017,
  • [3] Storage Architecture for an On-chip Multi-core Processor
    Liu, Mengxiao
    Ji, Weixing
    Li, Jiaxin
    Pu, Xing
    [J]. PROCEEDINGS OF THE 2009 12TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN, ARCHITECTURES, METHODS AND TOOLS, 2009, : 263 - 270
  • [4] XOMA: Exclusive On-Chip Memory Architecture for Energy-Efficient Deep Learning Acceleration
    Sim, Hyeonuk
    Anderson, Jason H.
    Lee, Jongeun
    [J]. 24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 651 - 656
  • [5] An Energy-efficient Multi-core Restricted Boltzmann Machine Processor with On-chip Bio-plausible Learning and Reconfigurable Sparsity
    Wu, Jiajun
    Huang, Xuan
    Yang, Le
    Wang, Liang
    Wang, Jipeng
    Liu, Zuozhu
    Chong, Kwen-Siong
    Lin, Shaowei
    Wang, Chao
    [J]. 2020 IEEE ASIAN SOLID-STATE CIRCUITS CONFERENCE (A-SSCC), 2020,
  • [6] New on-chip interconnection network for multi-core processor
    Qiao, Bao-Jun
    Shi, Feng
    Ji, Wei-Xing
    [J]. Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2007, 27 (06): : 511 - 516
  • [7] A function-based on-chip communication design in the heterogeneous multi-core architecture
    Chen, Tianzhou
    Chen, Guobing
    Dai, Hongjun
    Shi, Qinsong
    [J]. MUE: 2007 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2007, : 1086 - +
  • [8] An on-chip communication mechanism design in the embedded heterogeneous multi-core architecture
    Yan, Like
    Shi, Qingsong
    Chen, Tianzhou
    Chen, Guobing
    [J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 1842 - 1845
  • [9] An Energy-efficient On-chip Learning Architecture for STDP based Sparse Coding
    Kim, Heetak
    Tang, Hoyoung
    Park, Jongsun
    [J]. 2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2019,
  • [10] Energy-Efficient Partitioning of Hybrid Caches in Multi-Core Architecture
    Lee, Dongwoo
    Choi, Kiyoung
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2014,