An Energy-Efficient Deep Belief Network Processor Based on Heterogeneous Multi-Core Architecture With Transposable Memory and On-Chip Learning

被引：2

作者：

Wu, Jiajun ^{[1
]}

Huang, Xuan ^{[1
]}

Yang, Le ^{[1
]}

Wang, Jipeng ^{[1
]}

Liu, Bingqiang ^{[1
]}

Wen, Ziyuan ^{[1
]}

Li, Juhui ^{[2
]}

Yu, Guoyi ^{[1
]}

Chong, Kwen-Siong ^{[3
]}

Wang, Chao ^{[1
,4
]}

机构：

[1] Huazhong Univ Sci & Technol, Sch Opt & Elect Informat, Wuhan 430074, Hubei, Peoples R China

[2] Nations Innovat Technol Pte Ltd, Singapore 117674, Singapore

[3] Nanyang Technol Univ Singapore, Temasek Labs, Singapore 637553, Singapore

[4] Wuhan Natl Lab Optoelect, Wuhan 430074, Hubei, Peoples R China

来源：

IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS | 2021年 / 11卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Neurons; Energy efficiency; Computational modeling; Unsupervised learning; System-on-chip; Integrated circuit modeling; Computer architecture; Edge computing; Deep Belief Network (DBN); on-chip learning; algorithm-architecture-circuit co-design; data reuse; data sparsity; heterogeneous multi-core architecture; transposable memory;

D O I：

10.1109/JETCAS.2021.3114396

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

With the growing interest of edge computing in the Internet of Things (IoT), Deep Neural Network (DNN) hardware processors/accelerators face challenges of low energy consumption, low latency, and data privacy issues. This paper proposes an energy-efficient processor design based on Deep Belief Network (DBN), which is one of the most suitable DNN models for on- chip learning. In this study, a thorough algorithm-architecture-circuit design optimization method is used for efficient design. The characteristics of data reuse and data sparsity in the DBN learning algorithm inspires this study to propose a heterogeneous multi-core architecture with local learning. In addition, novel circuits of transposable weight memory and sparse address generator are proposed to reduce weight memory access and exploit neuron state sparsity, respectively, for maximizing the energy efficiency. The DBN processor is implemented and thoroughly evaluated on Xilinx Zynq FPGA. Implementation results confirm that the proposed DBN processor has excellent energy efficiency of 45.0 pJ per neuron-weight update, which has been improved by 74% against the conventional design.

引用

页码：725 / 738

页数：14

共 50 条

[1] DNPU: An Energy-Efficient Deep-Learning Processor with Heterogeneous Multi-Core Architecture
Shin, Dongjoo
Lee, Jinmook
Lee, Jinsu
Lee, Juhyoung
Yoo, Hoi-Jun
[J]. IEEE MICRO, 2018, 38 (05) : 85 - 93
[2] An Energy-Efficient Deep Learning Processor with Heterogeneous Multi-Core Architecture for Convolutional Neural Networks and Recurrent Neural Networks
Shin, Dongjoo
Lee, Jinmook
Lee, Jinsu
Lee, Juhyoung
Yoo, Hoi-Jun
[J]. 2017 IEEE SYMPOSIUM IN LOW-POWER AND HIGH-SPEED CHIPS (COOL CHIPS), 2017,
[3] Storage Architecture for an On-chip Multi-core Processor
Liu, Mengxiao
Ji, Weixing
Li, Jiaxin
Pu, Xing
[J]. PROCEEDINGS OF THE 2009 12TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN, ARCHITECTURES, METHODS AND TOOLS, 2009, : 263 - 270
[4] XOMA: Exclusive On-Chip Memory Architecture for Energy-Efficient Deep Learning Acceleration
Sim, Hyeonuk
Anderson, Jason H.
Lee, Jongeun
[J]. 24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 651 - 656
[5] An Energy-efficient Multi-core Restricted Boltzmann Machine Processor with On-chip Bio-plausible Learning and Reconfigurable Sparsity
Wu, Jiajun
Huang, Xuan
Yang, Le
Wang, Liang
Wang, Jipeng
Liu, Zuozhu
Chong, Kwen-Siong
Lin, Shaowei
Wang, Chao
[J]. 2020 IEEE ASIAN SOLID-STATE CIRCUITS CONFERENCE (A-SSCC), 2020,
[6] New on-chip interconnection network for multi-core processor
Qiao, Bao-Jun
Shi, Feng
Ji, Wei-Xing
[J]. Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2007, 27 (06): : 511 - 516
[7] A function-based on-chip communication design in the heterogeneous multi-core architecture
Chen, Tianzhou
Chen, Guobing
Dai, Hongjun
Shi, Qinsong
[J]. MUE: 2007 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2007, : 1086 - +
[8] An on-chip communication mechanism design in the embedded heterogeneous multi-core architecture
Yan, Like
Shi, Qingsong
Chen, Tianzhou
Chen, Guobing
[J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 1842 - 1845
[9] An Energy-efficient On-chip Learning Architecture for STDP based Sparse Coding
Kim, Heetak
Tang, Hoyoung
Park, Jongsun
[J]. 2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2019,
[10] Energy-Efficient Partitioning of Hybrid Caches in Multi-Core Architecture
Lee, Dongwoo
Choi, Kiyoung
[J]. 2014 22ND INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2014,

← 1 2 3 4 5 →