Cambricon: An Instruction Set Architecture for Neural Networks

被引:172
|
作者
Liu, Shaoli [1 ,4 ]
Du, Zidong [1 ,4 ]
Tao, Jinhua [1 ,4 ]
Han, Dong [1 ,4 ]
Luo, Tao [1 ,4 ]
Xie, Yuan [2 ]
Chent, Yunji [1 ,3 ]
Chent, Tianshi [1 ,3 ,4 ]
机构
[1] Chinese Acad Sci, ICT, State Key Lab Comp Architecture, Beijing, Peoples R China
[2] UCSB, Dept Elect & Comp Engn, Santa Barbara, CA USA
[3] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China
[4] Cambricon Ltd, Beijing, Peoples R China
关键词
D O I
10.1109/ISCA.2016.42
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Neural Networks (NN) are a family of models for a broad range of emerging machine learning and pattern recondition applications. NN techniques are conventionally executed on general-purpose processors (such as CPU and GPGPU), which are usually not energy-efficient since they invest excessive hardware resources to flexibly support various workloads. Consequently, application-specific hardware accelerators for neural networks have been proposed recently to improve the energy-efficiency. However, such accelerators were designed for a small set of NN techniques sharing similar computational patterns, and they adopt complex and informative instructions (control signals) directly corresponding to high-level functional blocks of an NN (such as layers), or even an NN as a whole. Although straightforward and easy-to-implement for a limited set of similar NN techniques, the lack of agility in the instruction set prevents such accelerator designs from supporting a variety of different NN techniques with sufficient flexibility and efficiency. In this paper, we propose a novel domain-specific Instruction Set Architecture (ISA) for NN accelerators, called Cambricon, which is a load-store architecture that integrates scalar, vector, matrix, logical, data transfer, and control instructions, based on a comprehensive analysis of existing NN techniques. Our evaluation over a total of ten representative yet distinct NN techniques have demonstrated that Cambricon exhibits strong descriptive capacity over a broad range of NN techniques, and provides higher code density than general-purpose ISAs such as x86, MIPS, and GPGPU. Compared to the latest state-of-the-art NN accelerator design DaDianNao [5] (which can only accommodate 3 types of NN techniques), our Cambricon-based accelerator prototype implemented in TSMC 65nm technology incurs only negligible latency/power/area overheads, with a versatile coverage of 10 different NN benchmarks.
引用
收藏
页码:393 / 405
页数:13
相关论文
共 50 条
  • [41] ISDL: An Instruction Set Description Language for retargetability and architecture exploration
    Hadjiyiannis, G
    Hanono, S
    Devadas, S
    [J]. DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2000, 6 (01) : 39 - 69
  • [42] PLX: An instruction set architecture and testbed for multimedia information processing
    Lee, RB
    Fiskiran, AM
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2005, 40 (01): : 85 - 108
  • [43] An architecture framework for transparent instruction set customization in embedded processors
    Clark, N
    Blome, J
    Chu, M
    Mahlke, S
    Biles, S
    Flautner, K
    [J]. 32ND INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2005, : 272 - 283
  • [44] The Design of Reconfigurable Instruction Set Processor Based on ARM Architecture
    Yin, Jinyong
    Xu, Zhenpeng
    Fang, Xinmo
    Zhou, Xihao
    [J]. ADVANCED COMPUTER ARCHITECTURE, 2018, 908 : 66 - 78
  • [45] Instruction set extension exploration in multiple-issue architecture
    Wu, I-Wei
    Chen, Zhi-Yuan
    Shann, Jyh-Jiun
    Chung, Chung-Ping
    [J]. 2008 DESIGN, AUTOMATION AND TEST IN EUROPE, VOLS 1-3, 2008, : 680 - 685
  • [46] LLVA: A low-level virtual instruction set architecture
    Adve, V
    Lattner, C
    Brukman, M
    Shukla, A
    Gaeke, B
    [J]. 36TH INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, PROCEEDINGS, 2003, : 205 - 216
  • [47] Architecture Design of a Variable Length Instruction Set VLIW DSP
    沈钲
    何虎
    杨旭
    贾迪
    孙义和
    [J]. Tsinghua Science and Technology, 2009, 14 (05) : 561 - 569
  • [48] A Scheme to Improve the Intrinsic Error Detection of the Instruction Set Architecture
    Martinez, Jorge A.
    Antonio Maestro, Juan
    Reviriego, Pedro
    [J]. IEEE COMPUTER ARCHITECTURE LETTERS, 2017, 16 (02) : 103 - 106
  • [49] IMPLEMENTATION-INDEPENDENT MODEL OF AN INSTRUCTION SET ARCHITECTURE IN VHDL
    SALINAS, MH
    JOHNSON, BW
    AYLOR, JH
    [J]. IEEE DESIGN & TEST OF COMPUTERS, 1993, 10 (03): : 42 - 54
  • [50] THE 8086 - AN ARCHITECTURE FOR THE FUTURE .3. INSTRUCTION SET CONTINUED
    HEYWOOD, SA
    [J]. BYTE, 1983, 8 (08): : 404 - &