TENG: A General-Purpose and Efficient Processor Architecture for Accelerating DNN

被引:0
|
作者
Zhang, Zekun [1 ,2 ]
Cai, Yujie [1 ]
Liao, Tianjiao [2 ]
Xu, Chengyu [2 ]
Jiao, Xin [2 ]
机构
[1] Fudan Univ, State Key Lab ASIC & Syst, Shanghai, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
关键词
Neural network; deep learning accelerator; hardware accelerator; DNN; ASIC; PERFORMANCE;
D O I
10.1109/AICAS59952.2024.10595854
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has been widely deployed in the fields such as computer vision and speech, etc. However, with the development of deep learning algorithms, neural networks have gradually become more complex, the subsequent computing requirements for huge amounts of data have posed greater challenges to hardware updates and improvements. In this work, a general-purpose and efficient deep neural network (DNN) accelerator is proposed, named TENG. In TENG, we proposed a unified architecture with general and dedicated coexistence to support various network operators, and the memory access engine in TENG optimizes off-chip data transmission to achieve better bandwidth utilization. The correctness of TENG is verified on FPGA and implemented in 7-nm FinFET technology. It achieves 4TOPS peak throughput@INT16 and has an energy efficiency of 5.15TOPS/W using VGG-16 at 1GHz clock frequency.
引用
收藏
页码:149 / 153
页数:5
相关论文
共 50 条
  • [1] A GENERAL-PURPOSE VIDEO SIGNAL PROCESSOR - ARCHITECTURE AND PROGRAMMING
    DIJKSTRA, H
    ESSINK, G
    HAFKAMP, AJM
    DENHENGST, H
    HUIZER, CM
    VANROERMUND, AHM
    SLUYTER, RJ
    SNIJDER, PJ
    [J]. PROCEEDINGS - IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN : VLSI IN COMPUTERS & PROCESSORS, 1989, : 74 - 77
  • [2] A GENERAL-PURPOSE SIGNAL PROCESSOR ARCHITECTURE FOR NEUROCOMPUTING AND PREPROCESSING APPLICATIONS
    RAMACHER, U
    BEICHTER, J
    BRULS, N
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING, 1993, 6 (01): : 45 - 56
  • [3] An FFT Performance Model for Optimizing General-Purpose Processor Architecture
    李玲
    陈云霁
    刘道福
    钱诚
    胡伟武
    [J]. Journal of Computer Science & Technology, 2011, 26 (05) : 875 - 889
  • [4] An FFT Performance Model for Optimizing General-Purpose Processor Architecture
    Li, Ling
    Chen, Yun-Ji
    Liu, Dao-Fu
    Qian, Cheng
    Hu, Wei-Wu
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2011, 26 (05): : 875 - 889
  • [5] A GENERAL-PURPOSE SIGNAL PROCESSOR
    FREY, AH
    MINTZER, FC
    [J]. COMPUTER NETWORKS AND ISDN SYSTEMS, 1982, 6 (03): : 224 - 224
  • [6] No Such Thing as a General-Purpose Processor
    Chisnall, David
    [J]. COMMUNICATIONS OF THE ACM, 2014, 57 (12) : 44 - 48
  • [7] GENERAL-PURPOSE PROCESSOR FOR TYPOGRAPHIC EDITING
    FABER, R
    MURASZKIEWICZ, M
    OSTROWSKI, M
    [J]. INTERNATIONAL FORUM ON INFORMATION AND DOCUMENTATION, 1987, 12 (03): : 19 - 20
  • [8] Space systems general-purpose processor
    Perschy, JK
    [J]. IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2000, 15 (11) : 15 - 19
  • [9] A GENERAL-PURPOSE FUZZY INFERENCE PROCESSOR
    EICHFELD, H
    KLIMKE, M
    MENKE, M
    NOLLES, J
    KUNEMUND, T
    [J]. IEEE MICRO, 1995, 15 (03) : 12 - 17
  • [10] Specification of a configurable general-purpose protocol processor
    Henriksson, T
    Nordqvist, U
    Liu, D
    [J]. IEE PROCEEDINGS-CIRCUITS DEVICES AND SYSTEMS, 2002, 149 (03): : 198 - 202