TENG: A General-Purpose and Efficient Processor Architecture for Accelerating DNN

被引:0
|
作者
Zhang, Zekun [1 ,2 ]
Cai, Yujie [1 ]
Liao, Tianjiao [2 ]
Xu, Chengyu [2 ]
Jiao, Xin [2 ]
机构
[1] Fudan Univ, State Key Lab ASIC & Syst, Shanghai, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
关键词
Neural network; deep learning accelerator; hardware accelerator; DNN; ASIC; PERFORMANCE;
D O I
10.1109/AICAS59952.2024.10595854
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has been widely deployed in the fields such as computer vision and speech, etc. However, with the development of deep learning algorithms, neural networks have gradually become more complex, the subsequent computing requirements for huge amounts of data have posed greater challenges to hardware updates and improvements. In this work, a general-purpose and efficient deep neural network (DNN) accelerator is proposed, named TENG. In TENG, we proposed a unified architecture with general and dedicated coexistence to support various network operators, and the memory access engine in TENG optimizes off-chip data transmission to achieve better bandwidth utilization. The correctness of TENG is verified on FPGA and implemented in 7-nm FinFET technology. It achieves 4TOPS peak throughput@INT16 and has an energy efficiency of 5.15TOPS/W using VGG-16 at 1GHz clock frequency.
引用
收藏
页码:149 / 153
页数:5
相关论文
共 50 条
  • [21] No such thing as a general-purpose processor: And the belief in such a device is harmful
    Chisnall, David
    [J]. Queue, 2014, 12 (10): : 1 - 6
  • [22] AN ARRAY PROCESSOR FOR GENERAL-PURPOSE DIGITAL IMAGE COMPRESSION
    YATES, RB
    THACKER, NA
    EVANS, SJ
    WALKER, SN
    IVEY, PA
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1995, 30 (03) : 244 - 250
  • [23] Accelerating earthquake simulations on general-purpose graphics processors
    Sengupta, Prasenjit
    Nguyen, Jimmy
    Kwan, Jason
    Menon, Padmanabhan K.
    Heien, Eric M.
    Rundle, John B.
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (17): : 5460 - 5471
  • [24] General-purpose acousto-optic connectionist processor
    Naughton, T
    Javadpour, Z
    Keating, J
    Klíma, M
    Rott, J
    [J]. OPTICAL ENGINEERING, 1999, 38 (07) : 1170 - 1177
  • [25] A GENERAL-PURPOSE CMOS ASSOCIATIVE PROCESSOR IC AND SYSTEM
    STORMON, CD
    TROULLINOS, NB
    SALEH, EM
    CHAVAN, AV
    BRULE, MR
    OLDFIELD, JV
    [J]. IEEE MICRO, 1992, 12 (06) : 68 - 78
  • [26] A General-Purpose and Configurable Planar Data Processor for Energy-Efficient Pooling Computation
    Pan, Lunshuai
    Xue, Peng
    Li, Hongxing
    Sun, Litao
    Huang, Mingqiang
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2022): INTELLIGENT TECHNOLOGY IN THE POST-PANDEMIC ERA, 2022, : 33 - 36
  • [27] THE DESIGN OF A GENERAL-PURPOSE MULTIPLE-PROCESSOR SYSTEM
    OSECKY, BD
    GEORG, DD
    BURY, RJ
    [J]. HEWLETT-PACKARD JOURNAL, 1984, 35 (03): : 34 - 38
  • [28] General-purpose compression for efficient retrieval
    Cannane, A
    Williams, HE
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2001, 52 (05): : 430 - 437
  • [29] AN EFFICIENT GENERAL-PURPOSE PARALLEL COMPUTER
    GALIL, Z
    PAUL, WJ
    [J]. JOURNAL OF THE ACM, 1983, 30 (02) : 360 - 387
  • [30] RNIW: A novel general-purpose DSP architecture
    Qing, H
    Huan, HC
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 3302 - 3305