TENG: A General-Purpose and Efficient Processor Architecture for Accelerating DNN

被引:0
|
作者
Zhang, Zekun [1 ,2 ]
Cai, Yujie [1 ]
Liao, Tianjiao [2 ]
Xu, Chengyu [2 ]
Jiao, Xin [2 ]
机构
[1] Fudan Univ, State Key Lab ASIC & Syst, Shanghai, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
关键词
Neural network; deep learning accelerator; hardware accelerator; DNN; ASIC; PERFORMANCE;
D O I
10.1109/AICAS59952.2024.10595854
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has been widely deployed in the fields such as computer vision and speech, etc. However, with the development of deep learning algorithms, neural networks have gradually become more complex, the subsequent computing requirements for huge amounts of data have posed greater challenges to hardware updates and improvements. In this work, a general-purpose and efficient deep neural network (DNN) accelerator is proposed, named TENG. In TENG, we proposed a unified architecture with general and dedicated coexistence to support various network operators, and the memory access engine in TENG optimizes off-chip data transmission to achieve better bandwidth utilization. The correctness of TENG is verified on FPGA and implemented in 7-nm FinFET technology. It achieves 4TOPS peak throughput@INT16 and has an energy efficiency of 5.15TOPS/W using VGG-16 at 1GHz clock frequency.
引用
收藏
页码:149 / 153
页数:5
相关论文
共 50 条
  • [31] VAXSTATION - A GENERAL-PURPOSE RASTER GRAPHICS ARCHITECTURE
    LEVY, HM
    [J]. ACM TRANSACTIONS ON GRAPHICS, 1984, 3 (01): : 70 - 83
  • [32] THE ARCHITECTURE OF NEWTON, A GENERAL-PURPOSE DYNAMICS SIMULATOR
    CREMER, JF
    STEWART, AJ
    [J]. PROCEEDINGS - 1989 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOL 1-3, 1989, : 1806 - 1811
  • [33] Digital Transversal Filter as General-Purpose Signal Processor.
    Klaas, Lothar
    [J]. Elektronik Munchen, 1980, 29 (20): : 61 - 66
  • [34] General-purpose programmable photonic processor for advanced radiofrequency applications
    Perez-Lopez, Daniel
    Gutierrez, Ana
    Sanchez, David
    Lopez-Hernandez, Aitor
    Gutierrez, Mikel
    Sanchez-Gomariz, Erica
    Fernandez, Juan
    Cruz, Alejandro
    Quiros, Alberto
    Xie, Zhenyun
    Benitez, Jesus
    Bekesi, Nandor
    Santome, Alejandro
    Perez-Galacho, Diego
    DasMahapatra, Prometheus
    Macho, Andres
    Capmany, Jose
    [J]. NATURE COMMUNICATIONS, 2024, 15 (01)
  • [36] Adaptive speed control of a general-purpose processor based on activities
    Furuichi, S
    Aihara, T
    [J]. IEICE TRANSACTIONS ON ELECTRONICS, 1998, E81C (09) : 1481 - 1483
  • [37] General-purpose programmable photonic processor for advanced radiofrequency applications
    Daniel Pérez-López
    Ana Gutierrez
    David Sánchez
    Aitor López-Hernández
    Mikel Gutierrez
    Erica Sánchez-Gomáriz
    Juan Fernández
    Alejandro Cruz
    Alberto Quirós
    Zhenyun Xie
    Jesús Benitez
    Nandor Bekesi
    Alejandro Santomé
    Diego Pérez-Galacho
    Prometheus DasMahapatra
    Andrés Macho
    José Capmany
    [J]. Nature Communications, 15
  • [38] EFFICIENT GENERAL-PURPOSE PARALLEL COMPUTER.
    Galil, Zvi
    Paul, Wolfang J.
    [J]. Journal of the ACM, 1983, 30 (02): : 360 - 387
  • [39] A General-Purpose Transferable Predictor for Neural Architecture Search
    Han, Fred X.
    Mills, Keith G.
    Chudak, Fabian
    Riahi, Parsa
    Salameh, Mohammad
    Zhang, Jialin
    Lul, Wei
    Jui, Shangling
    Niu, Di
    [J]. PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 721 - 729
  • [40] A GENERAL-PURPOSE DIGITAL ARCHITECTURE FOR NEURAL NETWORK SIMULATIONS
    DURANTON, M
    MAUDUIT, N
    [J]. FIRST IEE INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS, 1989, : 62 - 66