Tetris: Re-architecting Convolutional Neural Network Computation for Machine Learning Accelerators

被引:29
|
作者
Lu, Hang [1 ,2 ]
Wei, Xin [2 ]
Lin, Ning [2 ]
Yan, Guihai [1 ,2 ]
Li, Xiao-Wei [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1145/3240765.3240855
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Inference efficiency is the predominant consideration in designing deep learning accelerators. Previous work mainly focuses on skipping zero values to deal with remarkable ineffectual computation, while zero bits in non-zero values, as another major source of ineffectual computation, is often ignored. The reason lies on the difficulty of extracting essential bits during operating multiply-and-accumulate (MAC) in the processing element. Based on the fact that zero bits occupy as high as 68.9% fraction in the overall weights of modern deep convolutional neural network models, this paper firstly proposes a weight kneading technique that could eliminate ineffectual computation caused by either zero value weights or zero bits in non-zero weights, simultaneously. Besides, a split-and-accumulate (SAC) computing pattern in replacement of conventional MAC, as well as the corresponding hardware accelerator design called Tetris are proposed to support weight kneading at the hardware level. Experimental results prove that Tetris could speed up inference up to 1.50x, and improve power efficiency up to 5.33x compared with the state-of-the-art baselines.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Architecting Effectual Computation for Machine Learning Accelerators
    Lu, Hang
    Zhang, Mingzhe
    Han, Yinhe
    Wang, Qi
    Li, Huawei
    Li, Xiaowei
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (10) : 2654 - 2667
  • [2] Re-architecting Traffic Analysis with Neural Network Interface Cards
    Siracusano, Giuseppe
    Galea, Salvator
    Sanvito, Davide
    Malekzadeh, Mohammad
    Antichi, Gianni
    Costa, Paolo
    Haddadi, Hamed
    Bifulco, Roberto
    PROCEEDINGS OF THE 19TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION (NSDI '22), 2022, : 513 - 533
  • [3] OmniMon: Re-architecting Network Telemetry with Resource Efficiency and Full Accuracy
    Huang, Qun
    Sun, Haifeng
    Lee, Patrick P. C.
    Bai, Wei
    Zhu, Feng
    Bao, Yungang
    SIGCOMM '20: PROCEEDINGS OF THE 2020 ANNUAL CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION ON THE APPLICATIONS, TECHNOLOGIES, ARCHITECTURES, AND PROTOCOLS FOR COMPUTER COMMUNICATION, 2020, : 404 - +
  • [4] Re-architecting the On-chip memory Sub-system of Machine-Learning Accelerator for Embedded Devices
    Wang, Ying
    Li, Huawei
    Li, Xiaowei
    2016 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2016,
  • [5] Refresh Triggered Computation: Improving the Energy Efficiency of Convolutional Neural Network Accelerators
    Jafri, Syed M. A. H.
    Hassan, Hasan
    Hemani, Ahmed
    Mutlu, Onur
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2021, 18 (01)
  • [6] Preventing Neural Network Model Exfiltration in Machine Learning Hardware Accelerators
    Isakov, Mihailo
    Bu, Lake
    Cheng, Hai
    Kinsy, Michel A.
    PROCEEDINGS OF THE 2018 ASIAN HARDWARE ORIENTED SECURITY AND TRUST SYMPOSIUM (ASIANHOST), 2018, : 62 - 67
  • [7] Semantic Representation Learning of Convolutional Neural Network Based on Tensor Computation
    Yang L.-J.
    Wang J.-Q.
    Jing L.-P.
    Yu J.
    Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (03): : 568 - 578
  • [8] Scene Classification with Simple Machine Learning and Convolutional Neural Network
    Yosboon, Simon
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 616 - 619
  • [9] ADVERSARIAL MACHINE LEARNING USING CONVOLUTIONAL NEURAL NETWORK WITH IMAGENET
    Khakurel, Utsab
    Rawat, Danda B.
    PROCEEDINGS OF THE 2022 ANNUAL MODELING AND SIMULATION CONFERENCE (ANNSIM'22), 2022, : 246 - 257
  • [10] Machine learning with a convolutional neural network for segmentation of ophthalmological images
    Biswas, Hridoy
    Umbaugh, Scott E.
    COMPUTATIONAL IMAGING VI, 2021, 11731