Tetris: Re-architecting Convolutional Neural Network Computation for Machine Learning Accelerators

被引:29
|
作者
Lu, Hang [1 ,2 ]
Wei, Xin [2 ]
Lin, Ning [2 ]
Yan, Guihai [1 ,2 ]
Li, Xiao-Wei [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1145/3240765.3240855
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Inference efficiency is the predominant consideration in designing deep learning accelerators. Previous work mainly focuses on skipping zero values to deal with remarkable ineffectual computation, while zero bits in non-zero values, as another major source of ineffectual computation, is often ignored. The reason lies on the difficulty of extracting essential bits during operating multiply-and-accumulate (MAC) in the processing element. Based on the fact that zero bits occupy as high as 68.9% fraction in the overall weights of modern deep convolutional neural network models, this paper firstly proposes a weight kneading technique that could eliminate ineffectual computation caused by either zero value weights or zero bits in non-zero weights, simultaneously. Besides, a split-and-accumulate (SAC) computing pattern in replacement of conventional MAC, as well as the corresponding hardware accelerator design called Tetris are proposed to support weight kneading at the hardware level. Experimental results prove that Tetris could speed up inference up to 1.50x, and improve power efficiency up to 5.33x compared with the state-of-the-art baselines.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Intelligent machine fault diagnosis based on deep transfer convolutional neural network and extreme learning machine
    Cen, Jian
    Chen, Zhihao
    Wu, Yinbo
    Yang, Zhuohong
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2023, 237 (09) : 2201 - 2212
  • [42] Bee Hive Acoustic Monitoring and Processing Using Convolutional Neural Network and Machine Learning
    Sakova, Michaela
    Jurik, Patrik
    Galajda, Pavol
    Sokol, Miroslav
    2024 34TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, RADIOELEKTRONIKA 2024, 2024,
  • [43] Multimodal medical image fusion using convolutional neural network and extreme learning machine
    Kong, Weiwei
    Li, Chi
    Lei, Yang
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [44] Integration of convolutional neural network and conventional machine learning classifiers for landslide susceptibility mapping
    Fang, Zhice
    Wang, Yi
    Peng, Ling
    Hong, Haoyuan
    COMPUTERS & GEOSCIENCES, 2020, 139
  • [45] Machine learning approach to residential valuation: a convolutional neural network model for geographic variation
    Lee, Hojun
    Han, Hoon
    Pettit, Chris
    Gao, Qishuo
    Shi, Vivien
    ANNALS OF REGIONAL SCIENCE, 2024, 72 (02): : 579 - 599
  • [46] A Hybrid Spiking-Convolutional Neural Network Approach for Advancing Machine Learning Models
    Sanaullah
    Roy, Kaushik
    Rueckert, Ulrich
    Jungeblut, Thorsten
    NORTHERN LIGHTS DEEP LEARNING CONFERENCE, VOL 233, 2024, 233 : 220 - 227
  • [47] Machine learning approach to residential valuation: a convolutional neural network model for geographic variation
    Hojun Lee
    Hoon Han
    Chris Pettit
    Qishuo Gao
    Vivien Shi
    The Annals of Regional Science, 2024, 72 : 579 - 599
  • [48] Biomedical event trigger detection with convolutional highway neural network and extreme learning machine
    Shen, Chen
    Lin, Hongfei
    Fan, Xiaochao
    Chu, Yonghe
    Yang, Zhihao
    Wang, Jian
    Zhang, Shaowu
    APPLIED SOFT COMPUTING, 2019, 84
  • [49] Skin lesion classification of dermoscopic images using machine learning and convolutional neural network
    Bhuvaneshwari Shetty
    Roshan Fernandes
    Anisha P. Rodrigues
    Rajeswari Chengoden
    Sweta Bhattacharya
    Kuruva Lakshmanna
    Scientific Reports, 12
  • [50] Research Challenges for Network Function Virtualization - Re-Architecting Middlebox for High Performance and Efficient, Elastic and Resilient Platform to Create New Services
    Shiomoto, Kohei
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2018, E101B (01) : 96 - 122