CANNA: Neural Network Acceleration using Configurable Approximation on GPGPU

Cited by: 0
Authors
Imani, Mohsen [1]
Masich, Max [1]
Peroni, Daniel [1]
Wang, Pushen [1]
Rosing, Tajana [1]
Affiliations
[1] Univ Calif San Diego, CSE Dept, La Jolla, CA 92093 USA
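Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract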
Neural networks have been successfully used in many applications. Due to their computational complexity, it is difficult to implement them on embedded devices. Neural networks are inherently approximate and thus can be simplified. In this paper, we propose CANNA, which uses a gradual training approximation that adaptively sets the level of hardware approximation depending on the neural network's internal error, instead of applying uniform hardware approximation. To accelerate inference, CANNA's layer-based approximation approach selectively relaxes the computation in each layer of the neural network as a function of its sensitivity to approximation. For hardware support, we use a configurable floating-point unit (FPU) that dynamically identifies the inputs which produce the largest approximation error and processes them in precise mode instead. We evaluate the accuracy and efficiency of our design by integrating configurable FPUs into AMD's Southern Islands GPU architecture. Our experimental evaluation shows that CANNA achieves up to 4.84x (7.13x) energy savings and 3.22x (4.64x) speedup when training four different neural network applications with 0% (2%) quality loss, compared to a baseline GPU implementation. During the inference phase, our layer-based approach improves energy efficiency by 4.42x (6.06x) and yields a 2.96x (3.98x) speedup while ensuring 0% (2%) quality loss.
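The abstract does not specify how the configurable FPU approximates a multiplication or how it detects high-error inputs, so the following Python sketch is only an illustration of the general idea, not CANNA's hardware design. It assumes (hypothetically) that approximation means truncating operand mantissas to `keep_bits` bits, and that an operation falls back to precise mode when its relative error exceeds `max_rel_error`. The names `truncate_mantissa`, `configurable_multiply`, `keep_bits`, and `max_rel_error` are invented for this example. A software simulation can afford to compute the exact product to measure the error; real hardware would need a cheaper error predictor.

```python
import math


def truncate_mantissa(x: float, keep_bits: int) -> float:
    """Keep only the top `keep_bits` significand bits of x.

    Assumption: this truncation stands in for the FPU's approximate mode.
    """
    if x == 0.0 or not math.isfinite(x):
        return x
    mantissa, exponent = math.frexp(x)  # x == mantissa * 2**exponent, 0.5 <= |mantissa| < 1
    scale = 1 << keep_bits
    return math.ldexp(math.trunc(mantissa * scale) / scale, exponent)


def configurable_multiply(a: float, b: float, keep_bits: int, max_rel_error: float):
    """Return (product, used_precise_mode).

    The approximate product is used unless its relative error exceeds
    `max_rel_error`, in which case the exact product is returned instead.
    """
    exact = a * b
    approx = truncate_mantissa(a, keep_bits) * truncate_mantissa(b, keep_bits)
    rel_error = abs(approx - exact) / abs(exact) if exact != 0.0 else 0.0
    if rel_error > max_rel_error:
        return exact, True   # high-error input: process in precise mode
    return approx, False     # error is tolerable: keep the approximate result


def approximate_dot(xs, ws, keep_bits: int, max_rel_error: float):
    """Dot product built from configurable multiplies.

    Returns the sum and the number of multiplies that needed precise mode.
    """
    total = 0.0
    precise_count = 0
    for x, w in zip(xs, ws):
        product, used_precise = configurable_multiply(x, w, keep_bits, max_rel_error)
        total += product
        precise_count += int(used_precise)
    return total, precise_count
```

Under these assumptions, a layer that is more sensitive to approximation would be given a larger `keep_bits` or a smaller `max_rel_error`, while a tolerant layer could run with more aggressive settings. This is one possible reading of the layer-based relaxation described in the abstract.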
Pages: 682-689
Number of pages: 8