CANNA: Neural Network Acceleration using Configurable Approximation on GPGPU

被引:0
|
作者
Imani, Mohsen [1 ]
Masich, Max [1 ]
Peroni, Daniel [1 ]
Wang, Pushen [1 ]
Rosing, Tajana [1 ]
机构
[1] Univ Calif San Diego, CSE Dept, La Jolla, CA 92093 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Neural networks have been successfully used in many applications. Due to their computational complexityit is difficult to implement them on embedded devices. Neural networks are inherently approximate and thus can be simplified. In this paper, CANNA proposes a gradual training approximation which adaptively sets the level of hardware approximation depending on the neural network's internal error, instead of apply uniform hardware approximation. To accelerate inference, CANNA's layer-based approximation approach selectively relaxes the computation in each layer of neural network, as a function its sensitivity to approximation. For hardware support, we use a configurable floating point unit in Hardware that dynamically identifies inputs which produce the largest approximation error and process them instead in precise mode. We evaluate the accuracy and efficiency of our design by integrating configurable FPUs into AMD's Southern Island GPU architecture. Our experimental evaluation shows that CANNA achieves up to 4.84x (7.13x) energy savings and 3.22x (4.64x) speedup when training four different neural network applications with 0% (2%) quality loss as compared to the implementation on baseline GPU. During the inference phase, our layer-based approach improves the energy efficiency by 4.42x (6.06x) and results in 2.96x (3.98x) speedup while ensuring 0% (2%) quality loss.
引用
收藏
页码:682 / 689
页数:8
相关论文
共 50 条
  • [1] Acceleration of Neural Network Learning by GPGPU
    Tsuchida, Yuta
    Yoshioka, Michifumi
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2013, 96 (08) : 59 - 66
  • [2] Efficient Neural Network Acceleration on GPGPU using Content Addressable Memory
    Imani, Mohsen
    Peroni, Daniel
    Kim, Yeseong
    Rahimi, Abbas
    Rosing, Tajana
    [J]. PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 1026 - 1031
  • [3] Neural network application using GPGPU
    Tsuchida, Y.
    Yoshioka, M.
    [J]. PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 16TH '11), 2011, : 870 - 872
  • [4] CONNA: Configurable Matrix Multiplication Engine for Neural Network Acceleration
    Park, Sang-Soo
    Chung, Ki-Seok
    [J]. ELECTRONICS, 2022, 11 (15)
  • [5] Compression acceleration using GPGPU
    Shastry, Krishnaprasad
    Pandey, Avinash
    Agrawal, Ashutosh
    Sarveswara, Ravi
    [J]. 2016 23RD IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING WORKSHOPS (HIPCW 2016), 2016, : 70 - 78
  • [6] Speed Up Method for Neural Network Learning by Using GPGPU
    Tsuchida, Yuta
    Yoshioka, Michifumi
    [J]. 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 193 - 196
  • [7] Speed-Up Method for Neural Network Learning Using GPGPU
    Tsuchida, Yuta
    Yoshioka, Michifumi
    Omatu, Sigeru
    [J]. DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2012, 151 : 73 - +
  • [8] Acceleration of Game Tree Search Using GPGPU
    Mahale, Kajal
    Kanaskar, Shital
    Kapadnis, Prachi
    Desale, Madhuri
    Walunj, S. M.
    [J]. 2015 INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT), 2015, : 550 - 553
  • [9] Acceleration of discrete stochastic biochemical simulation using GPGPU
    Sumiyoshi, Kei
    Hirata, Kazuki
    Hiroi, Noriko
    Funahashi, Akira
    [J]. FRONTIERS IN PHYSIOLOGY, 2015, 6
  • [10] Approximation of the disturbing gravity using neural network
    Wang, Ji-Ping
    Wang, Ming-Hai
    Zhang, Zhi-Hui
    [J]. Yuhang Xuebao/Journal of Astronautics, 2008, 29 (01): : 385 - 390