CANNA: Neural Network Acceleration using Configurable Approximation on GPGPU

被引：0

作者：

Imani, Mohsen ^{[1
]}

Masich, Max ^{[1
]}

Peroni, Daniel ^{[1
]}

Wang, Pushen ^{[1
]}

Rosing, Tajana ^{[1
]}

机构：

[1] Univ Calif San Diego, CSE Dept, La Jolla, CA 92093 USA

来源：

2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC) | 2018年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Neural networks have been successfully used in many applications. Due to their computational complexityit is difficult to implement them on embedded devices. Neural networks are inherently approximate and thus can be simplified. In this paper, CANNA proposes a gradual training approximation which adaptively sets the level of hardware approximation depending on the neural network's internal error, instead of apply uniform hardware approximation. To accelerate inference, CANNA's layer-based approximation approach selectively relaxes the computation in each layer of neural network, as a function its sensitivity to approximation. For hardware support, we use a configurable floating point unit in Hardware that dynamically identifies inputs which produce the largest approximation error and process them instead in precise mode. We evaluate the accuracy and efficiency of our design by integrating configurable FPUs into AMD's Southern Island GPU architecture. Our experimental evaluation shows that CANNA achieves up to 4.84x (7.13x) energy savings and 3.22x (4.64x) speedup when training four different neural network applications with 0% (2%) quality loss as compared to the implementation on baseline GPU. During the inference phase, our layer-based approach improves the energy efficiency by 4.42x (6.06x) and results in 2.96x (3.98x) speedup while ensuring 0% (2%) quality loss.

引用

页码：682 / 689

页数：8

共 50 条

[1] Acceleration of Neural Network Learning by GPGPU
Tsuchida, Yuta
Yoshioka, Michifumi
[J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2013, 96 (08) : 59 - 66
[2] Efficient Neural Network Acceleration on GPGPU using Content Addressable Memory
Imani, Mohsen
Peroni, Daniel
Kim, Yeseong
Rahimi, Abbas
Rosing, Tajana
[J]. PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 1026 - 1031
[3] Neural network application using GPGPU
Tsuchida, Y.
Yoshioka, M.
[J]. PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 16TH '11), 2011, : 870 - 872
[4] CONNA: Configurable Matrix Multiplication Engine for Neural Network Acceleration
Park, Sang-Soo
Chung, Ki-Seok
[J]. ELECTRONICS, 2022, 11 (15)
[5] Compression acceleration using GPGPU
Shastry, Krishnaprasad
Pandey, Avinash
Agrawal, Ashutosh
Sarveswara, Ravi
[J]. 2016 23RD IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING WORKSHOPS (HIPCW 2016), 2016, : 70 - 78
[6] Speed Up Method for Neural Network Learning by Using GPGPU
Tsuchida, Yuta
Yoshioka, Michifumi
[J]. 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 193 - 196
[7] Speed-Up Method for Neural Network Learning Using GPGPU
Tsuchida, Yuta
Yoshioka, Michifumi
Omatu, Sigeru
[J]. DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2012, 151 : 73 - +
[8] Acceleration of Game Tree Search Using GPGPU
Mahale, Kajal
Kanaskar, Shital
Kapadnis, Prachi
Desale, Madhuri
Walunj, S. M.
[J]. 2015 INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT), 2015, : 550 - 553
[9] Acceleration of discrete stochastic biochemical simulation using GPGPU
Sumiyoshi, Kei
Hirata, Kazuki
Hiroi, Noriko
Funahashi, Akira
[J]. FRONTIERS IN PHYSIOLOGY, 2015, 6
[10] Approximation of the disturbing gravity using neural network
Wang, Ji-Ping
Wang, Ming-Hai
Zhang, Zhi-Hui
[J]. Yuhang Xuebao/Journal of Astronautics, 2008, 29 (01): : 385 - 390

← 1 2 3 4 5 →