Scalar Processing Overhead on SIMD-Only Architectures

被引：0

作者：

Azevedo, Arnaldo ^{[1
]}

Juurlink, Ben ^{[1
]}

机构：

[1] Delft Univ Technol, Fac Elect Engn Math & Comp Sci, Comp Engn Grp, Delft, Netherlands

来源：

2009 20TH IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS | 2009年

关键词：

Computer architecture; Datapath; SIMD processing; SIMD overhead;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The Cell processor consists of a general-purpose core and eight cores with a complete SIMD instruction set. Although originally designed for multimedia and gaming, it is currently being used for a much broader range of applications. In this paper we evaluate if the Cell SPEs could benefit significantly from a scalar processing unit using two methodologies. In the first methodology the scalar processing overhead is eliminated by replacing all scalar data types by the quadword data type. This methodology is feasible only for relatively small kernels. In the second methodology SPE performance is compared to the performance of a similarly configured PPU, which supports scalar operations. Experimental results show that the scalar processing overhead ranges from 19% to 57% for small kernels and from 12% to 39% for large kernels. Solutions to eliminate this overhead are also discussed.

引用

页码：183 / 190

页数：8

共 50 条

[1] Avoiding Conversion and Rearrangement Overhead in SIMD Architectures
Asadollah Shahbahrami
Ben Juurlink
Demid Borodin
Stamatis Vassiliadis
International Journal of Parallel Programming, 2006, 34 : 237 - 260
[2] Avoiding conversion and rearrangement overhead in SIMD architectures
Shahbahrami, Asadollah
Juurlink, Ben
Borodin, Demid
Vassiliadis, Stamatis
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2006, 34 (03) : 237 - 260
[3] Redefining the Relationship between Scalar and Parallel Units in SIMD Architectures
Wang, Yaohua
Chen, Shuming
Wan, Jianghua
Zhang, Kai
2013 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2013, : 781 - 784
[4] SIMD architectures and algorithms for image processing and computer vision
Cypher, Robert, 1600, (37):
[5] SIMD ARCHITECTURES AND ALGORITHMS FOR IMAGE-PROCESSING AND COMPUTER VISION
CYPHER, R
SANZ, JLC
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (12): : 2158 - 2174
[6] B-SCT: Improve SpMV processing on SIMD architectures
Wang, Yaohua
Wang, Dong
Zhou, Xu
IEICE ELECTRONICS EXPRESS, 2015, 12 (11):
[7] Determining optimal grain size for efficient vector processing on SIMD image processing architectures
Kim, J
Wills, DS
Wills, LM
ADVANCES IN COMPUTER SYSTEMS ARCHITECTURE, PROCEEDINGS, 2005, 3740 : 551 - 565
[8] The impact of grain size on the efficiency of embedded SIMD image processing architectures
Gentile, A
Sander, S
Wills, L
Wills, S
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2004, 64 (11) : 1318 - 1327
[9] Dual-Core Framework: Eliminating the Bottleneck Effect of Scalar Kernels on SIMD Architectures
Wang, Yaohua
Chen, Shuming
Chen, Hu
Wan, Jianghua
Zhang, Kai
Liu, Sheng
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (02): : 365 - 369
[10] A novel methodology for the design of processing elements in embedded SIMD architectures for multimedia
Gentile, A
Wills, DS
Sorbello, F
PDPTA'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, 2001, : 437 - 443

← 1 2 3 4 5 →