Scalar Processing Overhead on SIMD-Only Architectures

被引:0
|
作者
Azevedo, Arnaldo [1 ]
Juurlink, Ben [1 ]
机构
[1] Delft Univ Technol, Fac Elect Engn Math & Comp Sci, Comp Engn Grp, Delft, Netherlands
关键词
Computer architecture; Datapath; SIMD processing; SIMD overhead;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The Cell processor consists of a general-purpose core and eight cores with a complete SIMD instruction set. Although originally designed for multimedia and gaming, it is currently being used for a much broader range of applications. In this paper we evaluate if the Cell SPEs could benefit significantly from a scalar processing unit using two methodologies. In the first methodology the scalar processing overhead is eliminated by replacing all scalar data types by the quadword data type. This methodology is feasible only for relatively small kernels. In the second methodology SPE performance is compared to the performance of a similarly configured PPU, which supports scalar operations. Experimental results show that the scalar processing overhead ranges from 19% to 57% for small kernels and from 12% to 39% for large kernels. Solutions to eliminate this overhead are also discussed.
引用
收藏
页码:183 / 190
页数:8
相关论文
共 50 条
  • [1] Avoiding Conversion and Rearrangement Overhead in SIMD Architectures
    Asadollah Shahbahrami
    Ben Juurlink
    Demid Borodin
    Stamatis Vassiliadis
    International Journal of Parallel Programming, 2006, 34 : 237 - 260
  • [2] Avoiding conversion and rearrangement overhead in SIMD architectures
    Shahbahrami, Asadollah
    Juurlink, Ben
    Borodin, Demid
    Vassiliadis, Stamatis
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2006, 34 (03) : 237 - 260
  • [3] Redefining the Relationship between Scalar and Parallel Units in SIMD Architectures
    Wang, Yaohua
    Chen, Shuming
    Wan, Jianghua
    Zhang, Kai
    2013 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2013, : 781 - 784
  • [5] SIMD ARCHITECTURES AND ALGORITHMS FOR IMAGE-PROCESSING AND COMPUTER VISION
    CYPHER, R
    SANZ, JLC
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (12): : 2158 - 2174
  • [6] B-SCT: Improve SpMV processing on SIMD architectures
    Wang, Yaohua
    Wang, Dong
    Zhou, Xu
    IEICE ELECTRONICS EXPRESS, 2015, 12 (11):
  • [7] Determining optimal grain size for efficient vector processing on SIMD image processing architectures
    Kim, J
    Wills, DS
    Wills, LM
    ADVANCES IN COMPUTER SYSTEMS ARCHITECTURE, PROCEEDINGS, 2005, 3740 : 551 - 565
  • [8] The impact of grain size on the efficiency of embedded SIMD image processing architectures
    Gentile, A
    Sander, S
    Wills, L
    Wills, S
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2004, 64 (11) : 1318 - 1327
  • [9] Dual-Core Framework: Eliminating the Bottleneck Effect of Scalar Kernels on SIMD Architectures
    Wang, Yaohua
    Chen, Shuming
    Chen, Hu
    Wan, Jianghua
    Zhang, Kai
    Liu, Sheng
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (02): : 365 - 369
  • [10] A novel methodology for the design of processing elements in embedded SIMD architectures for multimedia
    Gentile, A
    Wills, DS
    Sorbello, F
    PDPTA'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, 2001, : 437 - 443