Architecture scalability of parallel vector computers with a shared memory

Cited by: 2
Author
Dekker, E [1]
Affiliation
[1] Delft Univ Technol, Fac Informat Technol & Syst, NL-2628 CD Delft, Netherlands
Keywords
architecture scalability; parallel vector computers; shared memory; sustainable peak performance; theoretical peak performance
DOI
10.1109/12.677257
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Based on a model of a parallel vector computer with a shared memory, its scalability properties are derived. The processor-memory interconnection network is assumed to be composed of crossbar switches of size b × b. This paper analyzes sustainable peak performance under optimal conditions, i.e., no memory bank conflicts, sufficient processor-memory bank pathways, and no interconnection network conflicts. It will be shown that, with fully vectorizable algorithms and no communication overhead, the sustainable peak performance does not scale up linearly with the number of processors p. If the interconnection network is unbuffered, the number of memory banks must increase at least with O(p log_b p) to sustain peak performance. If the network is buffered, this bottleneck can be alleviated; however, the half performance vector length still increases with O(log_b p). The paper confirms the validity of the model by examining the performance behavior of the LINPACK benchmark.
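The growth terms quoted in the abstract can be tabulated with a short sketch. It is not taken from the paper: the function names are hypothetical, the constant factors are omitted because the abstract does not give them, and the stage count ceil(log_b p) is assumed to be the usual depth of a multistage network built from b × b switches.

```python
def network_stages(p: int, b: int) -> int:
    """Smallest s with b**s >= p, i.e. ceil(log_b p), computed with integers.

    Assumption: a multistage network of b x b crossbar switches needs this
    many stages to connect p processors; the abstract only states the
    O(log_b p) growth, not this exact form.
    """
    if p < 1 or b < 2:
        raise ValueError("need p >= 1 and b >= 2")
    stages, reach = 0, 1
    while reach < p:
        reach *= b
        stages += 1
    return stages


def bank_growth_term(p: int, b: int) -> int:
    """Illustrative p * log_b(p) term for the unbuffered case.

    The constant factor is unknown, so this is a growth trend only,
    not an actual memory bank count.
    """
    return p * network_stages(p, b)


if __name__ == "__main__":
    b = 4  # hypothetical switch size
    for p in (4, 16, 64, 256):
        print(p, network_stages(p, b), bank_growth_term(p, b))
```

Under these assumptions the per-processor term grows by one each time p is multiplied by b, which is why the total grows faster than linearly in p.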
Pages: 614-624
Page count: 11