Parallel-vector algorithms for particle simulations on shared-memory multiprocessors

被引:51
|
作者
Nishiura, Daisuke [1 ]
Sakaguchi, Hide [1 ]
机构
[1] Japan Agcy Marine Earth Sci & Technol, Inst Res Earth Evolut, Kanagawa 2360001, Japan
关键词
Discrete element method; GPO computing; Vectorization; Parallelization; High performance computing; MOLECULAR-DYNAMICS SIMULATIONS; DISCRETE; DEM; SYSTEMS; MODEL; FLOW; SPH;
D O I
10.1016/j.jcp.2010.11.040
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Over the last few decades, the computational demands of massive particle-based simulations for both scientific and industrial purposes have been continuously increasing. Hence, considerable efforts are being made to develop parallel computing techniques on various platforms. In such simulations, particles freely move within a given space, and so on a distributed-memory system, load balancing, i.e., assigning an equal number of particles to each processor, is not guaranteed. However, shared-memory systems achieve better load balancing for particle models, but suffer from the intrinsic drawback of memory access competition, particularly during (1) paring of contact candidates from among neighboring particles and (2) force summation for each particle. Here, novel algorithms are proposed to overcome these two problems. For the first problem, the key is a pre-conditioning process during which particle labels are sorted by a cell label in the domain to which the particles belong. Then, a list of contact candidates is constructed by pairing the sorted particle labels. For the latter problem, a table comprising the list indexes of the contact candidate pairs is created and used to sum the contact forces acting on each particle for all contacts according to Newton's third law. With just these methods, memory access competition is avoided without additional redundant procedures. The parallel efficiency and compatibility of these two algorithms were evaluated in discrete element method (DEM) simulations on four types of shared-memory parallel computers: a multicore multiprocessor computer, scalar supercomputer, vector supercomputer, and graphics processing unit. The computational efficiency of a DEM code was found to be drastically improved with our algorithms on all but the scalar supercomputer. Thus, the developed parallel algorithms are useful on shared-memory parallel computers with sufficient memory bandwidth. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:1923 / 1938
页数:16
相关论文
共 50 条
  • [1] PARALLEL ALGORITHMS FOR STRESS-ANALYSIS ON SHARED-MEMORY MULTIPROCESSORS
    ADELI, H
    KAMAL, O
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1992, 591 : 426 - 437
  • [2] SYNCHRONIZATION ALGORITHMS FOR SHARED-MEMORY MULTIPROCESSORS
    GRAUNKE, G
    THAKKAR, S
    [J]. COMPUTER, 1990, 23 (06) : 60 - 69
  • [3] A PARALLEL LINKED LIST FOR SHARED-MEMORY MULTIPROCESSORS
    TANG, PY
    YEW, PC
    ZHU, CQ
    [J]. PROCEEDINGS : THE THIRTEENTH ANNUAL INTERNATIONAL COMPUTER SOFTWARE & APPLICATIONS CONFERENCE, 1989, : 130 - 135
  • [4] ALGORITHMS FOR SCALABLE SYNCHRONIZATION ON SHARED-MEMORY MULTIPROCESSORS
    MELLORCRUMMEY, JM
    SCOTT, ML
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1991, 9 (01): : 21 - 65
  • [5] Design and analysis of algorithms for shared-memory multiprocessors
    Leiserson, CE
    [J]. ALGORITHMS AND DATA STRUCTURES, 1999, 1663 : 55 - 55
  • [6] Parallel Execution of Prolog on Shared-Memory Multiprocessors
    高耀清
    王鼎兴
    郑纬民
    沈美明
    黄志毅
    胡守仁
    Giorgio Levi
    [J]. Journal of Computer Science & Technology, 1993, (04) : 329 - 336
  • [7] Parallel classification for data mining on shared-memory multiprocessors
    Zaki, MJ
    Ho, CT
    Agrawal, R
    [J]. 15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 198 - 205
  • [8] Parallel intersection counting on shared-memory multiprocessors and GPUs
    Marzolla, Moreno
    Birolo, Giovanni
    D'Angelo, Gabriele
    Fariselli, Piero
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 159 : 423 - 431
  • [9] Parallel Data Distribution Management on Shared-memory Multiprocessors
    Marzolla, Moreno
    D'angelo, Gabriele
    [J]. ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2020, 30 (01):
  • [10] AND OR PARALLELISM ON SHARED-MEMORY MULTIPROCESSORS
    GUPTA, G
    JAYARAMAN, B
    [J]. JOURNAL OF LOGIC PROGRAMMING, 1993, 17 (01): : 59 - 89