Strategy and algorithms for the parallel solution of the nearest neighborhood problem in shared-memory processors

被引:0
|
作者
Santiago Tapia-Fernández
Pablo Hiroshi Alonso-Miyazaki
Ignacio Romero
Angel García-Beltrán
机构
[1] Universidad Politécnica de Madrid (UPM),Department of Electrical Engineering
[2] Universidad Politécnica de Madrid (UPM),ETSII
[3] IMDEA Materials Institute,Department of Mechanical Engineering
来源
关键词
Neighborhood problem; Regular grid; Hash table; Mesh-free methods;
D O I
暂无
中图分类号
学科分类号
摘要
The neighborhood problem appears in many applications of computational geometry, computational mechanics, etc. In all these situations, the main requirement for a competitive implementation is performance, which can only be attained in modern hardware by exploiting parallelism. However, whereas the performance of serial algorithms is fairly predictable, that of parallel methods depends on delicate issues that have a huge impact (cache memory, cache misses, memory alignment, etc.), but are not easy to control. Even if there is not a simple approach to deal with these factors in shared-memory architectures, it is quite convenient to program parallel algorithms where the data are segregated on a per-thread basis. With this objective in mind, we propose a strategy to develop parallel algorithms based on a two-level design, and apply it to efficiently solve the nearest neighborhood problem. At a higher level, the proposed methods orchestrate the parallel algorithm and split the space into cells stored in a hash table; at the lower level, our methods hold serial search algorithms that are completely agnostic to the high-level counterpart. Using this strategy, we have developed a library combining different serial and parallel algorithms, optimized them, and assessed their performance. The analysis carried out allows to better understand the main bottlenecks in the algorithmic solution of the nearest neighborhood problem and come out with very fast implementations that improve existing available software.
引用
收藏
页码:1669 / 1679
页数:10
相关论文
共 50 条
  • [41] VECTOR AND PARALLEL ALGORITHMS FOR THE MOLECULAR-DYNAMICS SIMULATION OF MACROMOLECULES ON SHARED-MEMORY COMPUTERS
    MERTZ, JE
    TOBIAS, DJ
    BROOKS, CL
    SINGH, UC
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 1991, 12 (10) : 1270 - 1277
  • [42] Comparative evaluation of parallel genetic algorithms on message passing and software shared-memory clusters
    Tiamkaew, E
    Louis, SJ
    Kongmunvattana, A
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2002, : 443 - 448
  • [43] Shared-Memory Parallel Algorithms for Fully Dynamic Maintenance of 2-Connected Components
    Haryan, Chirayu Anant
    Ramakrishna, G.
    Kothapalli, Kishore
    Banerjee, Dip Sankar
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 1195 - 1205
  • [44] Evaluation of Serial and Parallel Shared-Memory Distance-1 Graph Coloring Algorithms
    Gnam, Lukas
    Selberherr, Siegfried
    Weinbub, Josef
    NUMERICAL METHODS AND APPLICATIONS, NMA 2018, 2019, 11189 : 106 - 114
  • [45] Performance portable parallel programming of heterogeneous stencils across shared-memory platforms with modern Intel processors
    Szustak, Lukasz
    Bratek, Pawel
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2019, 33 (03): : 534 - 553
  • [46] Resolution of an inverse thermal problem using parallel processing on shared-memory multiprocessor architectures
    Ansoni, J. L.
    Brandi, A. C.
    Seleghim, P., Jr.
    INVERSE PROBLEMS IN SCIENCE AND ENGINEERING, 2015, 23 (02) : 351 - 375
  • [47] BASIC PARALLEL ALGORITHMIC TECHNIQUES FOR SHARED-MEMORY MACHINES
    ALBACEA, EA
    AUSTRALIAN COMPUTER JOURNAL, 1995, 27 (02): : 51 - 61
  • [48] Adaptive Space-Shared Scheduling for Shared-Memory Parallel Programs
    Cho, Younghyun
    Oh, Surim
    Egger, Bernhard
    JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, JSSPP 2016, 2017, 10353 : 158 - 177
  • [49] Engineering In-place (Shared-memory) Sorting Algorithms
    Axtmann, Michael
    Witt, Sascha
    Ferizovic, Daniel
    Sanders, Peter
    ACM TRANSACTIONS ON PARALLEL COMPUTING, 2022, 9 (01)
  • [50] VQ compression algorithms on a shared-memory multiprocessor system
    Wakatani, Akiyoshi
    DCC 2006: Data Compression Conference, Proceedings, 2006, : 470 - 470