Strategy and algorithms for the parallel solution of the nearest neighborhood problem in shared-memory processors

被引:0
|
作者
Santiago Tapia-Fernández
Pablo Hiroshi Alonso-Miyazaki
Ignacio Romero
Angel García-Beltrán
机构
[1] Universidad Politécnica de Madrid (UPM),Department of Electrical Engineering
[2] Universidad Politécnica de Madrid (UPM),ETSII
[3] IMDEA Materials Institute,Department of Mechanical Engineering
来源
关键词
Neighborhood problem; Regular grid; Hash table; Mesh-free methods;
D O I
暂无
中图分类号
学科分类号
摘要
The neighborhood problem appears in many applications of computational geometry, computational mechanics, etc. In all these situations, the main requirement for a competitive implementation is performance, which can only be attained in modern hardware by exploiting parallelism. However, whereas the performance of serial algorithms is fairly predictable, that of parallel methods depends on delicate issues that have a huge impact (cache memory, cache misses, memory alignment, etc.), but are not easy to control. Even if there is not a simple approach to deal with these factors in shared-memory architectures, it is quite convenient to program parallel algorithms where the data are segregated on a per-thread basis. With this objective in mind, we propose a strategy to develop parallel algorithms based on a two-level design, and apply it to efficiently solve the nearest neighborhood problem. At a higher level, the proposed methods orchestrate the parallel algorithm and split the space into cells stored in a hash table; at the lower level, our methods hold serial search algorithms that are completely agnostic to the high-level counterpart. Using this strategy, we have developed a library combining different serial and parallel algorithms, optimized them, and assessed their performance. The analysis carried out allows to better understand the main bottlenecks in the algorithmic solution of the nearest neighborhood problem and come out with very fast implementations that improve existing available software.
引用
收藏
页码:1669 / 1679
页数:10
相关论文
共 50 条
  • [31] SHARED-MEMORY PARALLEL PROGRAMMING IN C++
    BECK, B
    IEEE SOFTWARE, 1990, 7 (04) : 38 - 48
  • [32] Truss Decomposition on Shared-Memory Parallel Systems
    Smith, Shaden
    Liu, Xing
    Ahmed, Nesreen K.
    Tom, Ancy Sarah
    Petrini, Fabrizio
    Karypis, George
    2017 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2017,
  • [33] Shared-Memory Parallel Maximal Biclique Enumeration
    Das, Apurba
    Tirthapura, Srikanta
    2019 IEEE 26TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC), 2019, : 34 - 43
  • [34] Parallel Execution of Prolog on Shared-Memory Multiprocessors
    高耀清
    王鼎兴
    郑纬民
    沈美明
    黄志毅
    胡守仁
    Giorgio Levi
    JournalofComputerScienceandTechnology, 1993, (04) : 329 - 336
  • [35] PARALLEL CHOLESKY METHOD ON MIMD WITH SHARED-MEMORY
    ZHENG, D
    CHANG, TYP
    COMPUTERS & STRUCTURES, 1995, 56 (01) : 25 - 38
  • [36] Parallel sequence mining on shared-memory machines
    Zaki, MJ
    LARGE-SCALE PARALLEL DATA MINING, 2000, 1759 : 161 - 189
  • [37] Parallel sequence mining on shared-memory machines
    Zaki, MJ
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2001, 61 (03) : 401 - 426
  • [38] TECHNIQUES FOR EFFICIENT SHARED-MEMORY PARALLEL SIMULATION
    WAGNER, DB
    LAZOWSKA, ED
    BERSHAD, BN
    DISTRIBUTED SIMULATION, 1989, 1989, 21 : 29 - 37
  • [39] The interaction of software prefetching with ILP processors in shared-memory systems
    Ranganathan, P
    Pai, VS
    AbdelShafi, H
    Adve, SV
    24TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, CONFERENCE PROCEEDINGS, 1997, : 144 - 156
  • [40] ITERATIVE SCHEMES FOR PARALLEL S-N ALGORITHMS IN A SHARED-MEMORY COMPUTING ENVIRONMENT
    HAGHIGHAT, A
    HUNTER, MA
    MATTIS, RE
    NUCLEAR SCIENCE AND ENGINEERING, 1995, 121 (01) : 103 - 113