A Locality-based Threading Algorithm for the Configuration-Interaction Method

Authors
Shan, Hongzhang [1]
Williams, Samuel [1]
Johnson, Calvin [2]
McElvain, Kenneth [3]
Affiliations
[1] Lawrence Berkeley Natl Lab, Computat Res Div, Berkeley, CA 94720 USA
[2] San Diego State Univ, Dept Phys, San Diego, CA 92182 USA
[3] Univ Calif Berkeley, Dept Phys, Berkeley, CA 94720 USA
DOI
10.1109/IPDPSW.2017.15
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The Configuration Interaction (CI) method has been widely used to solve the non-relativistic many-body Schrödinger equation. One great challenge to implementing it efficiently on manycore architectures is its immense memory and data movement requirements. To address this issue, within each node, we exploit a hybrid MPI+OpenMP programming model in lieu of the traditional flat MPI programming model. In this paper, we develop optimizations that partition the workloads among OpenMP threads based on data locality, which is essential in ensuring that applications with complex data access patterns scale well on manycore architectures. The new algorithm scales to 256 threads on the 64-core Intel Knights Landing (KNL) manycore processor and 24 threads on dual-socket Ivy Bridge (Xeon) nodes. Compared with the original implementation, performance improves by up to 7x on the Knights Landing processor and 3x on the dual-socket Ivy Bridge node.
Pages: 1178-1187
Number of pages: 10