Optimizing a conjugate gradient solver with non-blocking collective operations

Cited by: 0
Authors
Hoefler, Torsten [1]
Gottschling, Peter
Rehm, Wolfgang
Lumsdaine, Andrew
Affiliations
[1] Indiana Univ, Open Syst Lab, Bloomington, IN 47404 USA
[2] Tech Univ Chemnitz, Dept Comp Sci, D-09107 Chemnitz, Germany
Keywords
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
This paper presents a case study on the applicability and usage of non-blocking collective operations. These operations make it possible to overlap communication with computation and to avoid unnecessary synchronization. We introduce our NBC library, a portable, low-overhead implementation of non-blocking collectives on top of MPI-1. We demonstrate how easy the NBC library is to use by optimizing a conjugate gradient solver with only minor changes to the traditional parallel implementation of the program. The optimized solver runs up to 34% faster and is able to overlap most of the communication. We show that, because of the overlap, there is no performance difference between Gigabit Ethernet and InfiniBand (TM) for our calculation.
Pages: 374-382
Page count: 9