Leveraging Non-Blocking Collective Communication in High-Performance Applications

被引:0
|
作者
Hoefler, Torsten [1 ]
Gottschling, Peter [1 ]
Lumsdain, Andrew [1 ]
机构
[1] Indiana Univ, Open Syst Lab, Bloomington, IN 47404 USA
关键词
non-blocking collectives; overlap; MPI;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Although overlapping communication with computation is an important mechanism for achieving high performance in parallel programs, developing, applications that actually achieve good overlap can be difficult. Existing approaches are typically based on manual or compiler-based transformations. This paper presents a pattern and library-based approach to optimizing collective communication in parallel high-performance applications, based on using non-blocking collective operations to enable overlapping of communication and computation. Common communication and computation patterns in iterative SPMD computations are used to motivate the transformations we present. Our approach provides the programmer with the capability to separately optimize communication and computation in an application, while automating the interaction between computation and communication to achieve maximum overlap. Performance results with a model application show more than a 90% decrease in communication overhead, resulting in 21% overall performance improvements.
引用
收藏
页码:113 / 115
页数:3
相关论文
共 50 条
  • [31] Maximizing Communication-Computation Overlap Through Automatic Parallelization and Run-time Tuning of Non-blocking Collective Operations
    Barigou, Youcef
    Gabriel, Edgar
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (06) : 1390 - 1416
  • [32] Modeling of a compact and completely non-blocking 4 x 4 multimode interference switch for optical communication applications
    Singh, Ghanshyam
    Goyal, Abhishek
    Yadav, Rajendra Prasad
    Janyani, Vijay
    OPTICAL ENGINEERING, 2012, 51 (05)
  • [33] Optimization of the scheduler for the non-blocking high-capacity router
    Petrovic, Milos
    Smiljanic, Aleksandra
    IEEE COMMUNICATIONS LETTERS, 2007, 11 (06) : 534 - 536
  • [34] Using Non-blocking I/O Operations in High Performance Computing to Reduce Execution Times
    Buettner, David
    Kunkel, Julian
    Ludwig, Thomas
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2009, 5759 : 134 - 142
  • [35] An optimized non-blocking SOA switch architecture for high performance Tb/s network interconnects
    Aw, Et
    Wonfor, A.
    Glick, M.
    Penty, Rv
    White, Ih
    2007 PHOTONICS IN SWITCHING, 2007, : 15 - +
  • [36] High-Performance Hybrid Key-Value Store on Modern Clusters with RDMA Interconnects and SSDs: Non-blocking Extensions, Designs, and Benefits
    Shankar, Dipti
    Lu, Xiaoyi
    Islam, Nusrat
    Wasi-ur-Rahman, Md.
    Panda, Dhabaleswar K.
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 393 - 402
  • [37] A non-blocking buffer mechanism for real-time event message communication
    Kim, KHK
    REAL-TIME SYSTEMS, 2006, 32 (03) : 197 - 211
  • [38] A Non-Blocking Buffer Mechanism for Real-Time Event Message Communication
    K. H. (Kane) Kim
    Real-Time Systems, 2006, 32 : 197 - 211
  • [39] Architecture and performance of non-blocking ATM switches with shared internal queueing
    Bianchi, G
    Pattavina, A
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1996, 28 (06): : 835 - 853
  • [40] Optimized designs of low loss non-blocking optical router for ONoC applications
    Chaudhari B.S.
    Patil S.S.
    International Journal of Information Technology, 2020, 12 (1) : 91 - 96