Scalable Communication Endpoints for MPI plus Threads Applications

被引:0
|
作者
Zambre, Rohit [1 ]
Chandramowlishwaran, Aparna [1 ]
Balaji, Pavan [2 ]
机构
[1] Univ Calif Irvine, EECS, Irvine, CA 92697 USA
[2] ANL, MCS, Lemont, IL USA
关键词
multiple endpoints; hybrid MPI; multithreading; InfiniBand; scalable endpoints;
D O I
10.1109/ICPADS.2018.00109
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Hybrid MPI+threads programming is gaining prominence as an alternative to the traditional "MPI everywhere" model to better handle the disproportionate increase in the number of cores compared with other on-node resources. Current implementations of these two models represent the two extreme cases of communication resource sharing in modern MPI implementations. In the MPI-everywhere model, each MPI process has a dedicated set of communication resources (also known as endpoints), which is ideal for performance but is resource wasteful. With MPI+threads, current MPI implementations share a single communication endpoint for all threads, which is ideal for resource usage but is hurtful for performance. In this paper, we explore the tradeoff space between performance and communication resource usage in MPI+threads environments. We first demonstrate the two extreme cases one where all threads share a single communication endpoint and another where each thread gets its own dedicated communication endpoint (similar to the MPI-everywhere model) and showcase the inefficiencies in both these cases. Next, we perform a thorough analysis of the different levels of resource sharing in the context of Mellanox InfiniBand. Using the lessons learned from this analysis, we design an improved resource-sharing model to produce scalable communication endpoints that can achieve the same performance as with dedicated communication resources per thread but using just a third of the resources.
引用
收藏
页码:803 / 812
页数:10
相关论文
共 50 条
  • [1] Logically Parallel Communication for Fast MPI plus Threads Applications
    Zambre, Rohit
    Sahasrabudhe, Damodar
    Zhou, Hui
    Berzins, Martin
    Chandramowlishwaran, Aparna
    Balaji, Pavan
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (12) : 3038 - 3052
  • [2] Lessons Learned on MPI plus Threads Communication
    Zambre, Rohit
    Chandramowlishwaran, Aparna
    [J]. SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2022,
  • [3] Characterizing MPI and Hybrid MPI plus Threads Applications at Scale: Case Study with BFS
    Amer, Abdelhalim
    Lu, Huiwei
    Balaji, Pavan
    Matsuoka, Satoshi
    [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 1075 - 1083
  • [4] MPI plus ULT: Overlapping Communication and Computation with User-Level Threads
    Lu, Huiwei
    Seo, Sangmin
    Balaji, Pavan
    [J]. 2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 444 - 454
  • [5] Frustrated with MPI plus Threads? Try MPIxThreads!
    Zhou, Hui
    Raffenetti, Ken
    Zhang, Junchao
    Guo, Yanfei
    Thakur, Rajeev
    [J]. PROCEEDINGS OF THE 2023 30TH EUROPEAN MPI USERS' GROUP MEETING, EUROMPL 2023, 2023,
  • [6] MPI plus Threads: Runtime Contention and Remedies
    Amer, Abdelhalim
    Lu, Huiwei
    Wei, Yanjie
    Balaji, Pavan
    Matsuoka, Satoshi
    [J]. ACM SIGPLAN NOTICES, 2015, 50 (08) : 239 - 248
  • [7] Concurrent programming constructs for parallel MPI applications The MPI threads library
    Berka, Tobias
    Kollias, Giorgos
    Hagenauer, Helge
    Vajtersic, Marian
    Grama, Ananth
    [J]. JOURNAL OF SUPERCOMPUTING, 2013, 63 (02): : 385 - 406
  • [8] Enabling communication concurrency through flexible MPI endpoints
    Dinan, James
    Grant, Ryan E.
    Balaji, Pavan
    Goodell, David
    Miller, Douglas
    Snir, Marc
    Thakur, Rajeev
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2014, 28 (04): : 390 - 405
  • [9] Improving Communication Asynchrony and Concurrency for Adaptive MPI Endpoints
    White, Sam
    Kale, Laxmikant V.
    [J]. 2022 IEEE/ACM INTERNATIONAL WORKSHOP ON EXASCALE MPI (EXAMPI), 2022, : 11 - 21
  • [10] Faster and Scalable MPI Applications Launching
    Dong, Yong
    Dai, Yiqin
    Xie, Min
    Lu, Kai
    Wang, Ruibo
    Chen, Juan
    Shao, Mingtian
    Wang, Zheng
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (02) : 264 - 279