Accelerating MPI Collectives with Process-in-Process-based Multi-object Techniques

被引:2
|
作者
Huang, Jiajun [1 ]
Ouyang, Kaiming [2 ]
Zhai, Yujia [1 ]
Liu, Jinyang [1 ]
Si, Min [3 ]
Raffenetti, Ken [4 ]
Zhou, Hui [4 ]
Hori, Atsushi [5 ]
Chen, Zizhong [1 ]
Guo, Yanfei [4 ]
Thakur, Rajeev [4 ]
机构
[1] Univ Calif Riverside, Riverside, CA 92521 USA
[2] NVIDIA Corp, Santa Clara, CA USA
[3] Meta Platforms Inc, Menlo Pk, CA USA
[4] Argonne Natl Lab, Argonne, IL USA
[5] Natl Inst Informat, Tokyo, Japan
关键词
D O I
10.1145/3588195.3595955
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the exascale computing era, optimizing MPI collective performance in high-performance computing (HPC) applications is critical. Current algorithms face performance degradation due to system call overhead, page faults, or data-copy latency, affecting HPC applications' efficiency and scalability. To address these issues, we propose PiP-MColl, a Process-in-Process-based Multi-object Interprocess MPI Collective design that maximizes small message MPI collective performance at scale. PiP-MColl features efficient multiple sender and receiver collective algorithms and leverages Process-in-Process shared memory techniques to eliminate unnecessary system call, page fault overhead, and extra data copy, improving intra- and inter-node message rate and throughput. Our design also boosts performance for larger messages, resulting in comprehensive improvement for various message sizes. Experimental results show that PiP-MColl outperforms popular MPI libraries, including OpenMPI, MVAPICH2, and Intel MPI, by up to 4.6X for MPI collectives like MPI_Scatter and MPI_Allgather.
引用
收藏
页码:333 / 334
页数:2
相关论文
共 50 条
  • [31] A Multi-object Detection Method Based on Connected Vehicles
    Wang, Yunpeng
    Wang, Xixian
    Tian, Daxin
    Duan, Xuting
    Liu, He
    Gong, Yinsheng
    Sheng, Zhengguo
    Leung, Victor C. M.
    DIVANET'19: PROCEEDINGS OF THE 9TH ACM SYMPOSIUM ON DESIGN AND ANALYSIS OF INTELLIGENT VEHICULAR NETWORKS AND APPLICATIONS, 2019, : 89 - 96
  • [32] Research on multi-object recognition algorithm based on video
    Hou, Yong
    Mao, Runhua
    Yu, Yan
    Ouyang, Yuxing
    Bian, Ce
    Song, Binhu
    Wei, Baochang
    Qin, Yiqiao
    Chang, Shengbiao
    Dai, Fengzhi
    Jiao, Hongwei
    ICAROB 2018: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS, 2018, : 655 - 658
  • [33] A Randomized Sampling based Approach to Multi-Object Tracking
    Faber, W.
    Chakravorty, S.
    Hussein, Islam I.
    2015 18TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2015, : 1307 - 1314
  • [34] Embedded Smart Tracker Based on Multi-object Tracking
    Xu Yan
    Wang Lei
    Liang Jianpeng
    Li Tao
    Cao Zuoliang
    THEORETICAL AND MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE, 2011, 164 : 190 - 197
  • [35] Review of Multi-Object Tracking Based on Deep Learning
    Li, Jiaxin
    Zhao, Lei
    Zheng, Zhaohuang
    Yong, Ting
    2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 719 - 725
  • [36] Scattered light in a DMD based multi-object spectrometer
    Chester F. Carlson Center for Imaging Science, Rochester Institute of Technology, 54 Lomb Memorial Drive, Rochester, NY, United States
    Proc SPIE Int Soc Opt Eng,
  • [37] The Multi-object Optimal Pass Design based on MOGA
    Ma Jinhong
    Zhang Wen-zhi
    CEIS 2011, 2011, 15
  • [38] Multi-object Tracking Based on Improved Mean Shift
    Gao, Meifeng
    Liu, Di
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 1588 - 1592
  • [39] Multi-Object Tracking Based on Segmentation and Collision Avoidance
    Meng Zhao
    Junhui Wang
    Maoyong Cao
    Peirui Bai
    Hongyan Gu
    Mingtao Pei
    JournalofBeijingInstituteofTechnology, 2018, 27 (02) : 213 - 219
  • [40] Point Set Registration based on Multi-Object Metrics
    Barrios, Pablo
    Adams, Martin
    2017 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2017, : 245 - 250