Auto-tuning Performance of MPI Parallel Programs Using Resource Management in Container-based Virtual Cloud

被引:0
|
作者
Ma, Hongyi [1 ]
Wang, Liqiang [2 ]
Tak, Byung Chul [3 ]
Wang, Long [3 ]
Tang, Chunqiang [4 ]
机构
[1] Univ Wyoming, Laramie, WY 82071 USA
[2] Univ Cent Florida, Orlando, FL 32816 USA
[3] IBM Thomas J Watson Res Ctr, Yorktown Hts, NY USA
[4] Facebook Inc, Menlo Pk, CA USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CLOUD.2016.76
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Load imbalance problem is one of the major obstacles to achieving optimal performance of High Performance Computing applications. The approach of trying to distribute the problem pieces to each node with the hope of balancing execution time has limits since the performance depends not only on data size but also on many other dynamic factors. This paper describes an approach that uses adaptive resource management enabled by the container-based virtualization to solve the load imbalance problem of MPI programs running in the cloud. Our techniques dynamically adjust CPU resource allocation to MPI processes running as container instances according to the current program execution state and system resource status. The resource allocation among MPI processes is adjusted in two ways: the intra-host level, which dynamically adjusts resources within a host; and the inter-host level, which migrates containers together with MPI processes from one host to another host. We have implemented and evaluated our approach on Amazon EC2 platform using real-world scientific benchmarks and applications, which demonstrates that the performance can be improved up to 31% (with an average of 15%) when compared with the baseline.
引用
收藏
页码:545 / 552
页数:8
相关论文
共 8 条
  • [1] MicroCloud: A Container-based Solution for Efficient Resource Management in the Cloud
    Baresi, Luciano
    Guinea, Sam
    Quattrocchi, Giovanni
    Tamburri, Damian A.
    2016 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2016, : 218 - 223
  • [2] High Performance MPI Library for Container-based HPC Cloud on InfiniBand Clusters
    Zhang, Jie
    Lu, Xiaoyi
    Panda, Dhabaleswar K.
    PROCEEDINGS 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - ICPP 2016, 2016, : 268 - 277
  • [3] Performance Effects of Running Container-Based Open-MPI Cluster in Public Cloud
    Simchev, Teodor
    Atanassov, Emanouil
    LARGE-SCALE SCIENTIFIC COMPUTING (LSSC 2019), 2020, 11958 : 254 - 262
  • [4] Improving the MPI-IO Performance of Applications with Genetic Algorithm based Auto-tuning
    Bagbaba, Ayse
    Wang, Xuan
    2021 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2021, : 798 - 805
  • [5] Thoth: Automatic Resource Management with Machine Learning for Container-based Cloud Platform
    Sangpetch, Akkarit
    Sangpetch, Orathai
    Juangmarisakul, Nut
    Warodom, Supakorn
    CLOSER: PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, 2017, : 75 - 83
  • [6] Optimal Resource Allocation Using Genetic Algorithm in Container-Based Heterogeneous Cloud
    Chen, Qi-Hong
    Wen, Chih-Yu
    IEEE ACCESS, 2024, 12 : 7413 - 7429
  • [7] Secure live migration of parallel applications using container-based virtual machines
    Hacker, Thomas J.
    Romero, Fabian
    Nielsen, Jeremiah J.
    INTERNATIONAL JOURNAL OF SPACE-BASED AND SITUATED COMPUTING, 2012, 2 (01) : 45 - 57
  • [8] A REST Service Framework for Fine-Grained Resource Management in Container-Based Cloud
    Li, Li
    Tang, Tony
    Chou, Wu
    2015 IEEE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, 2015, : 645 - 652