CESMTuner: An Auto-Tuning Framework for the Community Earth System Model

被引:8
|
作者
Ding Nan [1 ,2 ,3 ,4 ]
Xue Wei [1 ,2 ,3 ,4 ]
Ji Xu [1 ,2 ,3 ,4 ]
Xu Haoyu [1 ,2 ]
Song Zhenya [5 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
[3] Tsinghua Univ, Key Lab Earth Syst Modeling, Minist Educ, Beijing 100084, Peoples R China
[4] Tsinghua Univ, Ctr Earth Syst Sci, Beijing 100084, Peoples R China
[5] SOA, Inst Oceanog 1, Qingdao 266061, Peoples R China
关键词
auto-tuning; CESM; load balance; processor allocation; performance prediction; PERFORMANCE PORTABILITY; DYNAMICAL CORE; OCEAN MODEL;
D O I
10.1109/HPCC.2014.51
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The growing scientific demands of climate predication and climate projection have promoted to manage the computational resources of climate model rationally. The Community Earth System Model (CESM) is one of the state-of-the-art and the most widely-used coupled models for simulating the earth system. Although considerable effort has been put to improve the scalability of single component, CESM is still struggling with the poor performance due to load balance across components. To solve this problem, an easy-used and easy-ported auto-tuning framework named CESMTuner is proposed in this paper. It targets to reduce the time consumed of CESM as much as possible by looking for the optimal process configuration. In which, a novel process layout searching algorithm is presented that can look for the optimal process count of each component as well as the best process layout across components simultaneously. Moreover, a lightweight and accurate performance model is built to reduce searching overhead effectively. With the evaluation over TianHe-1A, CESMTuner can achieve 58.49% performance improvement compared to the widely-used sequential process layout and achieve 38.23% performance improvement compared to the heuristic branch and bound algorithm based on the performance model of simply fitting each component's runtime.
引用
收藏
页码:282 / 289
页数:8
相关论文
共 50 条
  • [21] Robust PID Auto-tuning for the Quadruple Tank System
    Ionescu, Clara M.
    Maxim, Anca
    Copot, Cosmin
    De Keyser, Robin
    IFAC PAPERSONLINE, 2016, 49 (07): : 919 - 924
  • [22] Auto-tuning TensorFlow Threading Model for CPU Backend
    Hasabnis, Niranjan
    PROCEEDINGS OF 2018 IEEE/ACM MACHINE LEARNING IN HPC ENVIRONMENTS (MLHPC 2018), 2018, : 14 - 25
  • [23] Auto-tuning fuzzy PID control of a pendubot system
    Wu, Chia-Ju
    Lee, Tsong-Li
    Fu, Yu-Yi
    Lai, Li-Chun
    2007 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS, 2007, : 391 - +
  • [24] Auto-tuning PID Module of Robot Motion System
    Xiao, Huafei
    Wang, Shaoping
    2011 6TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2011, : 668 - 673
  • [25] Auto-Tuning for Military Microgrids
    Podlesak, Thomas
    Vitale, Joseph
    Wilson, Blane
    Bohn, Frank
    Gonzalez, Michael
    Bosse, Richard
    Siegfried, Stefan
    Lynch, Jaclyn
    Barnhill, William
    2019 IEEE ENERGY CONVERSION CONGRESS AND EXPOSITION (ECCE), 2019, : 6270 - 6277
  • [26] The interpolation method for auto-tuning
    Skvortsov, L.M.
    Shuiyun Gongcheng/Port & Waterway Engineering, 1998, (09):
  • [27] Benefits of auto-tuning VFDs
    Avery, Paul
    Control Engineering, 2021, 68 (09)
  • [28] AUTO-TUNING PARALLEL SKELETONS
    Collins, Alexander
    Fensch, Christian
    Leather, Hugh
    PARALLEL PROCESSING LETTERS, 2012, 22 (02)
  • [29] Least squares auto-tuning
    Barratt, Shane T.
    Boyd, Stephen P.
    ENGINEERING OPTIMIZATION, 2021, 53 (05) : 789 - 810
  • [30] An Analytical Model-Based Auto-tuning Framework for Locality-Aware Loop Scheduling
    Xu, Rengan
    Chandrasekaran, Sunita
    Tian, Xiaonan
    Chapman, Barbara
    HIGH PERFORMANCE COMPUTING, 2016, 9697 : 3 - 20