Runtime prediction of parallel applications with workload-aware clustering

Authors
Ju-Won Park
Eunhye Kim
Affiliations
[1] Korea Institute of Science and Technology Information, IT Convergence Technology Research Laboratory
[2] Electronics and Telecommunications Research Institute
Keywords
Runtime prediction; Workload-aware clustering; Support vector regression; Machine learning approach
DOI
Not available
Abstract
Many scientific fields rely on massive workflows that use many cores simultaneously. To support such large-scale scientific workflows, high-capacity parallel systems such as supercomputers are widely used. To increase the utilization of these systems, most schedulers apply a backfilling policy based on the runtime estimated by the user. However, these estimates are found to be extremely inaccurate because users overestimate the runtime of their jobs. Therefore, this paper presents an efficient machine learning approach to predict the runtime of parallel applications. The proposed method is divided into three phases. The first analyzes the important features of the history log data by factor analysis. The second clusters the parallel programs based on those important features. The third builds prediction models from the pattern similarity of the parallel program log data and estimates the runtime. In the experiments, we use workload logs from parallel systems (i.e., NASA-iPSC, LANL-CM5, SDSC-Par95, SDSC-Par96, and CTC-SP2) to evaluate the effectiveness of our approach. Comparing root-mean-square error with other techniques, the experimental results show that the proposed method improves accuracy by up to 69.56%.
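The cluster-then-predict idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: it skips the factor-analysis phase, assumes a plain k-means for the clustering step (the abstract does not name the algorithm), and substitutes a per-cluster mean runtime for the support vector regression listed in the keywords. The job records, the two log-scaled features, and every function name are hypothetical and chosen only for illustration.

```python
import math

def features(procs, requested_s):
    # Assumption: log-scale both fields so neither dominates the distance.
    return [math.log2(procs), math.log10(requested_s)]

def nearest(point, centroids):
    # Index of the centroid closest to the point (Euclidean distance).
    return min(range(len(centroids)), key=lambda c: math.dist(point, centroids[c]))

def kmeans(points, k, iters=10):
    # Naive deterministic init: evenly spaced points taken in input order.
    centroids = [list(points[i * len(points) // k]) for i in range(k)]
    for _ in range(iters):
        labels = [nearest(p, centroids) for p in points]
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, [nearest(p, centroids) for p in points]

def rmse(predicted, actual):
    return math.sqrt(sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual))

# Hypothetical jobs: (processors, user-requested seconds, actual runtime seconds).
history = [
    (4, 3600, 300), (4, 3600, 360), (8, 3600, 330),
    (128, 43200, 20000), (128, 43200, 22000), (256, 86400, 21000),
]
new_jobs = [(8, 7200, 420), (256, 86400, 25000)]

# Cluster the historical jobs by their features.
points = [features(p, r) for p, r, _ in history]
centroids, labels = kmeans(points, k=2)

# Stand-in predictor: mean actual runtime of each cluster (not SVR).
cluster_mean = {}
for c in set(labels):
    runtimes = [job[2] for job, lab in zip(history, labels) if lab == c]
    cluster_mean[c] = sum(runtimes) / len(runtimes)

# Predict each new job from the cluster its features fall into.
predictions = [cluster_mean[nearest(features(p, r), centroids)] for p, r, _ in new_jobs]
actual = [a for _, _, a in new_jobs]
user_estimates = [r for _, r, _ in new_jobs]

rmse_pred = rmse(predictions, actual)
rmse_user = rmse(user_estimates, actual)
print(f"RMSE of user estimates:      {rmse_user:.1f} s")
print(f"RMSE of cluster predictions: {rmse_pred:.1f} s")
```

On this toy data the user estimates overshoot by a wide margin, so the cluster-based prediction yields a much lower root-mean-square error. The numbers say nothing about the accuracy reported in the paper; they only show how such a comparison is computed.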
Pages: 4635-4651
Number of pages: 16
Citation
Park, Ju-Won; Kim, Eunhye. Runtime prediction of parallel applications with workload-aware clustering [J]. Journal of Supercomputing, 2017, 73 (11): 4635-4651