Adaptive Learning for Concept Drift in Application Performance Modeling

被引:1
|
作者
Madireddy, Sandeep [1 ]
Balaprakash, Prasanna [1 ]
Carns, Philip [1 ]
Latham, Robert [1 ]
Lockwood, Glenn K. [2 ]
Ross, Robert [1 ]
Snyder, Shane [1 ]
Wild, Stefan M. [1 ]
机构
[1] Argonne Natl Lab, Lemont, IL 60439 USA
[2] Lawrence Berkeley Natl Lab, Berkeley, CA USA
关键词
D O I
10.1145/3337821.3337922
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Supervised learning is a promising approach for modeling the performance of applications running on large HPC systems. A key assumption in supervised learning is that the training and testing data are obtained under the same conditions. However, in production HPC systems these conditions might not hold because the conditions of the platform can change over time as a result of hardware degradation, hardware replacement, software upgrade, and configuration updates. These changes could alter the data distribution in a way that affects the accuracy of the predictive performance models and render them less useful; this phenomenon is referred to as concept drift. Ignoring concept drift can lead to suboptimal resource usage and decreased efficiency when those performance models are deployed for tuning and job scheduling in production systems. To address this issue, we propose a concept-drift-aware predictive modeling approach that comprises two components: (1) an online Bayesian changepoint detection method that can automatically identify the location of events that lead to concept drift in near-real time and (2) a moment-matching transformation inspired by transfer learning that converts the training data collected before the drift to be useful for retraining. We use application input/output performance data collected on Cori, a production supercomputing system at the National Energy Research Scientific Computing Center, to demonstrate the effectiveness of our approach. The results show that concept-drift-aware models obtain significant improvement in accuracy; the median absolute error of the best-performing Gaussian process regression improved by 58.8% when the proposed approaches were used.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Incremental Learning of Variable Rate Concept Drift
    Elwell, Ryan
    Polikar, Robi
    MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2009, 5519 : 142 - 151
  • [32] An Impact Study of Concept Drift in Federated Learning
    Yang, Guanhui
    Chen, Xiaoting
    Zhang, Tengsen
    Wang, Shuo
    Yang, Yun
    23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 1457 - 1462
  • [33] Learning in the presence of concept drift and hidden contexts
    Widmer, G
    Kubat, M
    MACHINE LEARNING, 1996, 23 (01) : 69 - 101
  • [34] Explaining Concept Drift of Deep Learning Models
    Wang, Xiaolu
    Wang, Zhi
    Shao, Wei
    Jia, Chunfu
    Li, Xiang
    CYBERSPACE SAFETY AND SECURITY, PT II, 2019, 11983 : 524 - 534
  • [35] Active learning approach to concept drift problem
    Kurlej, Bartosz
    Wozniak, Michal
    LOGIC JOURNAL OF THE IGPL, 2012, 20 (03) : 550 - 559
  • [36] Incremental Learning of Concept Drift in Nonstationary Environments
    Elwell, Ryan
    Polikar, Robi
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (10): : 1517 - 1531
  • [37] Concept drift adaptation with continuous kernel learning
    Chen, Yingying
    Dai, Hong-Liang
    INFORMATION SCIENCES, 2024, 670
  • [38] Incremental discretization, application to data with concept drift
    Pinto, Carlos
    Gama, Joao
    APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 467 - +
  • [39] Concept drift detection and adaption framework using optimized deep learning and adaptive sliding window approach
    Desale, Ketan Sanjay
    Shinde, Swati V.
    EXPERT SYSTEMS, 2023, 40 (09)
  • [40] Maritime vessel traffic modeling in the context of concept drift
    Osekowska, Ewa
    Johnson, Henric
    Carlssson, Bengt
    WORLD CONFERENCE ON TRANSPORT RESEARCH - WCTR 2016, 2017, 25 : 1457 - 1476