Resource Time Series Analysis and Forecasting in Large-Scale Virtual Clusters

Cited by: 0
Authors
Lin, Yue [1 ,2 ]
Wen, Jiamin [3 ]
Zhang, Xudong [4 ]
Liang, Yan [3 ]
Li, Jianjiang [1 ]
Affiliations
[1] Univ Sci & Technol Beijing, Dept Comp Sci & Technol, Beijing 100083, Peoples R China
[2] 41st Inst CETC, Qingdao 266555, Peoples R China
[3] China Natl Petr Corp, BGP Inc, Zhuozhou 072751, Peoples R China
[4] Natl Engn Res Ctr Oil & Gas Explorat Comp Software, Zhuozhou 072751, Peoples R China
Source
BIG DATA MINING AND ANALYTICS | 2025 / Vol. 8 / No. 3
Keywords
workload forecasting; multivariate time series forecasting; deep learning; MODEL; PREDICTION;
DOI
10.26599/BDMA.2024.9020085
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In today's rapidly evolving internet landscape, prominent companies across various industries face increasingly complex business operations, leading to significant cluster-scale growth. However, this growth brings about challenges in cluster management and the inefficient utilization of vast amounts of data due to its low value density. This paper, based on the large-scale cluster virtualization and monitoring system of the data center of the Bureau of Geophysical Prospecting (BGP), utilizes time series data of host resources from the monitoring system's time series database to propose a multivariate multi-step time series forecasting model, MUL-CNN-BiGRU-Attention, for forecasting CPU load on virtual cluster hosts. The model undergoes extensive offline training using a large volume of time series data, followed by deployment using TensorFlow Serving. Recent small-batch data are employed for fine-tuning model parameters to better adapt to current data patterns. Comparative experiments are conducted between the proposed model and other baseline models, demonstrating notable improvements in Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and $R^{2}$ metrics by up to 35.2%, 56.1%, 32.5%, and 10.3%, respectively. Additionally, ablation experiments are designed to investigate the impact of different factors on the performance of the forecasting model, providing valuable insights for parameter optimization based on experimental results.
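The four evaluation metrics named in the abstract have standard definitions, which can be sketched as follows. This is a minimal illustration in plain Python, not the authors' evaluation code: the function name `regression_metrics` and the sample CPU-load values are hypothetical, and the record does not state how the paper aggregates errors across hosts or forecast steps.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MAE, MSE, RMSE and R^2 for two equal-length sequences."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]

    mae = sum(abs(e) for e in errors) / n   # Mean Absolute Error
    mse = sum(e * e for e in errors) / n    # Mean Squared Error
    rmse = math.sqrt(mse)                   # Root Mean Squared Error

    # R^2 = 1 - (residual sum of squares / total sum of squares).
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    ss_res = sum(e * e for e in errors)
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")

    return {"mae": mae, "mse": mse, "rmse": rmse, "r2": r2}


# Hypothetical CPU-load values (fractions of capacity), for illustration only.
actual = [0.50, 0.60, 0.70, 0.80]
forecast = [0.55, 0.55, 0.75, 0.75]
metrics = regression_metrics(actual, forecast)
```

Lower MAE, MSE, and RMSE indicate a better fit, while a higher $R^{2}$ does, so an "improvement" in the first three is a decrease and in the last an increase.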
Pages: 592-605
Page count: 14