Comparison of data driven modeling approaches for temperature prediction in data centers

被引:64
|
作者
Athavale, Jayati [1 ]
Yoda, Minami [1 ]
Joshi, Yogendra [1 ]
机构
[1] Georgia Inst Technol, George W Woodruff Sch Mech Engn, Atlanta, GA 30332 USA
关键词
Compact modeling; Data center; Rapid temperature prediction;
D O I
10.1016/j.ijheatmasstransfer.2019.02.041
中图分类号
O414.1 [热力学];
学科分类号
摘要
Energy-efficient thermal management of data centers based on dynamic optimization and provisioning of cooling resources requires rapid (nearly real-time) predictions of temperatures within data centers. This work for the first time compares multiple Data-Driven Models (DDMs) to achieve such rapid temperature predictions. DDM typically employs statistical or machine learning-based tools, in combination with physics-based modeling and/or experimental data to predict system behavior. In general, DDM approaches are well-suited to systems that have multiple operational states based on interactions between the many electrical, mechanical and control parameters typical of data centers. This study compares the performance of three different DDM methods, namely Artificial Neural Networks (ANN), Support Vector Regression (SVR), Gaussian Process Regression (GPR) in predicting both steady-state and transient rack inlet air temperature distributions in data centers. Additionally, Proper Orthogonal Decomposition (POD) was considered for transient modeling. The data used for training and analysis were obtained by performing 300 offline numerical simulations with a room-level, experimentally validated computational fluid dynamics/heat transfer (CFD/HT) model. The performance of the four data-driven models was evaluated based on the absolute mean error for interpolation and extrapolation, and the adaptability of the models to changes in physical domain (data center room) configuration. Additionally, the impact of the size of the training data set on prediction accuracy is also compared for the four models. For the steady-state case study, the predictions for ANN, SVR and GPR models are in good agreement with CFD/HT simulations, with the GPR model having the smallest overall average prediction error of 0.6 degrees C in rack inlet air temperature, corresponding to a relative error of 2.7% with respect to rack inlet temperature measured in degrees C. It was found that for all the frameworks the prediction error increases when the size of training data set was less than 300 samples. The GPR model had the best accuracy for smaller training data sets compared with the other models, with an average prediction error for rack inlet temperatures <1 degrees C when trained on only 50 simulations. For the transient case study, the interpolative prediction error for all the models is very low ( <0.3 degrees C); however, the extrapolative prediction errors are much greater, and appear to be directly proportional to the (here, temporal) "distance" from the interrogation point to the input parameter space. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1039 / 1052
页数:14
相关论文
共 50 条
  • [41] Approaches to Improving the Efficiency of Data Centers
    Kostenko, V. A.
    Chupakhin, A. A.
    PROGRAMMING AND COMPUTER SOFTWARE, 2019, 45 (05) : 251 - 256
  • [42] Prediction of Protein Aggregation Propensity via Data-Driven Approaches
    Kang, Seungpyo
    Kim, Minseon
    Sun, Jiwon
    Lee, Myeonghun
    Min, Kyoungmin
    ACS BIOMATERIALS SCIENCE & ENGINEERING, 2023, 9 (11) : 6451 - 6463
  • [43] Approaches to Improving the Efficiency of Data Centers
    V. A. Kostenko
    A. A. Chupakhin
    Programming and Computer Software, 2019, 45 : 251 - 256
  • [44] Application of Convolutional Neural Network to Prediction of Temperature Distribution in Data Centers
    Tashiro, Shinya
    Nakamura, Yutaka
    Matsuda, Kazuhiro
    Matsuoka, Morito
    PROCEEDINGS OF 2016 IEEE 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2016, : 656 - 661
  • [45] Gap filling crowdsourced air temperature data in cities using data-driven approaches
    He, Miao
    Luo, Zhiwen
    Xie, Xiaoxiong
    Wang, Peng
    Wang, Haichao
    Zapata-Lancaster, Gabriela
    BUILDING AND ENVIRONMENT, 2025, 271
  • [46] Modeling extreme events: Univariate and multivariate data-driven approaches
    Buritica, Gloria
    Hentschel, Manuel
    Pasche, Olivier C.
    Rottger, Frank
    Zhang, Zhongwei
    EXTREMES, 2024,
  • [47] Editorial: Advances in data-driven approaches and modeling of complex systems
    Mohd, Mohd Hafiz
    Nguyen-Huu, Tri
    Park, Junpyo
    Addawe, Joel M.
    Haga, Hirohide
    FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2023, 9
  • [48] Application of data-driven modeling approaches to industrial hydroprocessing units
    Ghosh, Debanjan
    Moreira, Jesús
    Mhaskar, Prashant
    Chemical Engineering Research and Design, 2022, 177 : 123 - 135
  • [49] Application of data-driven modeling approaches to industrial hydroprocessing units
    Ghosh, Debanjan
    Moreira, Jesus
    Mhaskar, Prashant
    CHEMICAL ENGINEERING RESEARCH & DESIGN, 2022, 177 : 123 - 135
  • [50] Data driven approaches to modeling and analysis of bioprocesses: Some industrial examples
    Hodge, D
    Simon, L
    Karim, MN
    PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 2062 - 2076