A data-driven approach to simultaneous fault detection and diagnosis in data centers

被引:17
|
作者
Asgari, Sahar [1 ,2 ]
Gupta, Rohit [2 ]
Puri, Ishwar K. [1 ,2 ]
Zheng, Rong [1 ,3 ]
机构
[1] McMaster Univ, Comp Infrastruct Res Ctr, Hamilton, ON, Canada
[2] McMaster Univ, Dept Mech Engn, Hamilton, ON, Canada
[3] McMaster Univ, Dept Comp & Software Engn, Hamilton, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Data center; Fault diagnosis; Classification; Time-series analysis; Gray-box model; MULTIPLE SIMULTANEOUS FAULTS; QUANTITATIVE MODEL; NEURAL-NETWORKS; AIR; TEMPERATURE; PREDICTIONS; ENVIRONMENT; MANAGEMENT; BUILDINGS; STRATEGY;
D O I
10.1016/j.asoc.2021.107638
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The failure of cooling systems in data centers (DCs) leads to higher indoor temperatures, causing crucial electronic devices to fail, and produces a significant economic loss. To circumvent this issue, fault detection and diagnosis (FDD) algorithms and associated control strategies can be applied to detect, diagnose, and isolate faults. Existing methods that apply FDD to DC cooling systems are designed to successfully overcome individually occurring faults but have difficulty in handling simultaneous faults. These methods either require expensive measurements or those made over a wide range of conditions to develop training models, which can be time-consuming and costly. We develop a rapid and accurate, single and multiple FDD strategy for a DC with a row-based cooling system using data-driven fault classifiers informed by a gray-box temperature prediction model. The gray-box model provides thermal maps of the DC airspace for single as well as a few simultaneous failure conditions, which are used as inputs for two different data-driven classifiers, CNN and RNN, to rapidly predict multiple simultaneous failures. The model is validated with testing data from an experimental DC. Also, the effect of adding Gaussian white noise to training data is discussed and observed that even with low noisy environment, the FDD strategy can diagnose multiple faults with accuracy as high as 100% while requiring relatively few simultaneous fault training data samples. Finally, the different classifiers are compared in terms of accuracy, confusion matrix, precision, recall and F1-score. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] An H∞ approach to data-driven simultaneous fault detection and control
    Salim, M.
    Khosrowjerdi, M. J.
    [J]. IMA JOURNAL OF MATHEMATICAL CONTROL AND INFORMATION, 2017, 34 (04) : 1195 - 1213
  • [2] A Data-Driven Clustering Approach for Fault Diagnosis
    Hou, Jian
    Xiao, Bing
    [J]. IEEE ACCESS, 2017, 5 : 26512 - 26520
  • [3] Fault Detection and Diagnosis for Wind Turbines using Data-Driven Approach
    Francisco Manrique, Ruben
    Andres Giraldo, Fabian
    Sofrony Esmeral, Jorge
    [J]. 2012 7TH COLOMBIAN COMPUTING CONGRESS (CCC), 2012,
  • [4] A Data-Driven Approach of Fault Detection for LTI Systems
    Chen Zhaoxu
    Fang Huajing
    [J]. 2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 6174 - 6179
  • [5] A DATA-DRIVEN FAULT DETECTION APPROACH WITH PERFORMANCE OPTIMIZATION
    Li, Linlin
    Ding, Steven X.
    Peng, Kaixiang
    Han, Huayun
    Yang, Ying
    Yang, Xu
    [J]. CANADIAN JOURNAL OF CHEMICAL ENGINEERING, 2018, 96 (02): : 507 - 514
  • [6] Cold Start Approach for Data-Driven Fault Detection
    Grbovic, Mihajlo
    Li, Weichang
    Subrahmanya, Niranjan A.
    Usadi, Adam K.
    Vucetic, Slobodan
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2013, 9 (04) : 2264 - 2273
  • [7] Data-driven Fault Detection and Diagnosis for HVAC water chillers
    Beghi, A.
    Brignoli, R.
    Cecchinato, L.
    Menegazzo, G.
    Rampazzo, M.
    Simmini, F.
    [J]. CONTROL ENGINEERING PRACTICE, 2016, 53 : 79 - 91
  • [8] Fault detection, diagnosis and data-driven modeling in HVAC chillers
    Namburu, SM
    Luo, JH
    Azam, M
    Choi, K
    Pattipati, KR
    [J]. Signal Processing, Sensor Fusion, and Target Recognition XIV, 2005, 5809 : 143 - 154
  • [9] A Data-Driven and Probabilistic Approach to Residual Evaluation for Fault Diagnosis
    Svard, Carl
    Nyberg, Mattias
    Frisk, Erik
    Krysander, Mattias
    [J]. 2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 95 - 102
  • [10] A Data-Driven Fault Diagnosis Approach for Anemometers in Wind Farm
    Zhang, Jiusi
    Li, Kuan
    Luo, Hao
    Yin, Shen
    [J]. IECON 2020: THE 46TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2020, : 405 - 410