Cooling Anomaly Detection for Servers and Datacenters with Naive Ensemble

Cited by: 0
Authors
Li, Cong [1]
Affiliations
[1] Intel Corp, 880 Zixing Rd, Shanghai, Peoples R China
Keywords
Cooling failures; predictive failure analysis; unsupervised anomaly detection; probability estimation
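DOI
Not available
Chinese Library Classification number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract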
We propose a novel approach to predictive analysis of potential cooling failures in servers and datacenters, in which unsupervised anomaly detection is performed on multidimensional temperature sensor data. A naive and obviously invalid independence assumption is employed to model the probability distribution. We provide a theoretical justification demonstrating that the approach relies on correctly comparing the estimated probabilities rather than on accurate probability estimation. The approach is also justified empirically in simulation experiments for two different predictive failure analysis scenarios: identifying potentially worn-out fans based on server component temperature sensor data, and identifying computer room air-conditioner failures before hotspots arise based on server inlet temperature sensor data.
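The idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the Gaussian marginals, the function names (`fit_marginals`, `log_score`, `rank_anomalies`), and the synthetic sensor values are all assumptions made for illustration. Each sensor is modeled on its own, the per-sensor log densities are summed as if the sensors were independent, and readings are ranked by that score, so only the ordering of the scores matters, not their accuracy as probabilities.

```python
import math
import random
from statistics import mean, pstdev


def fit_marginals(rows):
    """Fit one Gaussian per sensor column (Gaussian marginals are an assumption)."""
    columns = list(zip(*rows))
    # Floor the standard deviation to avoid division by zero on constant sensors.
    return [(mean(col), max(pstdev(col), 1e-6)) for col in columns]


def log_score(row, marginals):
    """Sum per-sensor log densities: the joint log-probability under independence."""
    total = 0.0
    for x, (mu, sigma) in zip(row, marginals):
        z = (x - mu) / sigma
        total += -0.5 * z * z - math.log(sigma) - 0.5 * math.log(2 * math.pi)
    return total


def rank_anomalies(rows, marginals):
    """Return row indices ordered from lowest score (most anomalous) to highest."""
    scores = [log_score(r, marginals) for r in rows]
    return sorted(range(len(rows)), key=lambda i: scores[i])


# Synthetic "healthy" readings from three hypothetical temperature sensors.
# A shared load term makes the sensors correlated, so independence is violated.
rng = random.Random(0)
normal = []
for _ in range(200):
    load = rng.gauss(0, 3)
    normal.append([
        55 + load + rng.gauss(0, 1),        # hypothetical CPU sensor
        40 + 0.5 * load + rng.gauss(0, 1),  # hypothetical memory sensor
        30 + rng.gauss(0, 1),               # hypothetical inlet sensor
    ])

marginals = fit_marginals(normal)

# One overheated reading appended to a batch of healthy ones.
hot = [75.0, 52.0, 38.0]
readings = normal[:20] + [hot]
ranking = rank_anomalies(readings, marginals)
print("most anomalous reading index:", ranking[0])
```

Even though the synthetic sensors are correlated, the overheated reading still receives the lowest score in the batch, which is the kind of ranking behavior the abstract says the approach relies on.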
Pages: 157-162
Page count: 6