Predicting Hard Disk Failures in Data Centers Using Temporal Convolutional Neural Networks

被引:3
|
作者
Burrello, Alessio [1 ]
Pagliari, Daniele Jahier [2 ]
Bartolini, Andrea [1 ]
Benini, Luca [1 ,4 ]
Macii, Enrico [3 ]
Poncino, Massimo [2 ]
机构
[1] Univ Bologna, Dept Elect Elect & Informat Engn, I-40136 Bologna, Italy
[2] Politecn Torino, Dept Control & Comp Engn, Turin, Italy
[3] Politecn Torino, Interuniv Dept Reg & Urban Studies & Planning, Turin, Italy
[4] Swiss Fed Inst Technol, Dept Informat Technol & Elect Engn, CH-8092 Zurich, Switzerland
关键词
Predictive maintenance; IoT; Deep learning; Sequence analysis; Temporal Convolutional Networks;
D O I
10.1007/978-3-030-71593-9_22
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In modern data centers, storage system failures are major contributors to downtimes and maintenance costs. Predicting these failures by collecting measurements from disks and analyzing them with machine learning techniques can effectively reduce their impact, enabling timely maintenance. While there is a vast literature on this subject, most approaches attempt to predict hard disk failures using either classic machine learning solutions, such as Random Forests (RFs) or deep Recurrent Neural Networks (RNNs). In this work, we address hard disk failure prediction using Temporal Convolutional Networks (TCNs), a novel type of deep neural network for time series analysis. Using a real-world dataset, we show that TCNs outperform both RFs and RNNs. Specifically, we can improve the Fault Detection Rate (FDR) approximate to 7.5% of (FDR = 89.1%) compared to the state-of-the-art, while simultaneously reducing the False Alarm Rate (FAR = 0.052%). Moreover, we explore the network architecture design space showing that TCNs are consistently superior to RNNs for a given model size and complexity and that even relatively small TCNs can reach satisfactory performance. All the codes to reproduce the results presented in this paper are available at https://github.com/ABurrello/tcn-hard-disk-failure-prediction.
引用
收藏
页码:277 / 289
页数:13
相关论文
共 50 条
  • [1] FailureSim: A System for Predicting Hardware Failures in Cloud Data Centers Using Neural Networks
    Davis, Nickolas Allen
    Rezgui, Abdelmounaam
    Soliman, Hamdy
    Manzanares, Skyler
    Coates, Milagre
    [J]. 2017 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2017, : 544 - 551
  • [2] Spatio-Temporal Downscaling of Climate Data Using Convolutional and Error-Predicting Neural Networks
    Serifi, Agon
    Guenther, Tobias
    Ban, Nikolina
    [J]. FRONTIERS IN CLIMATE, 2021, 3
  • [3] Predicting Sleeping Quality Using Convolutional Neural Networks
    Sathish, Vidya Rohini Konanur
    Woo, Wai Lok
    Ho, Edmond S. L.
    [J]. ADVANCES IN CYBERSECURITY, CYBERCRIMES, AND SMART EMERGING TECHNOLOGIES, 2023, 4 : 175 - 184
  • [4] Predicting Magnetization Directions Using Convolutional Neural Networks
    Nurindrawati, Felicia
    Sun, Jiajia
    [J]. JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH, 2020, 125 (10)
  • [5] Predicting mortgage default using convolutional neural networks
    Kvamme, Havard
    Sellereite, Nikolai
    Aas, Kjersti
    Sjursen, Steffen
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 102 : 207 - 217
  • [6] PREDICTING UNIVERSITY DROPOUT BY USING CONVOLUTIONAL NEURAL NETWORKS
    Mezzini, Mauro
    Bonavolonta, Gianmarco
    Agrusti, Francesco
    [J]. 13TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED2019), 2019, : 9155 - 9163
  • [7] Predicting Failures in Hard Drives with LSTM Networks
    Lima, Fernando Dione S.
    Amaral, Gabriel M. R.
    Leite, Lucas G. M.
    Gomes, Joao Paulo P.
    Machado, Javam C.
    [J]. 2017 6TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2017, : 222 - 227
  • [8] On Predicting Geolocation of Tweets Using Convolutional Neural Networks
    Huang, Binxuan
    Carley, Kathleen M.
    [J]. SOCIAL, CULTURAL, AND BEHAVIORAL MODELING, 2017, 10354 : 281 - 291
  • [9] Predicting Eye Fixations using Convolutional Neural Networks
    Liu, Nian
    Han, Junwei
    Zhang, Dingwen
    Wen, Shifeng
    Liu, Tianming
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 362 - 370
  • [10] Anomaly Detection Model for Predicting Hard Disk Drive Failures
    Djurasevic, Sladjana M.
    Pesovic, Uros M.
    Djordjevic, Borislav S.
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (08) : 549 - 566