Fault Detection and Localization in Distributed Systems Using Recurrent Convolutional Neural Networks

Cited by: 9
Authors
Qi, Guangyang [1]
Yao, Lina [1]
Uzunov, Anton V. [2]
Affiliations
[1] Univ New South Wales, Sydney, NSW, Australia
[2] Def Sci & Technol Grp, Adelaide, SA, Australia
DOI
10.1007/978-3-319-69179-4_3
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Early detection of faults is essential to maintaining the reliability of a distributed system. While there are many solutions for detecting faults, handling the high dimensionality and uncertainty of system observations to achieve accurate detection remains a challenge. In this paper, we address this challenge with a two-dimensional convolutional neural network in the form of a denoising autoencoder, combined with recurrent neural networks, that performs simultaneous fault detection and diagnosis based on real-time metrics from a given distributed system (e.g. CPU usage and memory consumption). The model provides a unified way to automatically learn useful features and make adaptive inferences regarding the onset of faults, without hand-crafted feature extraction or human diagnostic expertise. In addition, we develop a Bayesian change point detection approach for fault localization to support the fault recovery process. We conducted extensive experiments in a real distributed environment on Amazon EC2, and the results demonstrate that our proposal outperforms a variety of state-of-the-art machine learning algorithms used for fault detection and diagnosis in distributed systems.
Pages: 33-48
Page count: 16