Fault Detection and Localization in Distributed Systems Using Recurrent Convolutional Neural Networks

Cited by: 9
Authors
Qi, Guangyang [1]
Yao, Lina [1]
Uzunov, Anton V. [2]
Affiliations
[1] Univ New South Wales, Sydney, NSW, Australia
[2] Def Sci & Technol Grp, Adelaide, SA, Australia
DOI
10.1007/978-3-319-69179-4_3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Early detection of faults is essential to maintaining the reliability of a distributed system. While there are many solutions for detecting faults, handling the high dimensionality and uncertainty of system observations to make an accurate detection remains a challenge. In this paper, we address this challenge with a two-dimensional convolutional neural network in the form of a denoising autoencoder, combined with recurrent neural networks, that performs simultaneous fault detection and diagnosis based on real-time system metrics (e.g. CPU usage and memory consumption) from a given distributed system. The model provides a unified way to automatically learn useful features and make adaptive inferences regarding the onset of faults without hand-crafted feature extraction or human diagnostic expertise. In addition, we develop a Bayesian change point detection approach for fault localization, in order to support the fault recovery process. We conducted extensive experiments in a real distributed environment on Amazon EC2, and the results demonstrate that our proposal outperforms a variety of state-of-the-art machine learning algorithms used for fault detection and diagnosis in distributed systems.
Pages: 33-48
Number of pages: 16
Related papers
50 in total
  • [1] Distributed Fault Detection in Sensor Networks using a Recurrent Neural Network
    Oliver Obst
    Neural Processing Letters, 2014, 40 : 261 - 273
  • [2] Distributed Fault Detection in Sensor Networks using a Recurrent Neural Network
    Obst, Oliver
    NEURAL PROCESSING LETTERS, 2014, 40 (03) : 261 - 273
  • [3] Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks
    Adavanne, Sharath
    Politis, Archontis
    Nikunen, Joonas
    Virtanen, Tuomas
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (01) : 34 - 48
  • [4] Distributed Fault Detection using a Recurrent Neural Network
    Obst, Oliver
    2009 INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS (IPSN 2009), 2009, : 373 - 374
  • [5] Fault detection of wind energy conversion systems using recurrent neural networks
    Talebi, Nasser
    Sadrnia, Mohammad Ali
    Darabi, Ahmad
    INTERNATIONAL JOURNAL OF SUSTAINABLE ENERGY, 2015, 34 (01) : 52 - 70
  • [6] Sound Event Localization and Detection Using Convolutional Recurrent Neural Networks and Gated Linear Units
    Komatsu, Tatsuya
    Togami, Masahito
    Takahashi, Tsubasa
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 41 - 45
  • [7] Distribution Grid Fault Classification and Localization using Convolutional Neural Networks
    Zhou, Ming
    Kazemi, Nazli
    Musilek, Petr
    SMART GRIDS AND SUSTAINABLE ENERGY, 2024, 9 (01)
  • [8] Uncertainty quantification in fault detection using convolutional neural networks
    Feng, Runhai
    Grana, Dario
    Balling, Niels
    GEOPHYSICS, 2021, 86 (03) : M41 - M48
  • [9] Detection and Localization of Ultrasound Scatterers Using Convolutional Neural Networks
    Youn, Jihwan
    Ommen, Martin Lind
    Stuart, Matthias Bo
    Thomsen, Erik Vilain
    Larsen, Niels Bent
    Jensen, Jorgen Arendt
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (12) : 3855 - 3867
  • [10] Angiodysplasia detection and localization using deep convolutional neural networks
    Shvets, Alexey A.
    Iglovikov, Vladimir I.
    Rakhlin, Alexander
    Kalinin, Alexandr A.
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 612 - 617