DeepHYDRA: Resource-Efficient Time-Series Anomaly Detection in Dynamically-Configured Systems

被引:0
|
作者
Stehle, Franz Kevin [1 ,2 ]
Vandelli, Wainer [2 ]
Avolio, Giuseppe [2 ]
Zahn, Felix [2 ]
Froening, Holger [1 ]
机构
[1] Heidelberg Univ, Heidelberg, Germany
[2] CERN, Geneva, Switzerland
关键词
TRANSFORMER;
D O I
10.1145/3650200.3656637
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomaly detection in distributed systems such as High-Performance Computing (HPC) clusters is vital for early fault detection, performance optimisation, security monitoring, reliability in general but also operational insights. It enables proactive measures to address issues, ensuring system reliability, resource efficiency, and protection against potential threats. Deep Neural Networks have seen successful use in detecting long-term anomalies in multidimensional data, originating for instance from industrial or medical systems, or weather prediction. A downside of such methods is that they require a static input size, or lose data through cropping, sampling, or other dimensionality reduction methods, making deployment on systems with variability on monitored data channels, such as computing clusters difficult. To address these problems, we present DeepHYDRA (Deep Hybrid DBSCAN/Reduction-Based Anomaly Detection) which combines DBSCAN and learning-based anomaly detection. DBSCAN clustering is used to find point anomalies in time-series data, mitigating the risk of missing outliers through loss of information when reducing input data to a fixed number of channels. A deep learning-based time-series anomaly detection method is then applied to the reduced data in order to identify long-term outliers. This hybrid approach reduces the chances of missing anomalies that might be made indistinguishable from normal data by the reduction process, and likewise enables the algorithm to be scalable and tolerate partial system failures while retaining its detection capabilities. Using a subset of the well-known SMD dataset family, a modified variant of the Eclipse dataset, as well as an in-house dataset with a large variability in active data channels, made publicly available with this work, we furthermore analyse computational intensity, memory footprint, and activation counts. DeepHYDRA is shown to reliably detect different types of anomalies in both large and complex datasets. At the same time, the applied reduction approach is shown to enable real-time anomaly detection of a whole computing cluster while occupying proportionally miniscule compute resources, enabling its usage on existing systems without the need for hardware changes.
引用
收藏
页码:272 / 285
页数:14
相关论文
共 50 条
  • [1] NagareDB: A Resource-Efficient Document-Oriented Time-Series Database
    Garcia Calatrava, Carlos
    Becerra Fontal, Yolanda
    Cucchietti, Fernando M.
    Divi Cuesta, Carla
    DATA, 2021, 6 (08)
  • [2] Contrastive Time-Series Anomaly Detection
    Kim, Hyungi
    Kim, Siwon
    Min, Seonwoo
    Lee, Byunghan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (10) : 5053 - 5065
  • [3] Resource-efficient scheduling for real time systems
    Larsen, Kim G.
    2003, Springer Verlag (2855):
  • [4] Resource-efficient scheduling for real time systems
    Larsen, KG
    EMBEDDED SOFTWARE, PROCEEDINGS, 2003, 2855 : 16 - 19
  • [5] ATCN: Resource-efficient Processing of Time Series on Edge
    Baharani, Mohammadreza
    Tabkhi, Hamed
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (05)
  • [6] Two dimensional time-series for anomaly detection and regulation in adaptive systems
    Burgess, M
    MANAGEMENT TECHNOLOGIES FOR E-COMMERCE AND E-BUSINESS APPLICATIONS, PROCEEDINGS, 2002, 2506 : 169 - 180
  • [7] Adaptive Multivariate Time-Series Anomaly Detection
    Lv, Jianming
    Wang, Yaquan
    Chen, Shengjing
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (04)
  • [8] Granger Causality for Time-Series Anomaly Detection
    Qiu, Huida
    Liu, Yan
    Subrahmanya, Niranjan A.
    Li, Weichang
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 1074 - 1079
  • [9] Time-Series Anomaly Detection Service at Microsoft
    Ren, Hansheng
    Xu, Bixiong
    Wang, Yujing
    Yi, Chao
    Huang, Congrui
    Kou, Xiaoyu
    Xing, Tony
    Yang, Mao
    Tong, Jie
    Zhang, Qi
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 3009 - 3017
  • [10] VUS: effective and efficient accuracy measures for time-series anomaly detectionVUS: effective and efficient accuracy measures for time-series anomaly detection...P. Boniol et al.
    Paul Boniol
    Ashwin K. Krishna
    Marine Bruel
    Qinghua Liu
    Mingyi Huang
    Themis Palpanas
    Ruey S. Tsay
    Aaron Elmore
    Michael J. Franklin
    John Paparrizos
    The VLDB Journal, 2025, 34 (3)