Signal Processing Based Method for Real-Time Anomaly Detection in High-Performance Computing

被引:1
|
作者
Dey, ArwIavo [1 ]
Islam, Tanzima [1 ]
Phelps, Chase [1 ]
Kelly, Christopher [2 ]
机构
[1] Texas State Univ, Dept Comp Sci, San Marcos, TX 78666 USA
[2] Brookhaven Natl Lab, Comp Sci Initiat, Long Isl City, NY USA
关键词
Real-time anomaly detection in HPC; Signal based anomaly detection; Fast Fourier Transform; CHIMBUKO;
D O I
10.1109/COMPSAC57700.2023.00037
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Performance anomalies can manifest as irregular execution times or abnormal execution events for many reasons, including network congestion and resource contention. Detecting such anomalies in real-time by analyzing the details of performance traces at scale is impractical due to the sheer volume of data High-Performance Computing (HPC) applications produce. In this paper, we propose formulating HPC performance anomaly detection as a signal-processing problem where anomalies can be treated as noise. We evaluate our proposed method in comparison with two other commonly used anomaly detection techniques of varying complexity based on their detection accuracy and scalability. Since real-time in-situ anomaly detection at a large scale requires lightweight methods that can handle a large volume of streaming data, we find that our proposed method provides the best trade-off. We then implement the proposed method in CHIMBUKO, the first online, distributed, and scalable workflow-level performance trace analysis framework. We compare our proposed signal-based anomaly detection algorithm with two other methods using a function of their accuracy, F1 score, and detection overhead. Our experiments demonstrate that our proposed approach achieves a 99% improvement for the benchmark datasets and a 93% improvement with CHIMBUKO traces.
引用
收藏
页码:233 / 240
页数:8
相关论文
共 50 条
  • [41] Soft computing for anomaly detection and prediction to mitigate IoT-based real-time abuse
    Bhatia M.P.S.
    Sangwan S.R.
    Personal and Ubiquitous Computing, 2024, 28 (01) : 123 - 133
  • [42] High-performance meteorological data processing framework for real-time analysis and visualization
    Mbogo, Gali-Ketema
    Rakitin, Stepan, V
    Visheratin, Alexander
    6TH INTERNATIONAL YOUNG SCIENTIST CONFERENCE ON COMPUTATIONAL SCIENCE, YSC 2017, 2017, 119 : 334 - 340
  • [43] High-performance phase-locked loop for real-time image processing
    Zhang, Xiang
    Bai, Tingzhu
    Guangxue Jishu/Optical Technique, 1999, (01): : 32 - 33
  • [44] A Real-time Temperature Anomaly Detection Method for IoT Data
    Liu, Wei
    Jiang, Hongyi
    Che, Dandan
    Chen, Lifei
    Jiang, Qingshan
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS), 2020, : 112 - 118
  • [45] Anomaly Detection on Real-time Security Log using Stream Processing
    Limprasert, Wasit
    Jantana, Patcharapon
    Liangsiri, Avirut
    2022 17TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING (ISAI-NLP 2022) / 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INTERNET OF THINGS (AIOT 2022), 2022,
  • [46] Real-time simulation based on high-speed signal processing system
    Fu, Zhi-Hong
    Ma, Jing
    Xie, Pin-Fang
    Chen, Qing-Li
    Xitong Fangzhen Xuebao / Journal of System Simulation, 2007, 19 (16): : 3680 - 3683
  • [47] CNN Based High Performance Computing for Real Time Image Processing on GPU
    Potluri, Sasanka
    Fasih, Alireza
    Vutukuru, Laxminand Kishore
    Al Machot, Fadi
    Kyamakya, Kyandoghere
    AUTONOMOUS SYSTEMS: DEVELOPMENTS AND TRENDS, 2011, 391 : 255 - 266
  • [48] HIGH-PERFORMANCE REAL-TIME HETERODYNE INTERFEROMETRY
    MASSIE, NA
    NELSON, RD
    HOLLY, S
    APPLIED OPTICS, 1979, 18 (11) : 1797 - 1803
  • [49] Real-Time Implementation of Signal Processing Techniques for Disturbances Detection
    Singh, Rupal H.
    Mohanty, Soumya R.
    Kishor, Nand
    Thakur, Ankit K.
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2019, 66 (05) : 3550 - 3560