Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services

被引:2
|
作者
Liu, Jinyang [1 ]
Yang, Tianyi [1 ]
Chen, Zhuangbin [2 ]
Su, Yuxin [2 ]
Feng, Cong [3 ]
Yang, Zengyin [3 ]
Lyu, Michael R. [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Sun Yat Sen Univ, Sch Software Engn, Zhuhai, Peoples R China
[3] Fluawei Cloud Comp Technol Co Ltd, Comp & Networking Innovat Lab, Dongguan, Peoples R China
来源
2023 IEEE 34TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, ISSRE | 2023年
基金
中国国家自然科学基金;
关键词
Anomaly Detection; Multivariate Monitoring Metrics; Software Reliability;
D O I
10.1109/ISSRE59848.2023.00045
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As modern software systems continue to grow in terms of complexity and volume, anomaly detection on multivariate monitoring metrics, which profile systems' health status, becomes more and more critical and challenging. In particular, the dependency between different metrics and their historical patterns plays a critical role in pursuing prompt and accurate anomaly detection. Existing approaches fall short of industrial needs for being unable to capture such information efficiently. To fill this significant gap, in this paper, we propose CMAnomaly, an anomaly detection framework on multivariate monitoring metrics based on collaborative machine. The proposed collaborative machine is a mechanism to capture the pairwise interactions along with feature and temporal dimensions with linear time complexity. Cost-effective models can then be employed to leverage both the dependency between monitoring metrics and their historical patterns for anomaly detection. The proposed framework is extensively evaluated with both public data and industrial data collected from a large-scale online service system of Huawei Cloud. The experimental results demonstrate that compared with state-of-the-art baseline models, CMAnomaly achieves an average F1 score of 0.9494, outperforming baselines by 6.77% similar to 10.68%, and runs 10x similar to 20x faster. Furthermore, we also share our experience of deploying CMAnomaly in Huawei Cloud.
引用
收藏
页码:36 / 45
页数:10
相关论文
共 50 条
  • [21] An extreme learning machine for unsupervised online anomaly detection in multivariate time series
    Peng, Xinggan
    Li, Hanhui
    Yuan, Feng
    Razul, Sirajudeen Gulam
    Chen, Zhebin
    Lin, Zhiping
    NEUROCOMPUTING, 2022, 501 : 596 - 608
  • [22] Online Multivariate Time Series Anomaly Detection Method Based on Contrastive Learning
    Dong, Xiyao
    Liu, Hui
    Du, Junzhao
    Wang, Zhengkai
    Wang, Cheng
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XIII, ICIC 2024, 2024, 14874 : 468 - 479
  • [23] Protecting VNF services with smart online behavior anomaly detection method
    Cheng, Yuxia
    Yao, Huijuan
    Wang, Yu
    Xiang, Yang
    Li, Hongpei
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 95 : 265 - 276
  • [24] Back to the Metrics: Exploration of Distance Metrics in Anomaly Detection
    Lin, Yujing
    Li, Xiaoqiang
    APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [25] StreamAD: A cloud platform metrics-oriented benchmark for unsupervised online anomaly detection
    Xu J.
    Lin C.
    Liu F.
    Wang Y.
    Xiong W.
    Li Z.
    Guan H.
    Xie G.
    BenchCouncil Transactions on Benchmarks, Standards and Evaluations, 2023, 3 (02):
  • [26] Improving Predictability of User-Affecting Metrics to Support Anomaly Detection in Cloud Services
    Rufino, Vilc Queupe
    Nogueira, Mateus Schulz
    Avritzer, Alberto
    Menasche, Daniel Sadoc
    Russo, Barbara
    Janes, Andrea
    Ferme, Vincenzo
    Van Hoorn, Andre
    Schulz, Henning
    Lima, Cabral
    IEEE ACCESS, 2020, 8 : 198152 - 198167
  • [27] Unsupervised Anomaly Event Detection for Cloud Monitoring using Online Arima
    Schmidt, Florian
    Suri-Payer, Florian
    Gulenko, Anton
    Wallschlager, Marcel
    Acker, Alexander
    Kao, Odej
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING COMPANION (UCC COMPANION), 2018, : 71 - 76
  • [28] Hierarchical PCA-Based Multivariate Statistical Network Monitoring for Anomaly Detection
    Macia-Fernandez, Gabriel
    Camacho, Jose
    Garcia-Teodoro, Pedro
    Rodriguez-Gomez, Rafael A.
    2016 8TH IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS 2016), 2016,
  • [29] GRAPH4: A Security Monitoring Architecture Based on Data Plane Anomaly Detection Metrics Calculated over Attack Graphs
    Gori, Giacomo
    Rinieri, Lorenzo
    Al Sadi, Amir
    Melis, Andrea
    Callegati, Franco
    Prandini, Marco
    FUTURE INTERNET, 2023, 15 (11)
  • [30] Concept Drift Adaption for Online Anomaly Detection in Structural Health Monitoring
    Tian, Hongda
    Nguyen Lu Dang Khoa
    Anaissi, Ali
    Wang, Yang
    Chen, Fang
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2813 - 2821