Batch and online anomaly detection for scientific applications in a Kubernetes environment

被引:5
|
作者
Hariri, Sahand [1 ]
Kind, Matias Carrasco [2 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
[2] Natl Ctr Supercomp Applicat, Urbana, IL 61801 USA
关键词
Anomaly Detection; Isolation Forest; Cloud Computing; Kubernetes; Apache Spark;
D O I
10.1145/3217880.3217883
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a cloud based anomaly detection service framework that uses a containerized Spark cluster and ancillary user interfaces all managed by Kubernetes. The stack of technology put together allows for fast, reliable, resilient and easily scalable service for either batch or streaming data. At the heart of the service, we utilize an improved version of the algorithm Isolation Forest called Extended Isolation Forest for robust and efficient anomaly detection. We showcase the design and a normal workflow of our infrastructure which is ready to deploy on any Kubernetes cluster without extra technical knowledge. With exposed APIs and simple graphical interfaces, users can load any data and detect anomalies on the loaded set or on newly presented data points using a batch or a streaming mode. With the latter, users can subscribe and get notifications on the desired output. Our aim is to develop and apply these techniques to use with scientific data. In particular we are interested in finding anomalous objects within the overwhelming set of images and catalogs produced by current and future astronomical surveys, but that can be easily adopted to other fields.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Isolation Mondrian Forest for Batch and Online Anomaly Detection
    Ma, Haoran
    Ghojogh, Benyamin
    Samad, Maria N.
    Zheng, Dongyu
    Crowley, Mark
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3051 - 3058
  • [2] A scalable architecture for online anomaly detection of WLCG batch jobs
    Kuehn, E.
    Fischer, M.
    Giffels, M.
    Jung, C.
    Petzold, A.
    [J]. 17TH INTERNATIONAL WORKSHOP ON ADVANCED COMPUTING AND ANALYSIS TECHNIQUES IN PHYSICS RESEARCH (ACAT2016), 2016, 762
  • [3] Clustering Evolving Batch System Jobs for Online Anomaly Detection
    Kuehn, Eileen
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 1534 - 1535
  • [4] Anomaly Detection for Scientific Workflow Applications on Networked Clouds
    Gaikwad, Prathamesh
    Mandal, Anirban
    Ruth, Paul
    Juve, Gideon
    Krol, Dariusz
    Deelman, Ewa
    [J]. 2016 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS 2016), 2016, : 645 - 652
  • [5] Online Monitoring Automation Using Anomaly Detection in IoT/IT Environment
    Kim, Chul
    Joe, Inwhee
    Jang, Deokwon
    Kim, Eunji
    Nam, Sanghun
    [J]. ARTIFICIAL INTELLIGENCE METHODS IN INTELLIGENT ALGORITHMS, 2019, 985 : 96 - 106
  • [6] Distributed Online Anomaly Detection for Virtualized Network Slicing Environment
    Wang, Weili
    Liang, Chengchao
    Chen, Qianbin
    Tang, Lun
    Yanikomeroglu, Halim
    Liu, Tong
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (11) : 12235 - 12249
  • [7] Differentially Private Online Active Learning with Applications to Anomaly Detection
    Ghassemi, Mohsen
    Sarwate, Anand D.
    Wright, Rebecca N.
    [J]. AISEC'16: PROCEEDINGS OF THE 2016 ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, 2016, : 117 - 128
  • [8] An Anomaly-based Detection System for Monitoring Kubernetes Infrastructures
    Almaraz-Rivera, Josue Genaro
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2023, 21 (03) : 457 - 465
  • [9] Data Stream Clustering for Online Anomaly Detection in Cloud Applications
    Sauvanaud, Carla
    Silvestre, Guthemberg
    Kaaniche, Mohamed
    Kanoun, Karama
    [J]. 2015 ELEVENTH EUROPEAN DEPENDABLE COMPUTING CONFERENCE (EDCC), 2015, : 120 - 131
  • [10] Anomaly detection in batch chemical processes
    Monroy, Isaac
    Escudero, Gerard
    Graells, Moises
    [J]. 19TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING, 2009, 26 : 255 - 260