Clustering-based real-time anomaly detection-A breakthrough in big data technologies

被引:46
|
作者
Habeeb, Riyaz Ahamed Ariyaluran [1 ]
Nasaruddin, Fariza [1 ]
Gani, Abdullah [6 ]
Amanullah, Mohamed Ahzam [3 ]
Hashem, Ibrahim Abaker Targio [2 ]
Ahmed, Ejaz [4 ]
Imran, Muhammad [5 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Informat Syst, Kuala Lumpur 50603, Malaysia
[2] Taylors Univ, Sch Comp & Informat Technol, Subang Jaya, Malaysia
[3] Telekom Res & Dev Sdn Bhd, Res & Innovat Dev, Cyberjaya, Malaysia
[4] Univ Malaya, Ctr Mobile Cloud Comp Res C4MCCR, Kuala Lumpur, Malaysia
[5] King Saud Univ, Coll Appl Comp Sci, Riyadh, Saudi Arabia
[6] Univ Malaya, Dept Comp Syst & Technol, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
来源
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES | 2022年 / 33卷 / 08期
关键词
DETECTION SYSTEM; FRAMEWORK; INTERNET; MACHINE;
D O I
10.1002/ett.3647
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Off late, the ever increasing usage of a connected Internet-of-Things devices has consequently augmented the volume of real-time network data with high velocity. At the same time, threats on networks become inevitable; hence, identifying anomalies in real time network data has become crucial. To date, most of the existing anomaly detection approaches focus mainly on machine learning techniques for batch processing. Meanwhile, detection approaches which focus on the real-time analytics somehow deficient in its detection accuracy while consuming higher memory and longer execution time. As such, this paper proposes a novel framework which focuses on real-time anomaly detection based on big data technologies. In addition, this paper has also developed streaming sliding window local outlier factor coreset clustering algorithms (SSWLOFCC), which was then implemented into the framework. The proposed framework that comprises BroIDS, Flume, Kafka, Spark streaming, SparkMLlib, Matplot and HBase was evaluated to substantiate its efficacy, particularly in terms of accuracy, memory consumption, and execution time. The evaluation is done by performing critical comparative analysis using existing approaches, such as K-means, hierarchical density-based spatial clustering of applications with noise (HDBSCAN), isolation forest, spectral clustering and agglomerative clustering. Moreover, Adjusted Rand Index and memory profiler package were used for the evaluation of the proposed framework against the existing approaches. The outcome of the evaluation has substantially proven the efficacy of the proposed framework with a much higher accuracy rate of 96.51% when compared to other algorithms. Besides, the proposed framework also outperformed the existing algorithms in terms of lesser memory consumption and execution time. Ultimately the proposed solution enable analysts to precisely track and detect anomalies in real time.
引用
收藏
页数:27
相关论文
共 50 条
  • [41] A Fast Clustering-based Recommender System for Big Data
    Hong-Quan Do
    Nguyen, T. H-An
    Quoc-Anh Nguyen
    Trung-Hieu Nguyen
    Viet-Vu Vu
    Cuong Le
    2022 24TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ARITIFLCIAL INTELLIGENCE TECHNOLOGIES TOWARD CYBERSECURITY, 2022, : 353 - +
  • [42] Real-time Bayesian anomaly detection in streaming environmental data
    Hill, David J.
    Minsker, Barbara S.
    Amir, Eyal
    WATER RESOURCES RESEARCH, 2009, 45
  • [43] Real-time Detection for Anomaly Data in Microseismic Monitoring System
    Ji Chang-peng
    Liu Li-li
    PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND NATURAL COMPUTING, VOL II, 2009, : 307 - +
  • [44] Real-Time Anomaly Detection from Environmental Data Streams
    Trilles, Sergio
    Schade, Sven
    Belmonte, Oscar
    Huerta, Joaquin
    AGILE 2015: GEOGRAPHIC INFORMATION SCIENCE AS AN ENABLER OF SMARTER CITIES AND COMMUNITIES, 2015, : 125 - 144
  • [45] A Real-time Temperature Anomaly Detection Method for IoT Data
    Liu, Wei
    Jiang, Hongyi
    Che, Dandan
    Chen, Lifei
    Jiang, Qingshan
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS), 2020, : 112 - 118
  • [46] Real-time Anomaly Detection and Classification in Streaming PMU Data
    Hannon, Christopher
    Deka, Deepjyoti
    Jin, Dong
    Vuffray, Marc
    Lokhov, Andrey Y.
    2021 IEEE MADRID POWERTECH, 2021,
  • [47] Framework for Real-Time Predictive Maintenance Supported by Big Data Technologies
    Teixeira, Marco
    Thierstein, Francisco
    Entringer, Pedro
    Sa, Hugo
    Leitao, Jose Demetrio
    Leal, Fatima
    GOOD PRACTICES AND NEW PERSPECTIVES IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, WORLDCIST 2024, 2024, 985 : 13 - 22
  • [48] Trajectory Clustering-Based Anomaly Detection in Indoor Human Movement
    Lan, Doi Thi
    Yoon, Seokhoon
    SENSORS, 2023, 23 (06)
  • [49] A Clustering-Based Method to Anomaly Detection in Thermal Power Plants
    Drapal, Patricia
    Clemente, Jullya
    Reyes, Dailys Maite
    de Souza, Starch Melo
    Lins, Anthony
    Prudencio, Ricardo B. C.
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [50] Real-Time Maritime Traffic Anomaly Detection Based on Sensors and History Data Embedding
    Venskus, Julius
    Treigys, Povilas
    Bernataviciene, Jolita
    Tamulevicius, Gintautas
    Medvedev, Viktor
    SENSORS, 2019, 19 (17)