KMDT: A Hybrid Cluster Approach for Anomaly Detection Using Big Data

被引:2
|
作者
Thakur, Santosh [1 ]
Dharavath, Ramesh [1 ]
机构
[1] Indian Inst Technol ISM, Dept Comp Sci & Engn, Dhanbad 826004, Bihar, India
来源
关键词
Hadoop; Spark; K-means; Decision tree; Big Data; ALGORITHMS;
D O I
10.1007/978-981-10-7563-6_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the current digital era, huge data are being generated in a voluminous state from different sources. This lead towards a processing repository called Big Data. Managing and processing such data in parallel clusters is a big challenge. To capture this problem, in this paper, we propose a hybrid algorithm for cluster analysis using the Spark framework for analyzing the Big Data instances. The proposed algorithm is the combination of two machine learning techniques namely, K-Means (KM) and C5.0 Decision Tree (DT). As per the factor of cluster, euclidean distance is used to find the nearest cluster and the related DT is built for each cluster using C5.0 DT algorithm. The inferences of the DT are used to classify each anomaly and the normal instances of the large datasets. Experimental results show that the proposed hybrid algorithm outperforms with other existing algorithms and produces better classification accuracy for anomaly detection.
引用
收藏
页码:169 / 176
页数:8
相关论文
共 50 条
  • [1] A Big Data Analytics Based Approach to Anomaly Detection
    Razaq, Abdul
    Tianfield, Huaglory
    Barrie, Peter
    [J]. 2016 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES (BDCAT), 2016, : 187 - 193
  • [2] Anomaly Detection Approach using Hybrid Algorithm of Data Mining Technique
    Gadal, Saad Mohamed Ali Mohamed
    Mokhtar, Rania A.
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMMUNICATION, CONTROL, COMPUTING AND ELECTRONICS ENGINEERING (ICCCCEE), 2017,
  • [3] An unsupervised anomaly detection approach based on industrial big data
    Zhang, Cong
    Zhu, Yongsheng
    Ren, Zhijun
    Chen, Kaida
    [J]. 2019 2ND WORLD CONFERENCE ON MECHANICAL ENGINEERING AND INTELLIGENT MANUFACTURING (WCMEIM 2019), 2019, : 703 - 709
  • [4] Hybrid Approach for Detection of Anomaly Network Traffic using Data Mining Techniques
    Agarwal, Basant
    Mittal, Namita
    [J]. 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING & SECURITY [ICCCS-2012], 2012, 1 : 996 - 1003
  • [5] Hybrid approach for Anomaly Detection in Time Series Data
    Ghrib, Zeineb
    Jaziri, Rakia
    Romdhane, Rim
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [6] A Scalable Genetic Algorithm for Discovering Comprehensible Anomaly Detection Rules Using Big Data in Computer Cluster
    Zhao, Lei
    Ren, Yuan
    [J]. 2016 3RD INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2016, : 727 - 731
  • [7] Anomaly Detection for Big Log Data Using a Hadoop Ecosystem
    Son, Siwoon
    Gil, Myeong-Seon
    Moon, Yang-Sae
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2017, : 377 - 380
  • [8] Anomaly detection for cellular networks using big data analytics
    Li, Bing
    Zhao, Shengjie
    Zhang, Rongqing
    Shi, Qingjiang
    Yang, Kai
    [J]. IET COMMUNICATIONS, 2019, 13 (20) : 3351 - 3359
  • [9] Online Anomaly Detection in Big Data
    Balasingam, B.
    Sankavaram, M. S.
    Choi, K.
    Ayala, D. F. M.
    Sidoti, D.
    Pattipati, K.
    Willett, P.
    Lintz, C.
    Commeau, G.
    Dorigo, F.
    Fahrny, J.
    [J]. 2014 17TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2014,
  • [10] An Anomaly Detection Approach Based on Integrated LSTM for IoT Big Data
    Li, Chao
    Fu, Yuhan
    Zhang, Rui
    Liang, Hai
    Wang, Chonghua
    Li, Junjian
    [J]. Security and Communication Networks, 2023, 2023