KMDT: A Hybrid Cluster Approach for Anomaly Detection Using Big Data

Cited by: 2
Authors
Thakur, Santosh [1 ]
Dharavath, Ramesh [1 ]
Affiliations
[1] Indian Inst Technol ISM, Dept Comp Sci & Engn, Dhanbad 826004, Bihar, India
Source
Keywords
Hadoop; Spark; K-means; Decision tree; Big Data; ALGORITHMS;
DOI
10.1007/978-981-10-7563-6_18
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In the current digital era, huge volumes of data are being generated from many different sources. This has led to the notion of Big Data as a processing repository. Managing and processing such data on parallel clusters is a major challenge. To address this problem, we propose a hybrid algorithm for cluster analysis that uses the Spark framework to analyze Big Data instances. The proposed algorithm combines two machine learning techniques, K-Means (KM) and the C5.0 Decision Tree (DT). Euclidean distance is used to assign each instance to its nearest cluster, and a DT is built for each cluster using the C5.0 algorithm. The rules inferred by each DT are used to classify the instances of large datasets as anomalous or normal. Experimental results show that the proposed hybrid algorithm outperforms other existing algorithms and achieves better classification accuracy for anomaly detection.
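The two-stage idea in the abstract (assign each instance to its nearest cluster by Euclidean distance, then classify it with that cluster's own tree) can be illustrated with a minimal sketch. This is not the authors' implementation: it runs in plain Python rather than on Spark, it substitutes a one-split decision stump for the C5.0 tree, it uses a deterministic farthest-first initialisation for K-means, and every function name and the toy data are hypothetical.

```python
import math
from collections import Counter

def nearest(point, centroids):
    """Index of the centroid closest to `point` by Euclidean distance."""
    return min(range(len(centroids)), key=lambda i: math.dist(point, centroids[i]))

def kmeans(points, k, iters=20):
    """Lloyd's K-means; farthest-first initialisation is an assumption of this sketch."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(max(points, key=lambda p: min(math.dist(p, c) for c in centroids)))
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        # Recompute each centroid as the mean of its members; keep it if the cluster is empty.
        centroids = [tuple(sum(col) / len(g) for col in zip(*g)) if g else c
                     for g, c in zip(groups, centroids)]
    return centroids

def fit_stump(rows, labels):
    """One-split tree (a stand-in for C5.0): returns (feature, threshold, left, right)."""
    majority = Counter(labels).most_common(1)[0][0]
    best = (sum(l != majority for l in labels), None, None, majority, majority)
    for f in range(len(rows[0])):
        for t in sorted({r[f] for r in rows}):
            left = [l for r, l in zip(rows, labels) if r[f] <= t]
            right = [l for r, l in zip(rows, labels) if r[f] > t]
            if not left or not right:
                continue
            ll = Counter(left).most_common(1)[0][0]
            rl = Counter(right).most_common(1)[0][0]
            errors = sum(l != ll for l in left) + sum(l != rl for l in right)
            if errors < best[0]:
                best = (errors, f, t, ll, rl)
    return best[1:]

def fit_kmdt(rows, labels, k):
    """Cluster the rows, then fit one stump per cluster on that cluster's members."""
    centroids = kmeans(rows, k)
    overall = Counter(labels).most_common(1)[0][0]
    stumps = []
    for i in range(k):
        members = [(r, l) for r, l in zip(rows, labels) if nearest(r, centroids) == i]
        if members:
            stumps.append(fit_stump([r for r, _ in members], [l for _, l in members]))
        else:
            stumps.append((None, None, overall, overall))  # empty cluster: constant prediction
    return centroids, stumps

def predict_kmdt(model, row):
    """Route the row to its nearest cluster, then apply that cluster's stump."""
    centroids, stumps = model
    f, t, left, right = stumps[nearest(row, centroids)]
    if f is None:
        return left
    return left if row[f] <= t else right

# Toy data (hypothetical): two groups, each with its own kind of anomaly.
ROWS = [(0, 0), (1, 0), (0, 1), (1, 1), (0, 4), (1, 4),
        (10, 10), (11, 10), (10, 11), (11, 11), (14, 10), (14, 11)]
LABELS = ["normal"] * 4 + ["anomaly"] * 2 + ["normal"] * 4 + ["anomaly"] * 2

model = fit_kmdt(ROWS, LABELS, k=2)
for probe in [(0.5, 0.5), (0.5, 5), (10.5, 10.5), (13, 10.5)]:
    print(probe, predict_kmdt(model, probe))
```

Fitting a separate classifier per cluster lets each one learn a local decision rule. In the toy data, a second coordinate of 10 is normal near the second cluster but would be far outside the normal range of the first.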
Pages: 169-176
Page count: 8