Anomaly Detection by Using Streaming K-Means and Batch K-Means

被引:0
|
作者
Wang, Zhuo [1 ]
Zhou, Yanghui [2 ]
Li, Gangmin [2 ]
机构
[1] Univ Liverpool, Dept Comp Sci, Liverpool, Merseyside, England
[2] XiAn JiaoTong Liverpool Univ, Dept Comp Sci & Software Engn, Suzhou, Peoples R China
关键词
big data; k-means clustering; streaming k-means clustering; cluster distribution; optimized K-value;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces K-Means algorithm as new technique for detecting anomaly. Data analysis has been applied to industry field widely and plays important role in it. However, conventional data analysis method cannot process large-scale data in considerable time and waste lots of computing resources. Conversely, Batch processing and Stream processing are equipped with property of processing data in short time interval, especially stream processing, can process data in real-time. This paper also compares Batch K-Means processing with Streaming K-Means processing according to distance, cost value and cluster distribution factors. Moreover, this paper also discusses how to reach optimized K value of Batch K-means model and Streaming K-means model, analyzes attributes of Batch K-Means processing and Streaming K-Means processing and finds limitations of these two processing models. Finally, the paper proposes limitations of research experiment and future improvement of clustering technique.
引用
收藏
页码:11 / 17
页数:7
相关论文
共 50 条
  • [1] An Improved K-Means Using in Anomaly Detection
    Yin, Chunyong
    Zhang, Sun
    Wang, Jin
    Kim, Jeong-Uk
    [J]. 2015 FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE THEORY, SYSTEMS AND APPLICATIONS (CCITSA 2015), 2015, : 129 - 131
  • [2] A hybrid approach for anomaly detection using K-means and PSO
    Wang, Ke-Wei
    Qin, Su-Juan
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS, NETWORK AND COMPUTER ENGINEERING (ICENCE 2016), 2016, 67 : 821 - 826
  • [3] Improving Clustering Method Performance Using K-Means, Mini Batch K-Means, BIRCH and Spectral
    Wahyuningrum, Tenia
    Khomsah, Siti
    Suyanto, Suyanto
    Meliana, Selly
    Yunanto, Prasti Eko
    Al Maki, Wikky F.
    [J]. 2021 4TH INTERNATIONAL SEMINAR ON RESEARCH OF INFORMATION TECHNOLOGY AND INTELLIGENT SYSTEMS (ISRITI 2021), 2020,
  • [4] Clustering of Image Data Using K-Means and Fuzzy K-Means
    Rahmani, Md. Khalid Imam
    Pal, Naina
    Arora, Kamiya
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (07) : 160 - 163
  • [6] Unsupervised anomaly detection using an evolutionary extension of k-means algorithm
    Lu, Wei
    Traoreá, Issa
    [J]. International Journal of Information and Computer Security, 2008, 2 (02) : 107 - 139
  • [7] Optimizing Fuzzy K-means for network anomaly detection using PSO
    Ensafi, Roya
    Dehghanzadeh, Soheila
    Akbarzadeh, Mohammad -R
    [J]. 2008 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2008, : 686 - +
  • [8] Comparative Study of K-means and Mini Batch K-means Clustering Algorithms in Android Malware Detection Using Network Traffic Analysis
    Feizollah, Ali
    Anuar, Nor Badrul
    Salleh, Rosli
    Amalina, Fairuz
    [J]. 2014 INTERNATIONAL SYMPOSIUM ON BIOMETRICS AND SECURITY TECHNOLOGIES (ISBAST), 2014, : 193 - 197
  • [9] Soil data clustering by using K-means and fuzzy K-means algorithm
    Hot, Elma
    Popovic-Bugarin, Vesna
    [J]. 2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 890 - 893
  • [10] An Improvement for Human Intestinal Parasites Detection Methodology using k-Means and Fast k-Means Clustering
    Khairudin, N. A. A.
    Nasir, A. S. A.
    Chin, L. C.
    Mohamed, Z.
    Fook, C. Y.
    [J]. 2020 IEEE-EMBS CONFERENCE ON BIOMEDICAL ENGINEERING AND SCIENCES (IECBES 2020): LEADING MODERN HEALTHCARE TECHNOLOGY ENHANCING WELLNESS, 2021, : 378 - 383