A Parallel Clustering Algorithm for Power Big Data Analysis

Authors
Meng, Xiangjun [1]
Chen, Liang [2]
Li, Yidong [3]
Affiliations
[1] State Grid Shandong Power Co, Jinan, Shandong, Peoples R China
[2] Shandong Luneng Software Technol, Jinan, Shandong, Peoples R China
[3] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing, Peoples R China
Funding
U.S. National Science Foundation;
Keywords
Parallel algorithm; K-means clustering; Power data;
DOI
10.1007/978-981-10-6442-5_51
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the rapid development of information technology, power data is growing at an exponential rate. Traditional clustering algorithms do not perform satisfactorily on multi-dimensional, complex power network data, and how to process such data effectively is becoming a hot topic. This paper proposes a parallel implementation of the K-means clustering algorithm, based on the Hadoop Distributed File System and the MapReduce distributed computing framework, to address this problem. The experimental results show that the proposed algorithm significantly outperforms the traditional clustering algorithm, and that the parallel clustering algorithm can significantly reduce the time complexity and can be applied to the analysis and mining of power network data.
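The abstract does not give implementation details, so the following is only a minimal single-process sketch of how a K-means iteration is commonly split into a map step (assign each point to its nearest centroid) and a reduce step (average the points of each cluster). It is not the authors' Hadoop code. The function names (nearest, map_phase, reduce_phase, kmeans) and the in-memory "splits" that stand in for HDFS blocks are assumptions made for illustration.

```python
import math
from collections import defaultdict


def nearest(point, centroids):
    """Return the index of the centroid closest to point (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda i: math.dist(point, centroids[i]))


def map_phase(split, centroids):
    """Map step: emit (cluster index, (point, 1)) for every record in one split."""
    for point in split:
        yield nearest(point, centroids), (point, 1)


def reduce_phase(pairs, centroids):
    """Reduce step: average the points of each cluster to obtain new centroids."""
    sums, counts = {}, defaultdict(int)
    for idx, (point, n) in pairs:
        acc = sums.setdefault(idx, [0.0] * len(point))
        for d, value in enumerate(point):
            acc[d] += value
        counts[idx] += n
    # A cluster that received no points keeps its previous centroid.
    return [
        tuple(v / counts[i] for v in sums[i]) if i in sums else centroids[i]
        for i in range(len(centroids))
    ]


def kmeans(splits, centroids, max_iter=20, tol=1e-9):
    """Repeat map and reduce until the centroids stop moving (hypothetical driver)."""
    for _ in range(max_iter):
        # In a real MapReduce job each split would be mapped on a different node.
        pairs = [kv for split in splits for kv in map_phase(split, centroids)]
        updated = reduce_phase(pairs, centroids)
        shift = max(math.dist(a, b) for a, b in zip(centroids, updated))
        centroids = updated
        if shift <= tol:
            break
    return centroids


# Two hypothetical splits standing in for blocks of a distributed file.
splits = [[(0.0, 0.0), (0.0, 2.0)], [(10.0, 10.0), (10.0, 12.0)]]
result = kmeans(splits, [(0.0, 0.0), (10.0, 10.0)])
```

In this sketch the map step is independent per split, which is the part that a distributed framework can run in parallel; only the per-cluster sums and counts need to be brought together in the reduce step.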
Pages: 533-540
Page count: 8