A Parallel Clustering Algorithm for Power Big Data Analysis

被引:0
|
作者
Meng, Xiangjun [1 ]
Chen, Liang [2 ]
Li, Yidong [3 ]
机构
[1] State Grid Shandong Power Co, Jinan, Shandong, Peoples R China
[2] Shandong Luneng Software Technol, Jinan, Shandong, Peoples R China
[3] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
Parallel algorithm; K-means clustering; Power data;
D O I
10.1007/978-981-10-6442-5_51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the fast development of information technology, the power data is growing at an exponentially speed. In the face of multi-dimensional and complicated power network data, the performance of the traditional clustering algorithms are not satisfied. How to effectively cope with the power network data is becoming a hot topic. This paper proposes a parallel implement of K-means clustering algorithm based on Hadoop distributed file system and Mapreduce distributed computing framework to deal this problem. The experimental results show that the performance of our proposed algorithm significantly outperforms the traditional clustering algorithm and the parallel clustering algorithm can significantly reduce the time complexity and can be applied in analyzing and mining of the power network data.
引用
收藏
页码:533 / 540
页数:8
相关论文
共 50 条
  • [1] Parallel Clustering Optimization Algorithm Based on MapReduce in Big Data Mining
    Zhang, Huajie
    Song, Lei
    Zhang, Sen
    [J]. IAENG International Journal of Applied Mathematics, 2023, 53 (01):
  • [2] An Efficient Parallel Algorithm for Clustering Big Data based on the Spark Framework
    Faculty of Science of Rabat, Mohammed V University, Rabat, Morocco
    [J]. Intl. J. Adv. Comput. Sci. Appl., 7 (890-896):
  • [3] An Efficient Parallel Algorithm for Clustering Big Data based on the Spark Framework
    Dafir, Zineb
    Slaoui, Said
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 890 - 896
  • [4] p-PIC: Parallel power iteration clustering for big data
    Yan, Weizhong
    Brahmakshatriya, Umang
    Xue, Ya
    Gilder, Mark
    Wise, Bowden
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2013, 73 (03) : 352 - 359
  • [5] A GPU Based Parallel Clustering Method for Electric Power Big Data
    Ji, Cong
    Xiong, Zheng
    Fang, Chao
    Lv, Hui
    Zhang, Kaizhen
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017, : 29 - 33
  • [6] Optimization of Big Data Parallel Scheduling Based on Dynamic Clustering Scheduling Algorithm
    Liu, Fang
    He, Yanxiang
    He, Jing
    Gao, Xing
    Huang, Feihu
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2022, 94 (11): : 1243 - 1251
  • [7] Optimization of Big Data Parallel Scheduling Based on Dynamic Clustering Scheduling Algorithm
    Liu, Fang
    He, Yanxiang
    He, Jing
    Gao, Xing
    Huang, Feihu
    [J]. Journal of Signal Processing Systems, 2022, 94 (11) : 1243 - 1251
  • [8] Optimization of Big Data Parallel Scheduling Based on Dynamic Clustering Scheduling Algorithm
    Fang Liu
    Yanxiang He
    Jing He
    Xing Gao
    Feihu Huang
    [J]. Journal of Signal Processing Systems, 2022, 94 : 1243 - 1251
  • [9] Parallel Processing of Big Data using Power Iteration Clustering over MapReduce
    Jayalatchumy, D.
    Thambidurai, P.
    Alamelu, A. Vasumathi
    [J]. 2014 WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT 2014), 2014, : 176 - 178
  • [10] A survey on parallel clustering algorithms for Big Data
    Dafir, Zineb
    Lamari, Yasmine
    Slaoui, Said Chah
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (04) : 2411 - 2443