A novel clustering algorithm based on data transformation approaches

被引:21
|
作者
Azimi, Rasool [1 ]
Ghayekhloo, Mohadeseh [1 ]
Ghofrani, Mahmoud [2 ]
Sajedi, Hedieh [3 ]
机构
[1] Islamic Azad Univ, Qazvin Branch, Young Researchers & Elite Club, Qazvin, Iran
[2] Univ Washington, Sch Sci Technol Engn & Math STEM, Bothell, WA USA
[3] Univ Tehran, Dept Comp Sci, Coll Sci, Tehran, Iran
关键词
Data mining; Clustering; K-means; Data transformation; Silhouette; Transformed K-means;
D O I
10.1016/j.eswa.2017.01.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering provides a knowledge acquisition method for intelligent systems. This paper proposes a novel data-clustering algorithm, by combining a new initialization technique, K-means algorithm and a new gradual data transformation approach to provide more accurate clustering results than the K-means algorithm and its variants by increasing the clusters' coherence. The proposed data transformation approach solves the problem of generating empty clusters, which frequently occurs for other clustering algorithms. An efficient method based on the principal component transformation and a modified silhouette algorithm is also proposed in this paper to determine the number of clusters. Several different data sets are used to evaluate the efficacy of the proposed method to deal with the empty cluster generation problem and its accuracy and computational performance in comparison with other K-means based initialization techniques and clustering methods. The developed estimation method for determining the number of clusters is also evaluated and compared with other estimation algorithms. Significances of the proposed method include addressing the limitations of the K-means based clustering and improving the accuracy of clustering as an important method in the field of data mining and expert systems. Application of the proposed method for the knowledge acquisition in time series data such as wind, solar, electric load and stock market provides a pre-processing tool to select the most appropriate data to feed in neural networks or other estimators in use for forecasting such time series. In addition, utilization of the knowledge discovered by the proposed K-means clustering to develop rule based expert systems is one of the main impacts of the proposed method. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:59 / 70
页数:12
相关论文
共 50 条
  • [31] Novel clustering algorithm based on central symmetry
    Lin, JY
    Peng, H
    Xie, JM
    Zheng, QL
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1329 - 1334
  • [32] A NOVEL GRID-BASED CLUSTERING ALGORITHM
    Starczewski, Artur
    Scherer, Magdalena M.
    Ksiazek, Wojciech
    Debski, Maciej
    Wang, Lipo
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2021, 11 (04) : 319 - 330
  • [33] Novel Fuzzy Clustering Algorithm Based on Fireflies
    Dan, Li
    Ke, Luo
    Zhen, Sun
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND ENGINEERING INNOVATION, 2015, 12 : 112 - 116
  • [34] A NOVEL CLUSTERING ALGORITHM BASED ON P SYSTEMS
    Jiang, Yang
    Peng, Hong
    Huang, Xiaoli
    Zhang, Jiarong
    Shi, Peng
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (02): : 753 - 765
  • [35] A Novel Clustering Algorithm based on Searched Experiences
    Tsai, Chun-Wei
    Ding, Yong-Chun
    Chiang, Ming-Chao
    Yang, Chu-Sing
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 804 - 808
  • [36] A Novel Clustering Algorithm Based on DPC and PSO
    Cai, Jianghui
    Wei, Huiling
    Yang, Haifeng
    Zhao, Xujun
    IEEE ACCESS, 2020, 8 : 88200 - 88214
  • [37] A Novel Clustering Algorithm Based on Neighborhood Expansion
    Yuan, Rui
    Hu, Xiaobing
    2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 1, 2014, : 340 - 343
  • [38] A novel clustering algorithm based on graph matching
    Lin, Guoyuan
    Bie, Yuyu
    Wang, Guohui
    Lei, Min
    Journal of Software, 2013, 8 (04) : 1035 - 1041
  • [39] A Novel Clustering Algorithm Based on Mobility for VANET
    Ge, Xianlei
    Gao, Qiang
    Quan, Xunzhong
    2018 IEEE 18TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2018, : 473 - 477
  • [40] A novel clustering algorithm based on quantum games
    Li, Qiang
    He, Yan
    Jiang, Jing-ping
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2009, 42 (44)