PCFA: Mining of Projected Clusters in High Dimensional Data Using Modified FCM Algorithm

被引:0
|
作者
Murugappan, Ilango [1 ]
Vasudev, Mohan [2 ]
机构
[1] KLN Coll Engn, Dept Comp Applicat, Maduri, Tamil Nadu, India
[2] Thiagarajar Coll Engn, Dept Math, Madurai, Tamil Nadu, India
关键词
Clustering; FCM; Modified FCM; k-mean clustering; accuracy; memory usage; computation time;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data deals with the specific problem of partitioning a group of objects into a fixed number of subsets, so that the similarity of the objects in each subset is increased and the similarity across subsets is reduced. Several algorithms have been proposed in the literature for clustering, where k-means clustering and Fuzzy C-Means (FCM) clustering are the two popular algorithms for partitioning the numerical data into groups. But, due to the drawbacks of both categories of algorithms, recent researches ha. e paid more attention on modifying the clustering algorithms. In this paper, we have made an extensive analysis on modifying the FCM clustering algorithm to overcome the difficulties possessed by the k-means and FCM algorithms over high dimensional data. According to, we have proposed an algorithm, called Projected Clustering based on FCM Algorithm (PCFA). Here, we have utilized the standard FCM clustering algorithm for sub-clustering high dimensional data into reference centroids. The matrix containing the reference values is then fed as an input to the modified FCM algorithm. Finally, experimentation is carried out on the very large dimensional datasets obtained from the benchmarks data repositories and the performance of the PCFA algorithm is evaluated with the help of clustering accuracy, memory usage and the computation time. The evaluation results showed that, the PCFA algorithm shows approximately 20% improvement in the execution time and 50% improvement in memory usage over the PCKA algorithm.
引用
收藏
页码:168 / 177
页数:10
相关论文
共 50 条
  • [1] Mining Projected Clusters in High-Dimensional Spaces
    Bouguessa, Mohamed
    Wang, Shengrui
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (04) : 507 - 522
  • [2] An improved IWO-FCM Data Mining Algorithm
    Zhao Xiaoqiang
    Zhou Jinhu
    [J]. 26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 4997 - 5001
  • [3] Exploratory data mining lead by text mining using a novel high dimensional clustering algorithm
    Amarasiri, R
    Ceddia, J
    Alahakoon, D
    [J]. ICMLA 2005: FOURTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2005, : 267 - 272
  • [4] Progressive subspace skyline clusters mining on high dimensional data
    Hu, Rong
    Lu, Yansheng
    Zou, Lei
    Zhou, Chong
    [J]. EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 268 - +
  • [5] An Algorithm of Mining Exceptions in High Dimensional Data Cube
    Ding, Youwei
    Hu, Kongfa
    Cen, Ling
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, VOL I, 2009, : 458 - 461
  • [6] Finding generalized projected clusters in high dimensional spaces
    Aggarwal, CC
    Yu, PS
    [J]. SIGMOD RECORD, 2000, 29 (02) : 70 - 81
  • [7] FCM Classifier for High-dimensional Data
    Ichihashi, Hidetomo
    Honda, Katsuhiro
    Notsu, Akira
    Miyamoto, Eri
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 200 - 206
  • [8] Apriori Algorithm for Association Rule Mining in High Dimensional Data
    Harikumar, Sandhya
    Dilipkumar, Divya Usha
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON DATA SCIENCE & ENGINEERING (ICDSE), 2016, : 115 - 120
  • [9] A modified clustering algorithm for data mining
    Xu, ZJ
    Wang, LS
    Luo, JC
    Zhang, JQ
    [J]. IGARSS 2005: IEEE International Geoscience and Remote Sensing Symposium, Vols 1-8, Proceedings, 2005, : 741 - 744
  • [10] Integration of projected clusters and principal axis trees for high-dimensional data indexing and query
    Wang, B
    Gan, JQ
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 191 - 196