INCORPORATING STABILITY AND ERROR-BASED CONSTRAINTS FOR A NOVEL PARTITIONAL CLUSTERING ALGORITHM

被引:4
|
作者
Aparna, K. [1 ]
Nair, Mydhili K. [2 ]
机构
[1] BMS Inst Technol & Management, Dept Comp Applicat, Bengaluru 560064, Karnataka, India
[2] MS Ramaiah Inst Technol, Dept Informat Sci & Engn, Bengaluru 560054, Karnataka, India
关键词
Bisecting K-Means; Constraints; High dimensionality; Mean Square Error (MSE); Partitional clustering; Stability;
D O I
10.14716/ijtech.v7i4.1579
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Data clustering is one of the major areas in data mining. The bisecting clustering algorithm is one of the most widely used for high dimensional dataset. But its performance degrades as the dimensionality increases. Also, the task of selection of a cluster for further bisection is a challenging one. To overcome these drawbacks, we developed a novel partitional clustering algorithm called a HB-K-Means algorithm (High dimensional Bisecting K-Means). In order to improve the performance of this algorithm, we incorporate two constraints, such as a stability-based measure and a Mean Square Error (MSE) resulting in CHB-K-Means (Constraint-based High dimensional Bisecting K-Means) algorithm. The CHB-K-Means algorithm generates two initial partitions. Subsequently, it calculates the stability and MSE for each partition generated. Inference techniques are applied on the stability and MSE values of the two partitions to select the next partition for the re-clustering process. This process is repeated until K number of clusters is obtained. From the experimental analysis, we infer that an average clustering accuracy of 75% has been achieved. The comparative analysis of the proposed approach with the other traditional algorithms shows an achievement of a higher clustering accuracy rate and an increase in computation time.
引用
收藏
页码:691 / 700
页数:10
相关论文
共 50 条
  • [1] REDPC: A residual error-based density peak clustering algorithm
    Parmar, Milan
    Wang, Di
    Zhang, Xiaofeng
    Tan, Ah-Hwee
    Miao, Chunyan
    Jiang, Jianhua
    Zhou, You
    [J]. NEUROCOMPUTING, 2019, 348 : 82 - 96
  • [2] Novel partitional clustering algorithm for large data processing
    Lu, Zhi-Mao
    Feng, Jin-Mei
    Fan, Dong-Mei
    Yang, Peng
    Tian, Ye
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2014, 36 (05): : 1010 - 1015
  • [3] EFFICIENT DENSITY-BASED PARTITIONAL CLUSTERING ALGORITHM
    Alamgir, Zareen
    Naveed, Hina
    [J]. COMPUTING AND INFORMATICS, 2021, 40 (06) : 1322 - 1344
  • [4] An effective partitional clustering algorithm based on new clustering validity index
    Zhu, Erzhou
    Ma, Ruhui
    [J]. APPLIED SOFT COMPUTING, 2018, 71 : 608 - 621
  • [5] A control error-based fractal encoding algorithm
    Ping, F
    [J]. PREVIOUS EXPERIENCE AND CURRENT INNOVATIONS IN NON-DESTRUCTIVE TESTING, 2001, : 424 - 424
  • [6] Try and error-based scheduling algorithm for cluster tools of wafer fabrications with residency time constraints
    Bing-hai Zhou
    Xin Li
    [J]. Journal of Central South University, 2012, 19 : 187 - 192
  • [7] Research of Case Retrieval Strategy Based on Partitional Clustering Algorithm
    Ma, Shi-xia
    Liu, Jian-hua
    Liu, Dan
    [J]. 2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 2, 2010, : 307 - 310
  • [8] A Fast Partitional Clustering Algorithm based on Nearest Neighbours Heuristics
    Ganguly, Debasis
    [J]. PATTERN RECOGNITION LETTERS, 2018, 112 : 198 - 204
  • [9] Try and error-based scheduling algorithm for cluster tools of wafer fabrications with residency time constraints
    Zhou Bing-hai
    Li Xin
    [J]. JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2012, 19 (01) : 187 - 192
  • [10] Try and error-based scheduling algorithm for cluster tools of wafer fabrications with residency time constraints
    周炳海
    李鑫
    [J]. Journal of Central South University, 2012, 19 (01) : 187 - 192