An Improved Initialization Method for Clustering High-Dimensional Data

被引：0

作者：

Zhang, Yanping ^{[1
]}

Jiang, Qingshan ^{[1
]}

机构：

[1] Xiamen Univ, Software Sch, Xiamen 361005, Fujian, Peoples R China

来源：

2010 2ND INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS PROCEEDINGS (DBTA) | 2010年

关键词：

K-Means type clustering; initialization method; distance weight coefficient; neighborhood density;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Searching initial centers in high dimensional space is an interesting and important problem which is relevant for the wide various types of K-Means algorithm. However, this is a very difficult problem, due to the "curse of dimensionality" and the inherently sparse data. Algorithm IMSND is one of the latest initialization methods that are based on the idea of sharing neighborhood density. Concerning the accuracy and the input parameters of IMSND, an optimized algorithm is presented, which employs a new density measure with distance weight coefficient to improve the search accuracy. Experimental results on real world datasets show that our algorithm outperforms other algorithms, including IMSND.

引用

页数：4

共 50 条

[1] An Initialization Method for Clustering High-Dimensional Data
Chen, Luying
Chen, Lifei
Jiang, Qingshan
Wang, Beizhan
Shi, Liang
[J]. FIRST INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009, : 444 - +
[2] An efficient clustering method of data mining for high-dimensional data
Chang, JW
Kang, HM
[J]. 8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTING TECHNIQUES, 2004, : 273 - 278
[3] High-dimensional clustering method for high performance data mining
Chang, Jae-Woo
Lee, Hyun-Jo
[J]. COMPUTATIONAL SCIENCE - ICCS 2007, PT 3, PROCEEDINGS, 2007, 4489 : 621 - +
[4] An efficient clustering method for high-dimensional data mining
Chang, JW
Kim, YK
[J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - SBIA 2004, 2004, 3171 : 276 - 285
[5] High-dimensional data clustering
Bouveyron, C.
Girard, S.
Schmid, C.
[J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 502 - 519
[6] Clustering High-Dimensional Data
Masulli, Francesco
Rovetta, Stefano
[J]. CLUSTERING HIGH-DIMENSIONAL DATA, CHDD 2012, 2015, 7627 : 1 - 13
[7] Clustering of High-Dimensional and Correlated Data
McLachlan, Geoffrey J.
Ng, Shu-Kay
Wang, K.
[J]. DATA ANALYSIS AND CLASSIFICATION, 2010, : 3 - 11
[8] Clustering in high-dimensional data spaces
Murtagh, FD
[J]. STATISTICAL CHALLENGES IN ASTRONOMY, 2003, : 279 - 292
[9] Compressive Clustering of High-dimensional Data
Ruta, Andrzej
Porikli, Fatih
[J]. 2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 380 - 385
[10] A feature group weighting method for subspace clustering of high-dimensional data
Chen, Xiaojun
Ye, Yunming
Xu, Xiaofei
Huang, Joshua Zhexue
[J]. PATTERN RECOGNITION, 2012, 45 (01) : 434 - 446

← 1 2 3 4 5 →