Clustering item data sets with association-taxonomy similarity

被引：1

作者：

Yun, CH ^{[1
]}

Chuang, KT ^{[1
]}

Chen, MS ^{[1
]}

机构：

[1] Natl Taiwan Univ, Dept Elect Engn, Grad Inst Commun Engn, Taipei, Taiwan

来源：

THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS | 2003年

关键词：

D O I：

10.1109/ICDM.2003.1251011

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We explore in this paper the efficient clustering of item data. Different from those of the traditional data, the features of item data are known to be of high dimensionality and sparsity. In view of the features of item data, we devise in this paper a novel measurement, called the association-taxonomy similarity, and utilize this measurement to perform the clustering. With this association-taxonomy similarity measurement, we develop an efficient clustering algorithm, called algorithm AT (standing for Association-Taxonomy), for item data. Two validation indexes based on association and taxonomy properties are also devised to assess the quality of clustering for item data. As validated by the real dataset, it is shown by our experimental results that algorithm AT devised in this paper significantly outperforms the prior works in the clustering quality as measured by the validation indexes, indicating the usefulness of association-taxonomy similarity in item data clustering.

引用

页码：697 / 700

页数：4

共 50 条

[41] Image-mapped data clustering: An efficient technique for clustering large data sets
Al-Omari, Faruq
Al-Fayoumi, Nabeel
Al-Jarrah, Mohammad
INTELLIGENT DATA ANALYSIS, 2008, 12 (06) : 573 - 586
[42] Generation of Frequent Item Sets in Multidimensional Data by Means of Templates for Mining Inter-Dimensional Association Rules
Fisun, Mykola
Kulakovska, Inessa
Horban, Hlib
2015 IEEE 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS (IDAACS), VOLS 1-2, 2015, : 368 - 375
[43] Asymmetric Item-Item Similarity Measure for Linked Open Data Enabled Collaborative Filtering
Mao, Chengwang
Xu, Zhuoming
Wang, Xiuli
2017 14TH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE (WISA 2017), 2017, : 228 - 233
[44] Bayesian nonparametric clustering for large data sets
Daiane Aparecida Zuanetti
Peter Müller
Yitan Zhu
Shengjie Yang
Yuan Ji
Statistics and Computing, 2019, 29 : 203 - 215
[45] Batch Clustering Algorithm for Big Data Sets
Alguliyev, Rasim
Aliguliyev, Ramiz
Bagirov, Adil
Karimov, Rafael
2016 IEEE 10TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2016, : 79 - 82
[46] Clustering Algorithms for Large Temporal Data Sets
Scepi, Germana
DATA ANALYSIS AND CLASSIFICATION, 2010, : 369 - 377
[47] A New Clustering Algorithm On Nominal Data Sets
Wang, Bin
INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 605 - 610
[48] Clustering Analysis for Large Scale Data Sets
Singh, Sachin
Mishra, Ashish
2015 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION & AUTOMATION (ICCCA), 2015, : 1 - 4
[49] CLUSTERING OF LARGE DATA SETS - ZUPAN,J
EVERITT, BS
STATISTICIAN, 1983, 32 (03): : 355 - 355
[50] Bayesian nonparametric clustering for large data sets
Zuanetti, Daiane Aparecida
Mueller, Peter
Zhu, Yitan
Yang, Shengjie
Ji, Yuan
STATISTICS AND COMPUTING, 2019, 29 (02) : 203 - 215

← 1 2 3 4 5 →