Clustering item data sets with association-taxonomy similarity

被引:1
|
作者
Yun, CH [1 ]
Chuang, KT [1 ]
Chen, MS [1 ]
机构
[1] Natl Taiwan Univ, Dept Elect Engn, Grad Inst Commun Engn, Taipei, Taiwan
关键词
D O I
10.1109/ICDM.2003.1251011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We explore in this paper the efficient clustering of item data. Different from those of the traditional data, the features of item data are known to be of high dimensionality and sparsity. In view of the features of item data, we devise in this paper a novel measurement, called the association-taxonomy similarity, and utilize this measurement to perform the clustering. With this association-taxonomy similarity measurement, we develop an efficient clustering algorithm, called algorithm AT (standing for Association-Taxonomy), for item data. Two validation indexes based on association and taxonomy properties are also devised to assess the quality of clustering for item data. As validated by the real dataset, it is shown by our experimental results that algorithm AT devised in this paper significantly outperforms the prior works in the clustering quality as measured by the validation indexes, indicating the usefulness of association-taxonomy similarity in item data clustering.
引用
收藏
页码:697 / 700
页数:4
相关论文
共 50 条
  • [41] Image-mapped data clustering: An efficient technique for clustering large data sets
    Al-Omari, Faruq
    Al-Fayoumi, Nabeel
    Al-Jarrah, Mohammad
    INTELLIGENT DATA ANALYSIS, 2008, 12 (06) : 573 - 586
  • [42] Generation of Frequent Item Sets in Multidimensional Data by Means of Templates for Mining Inter-Dimensional Association Rules
    Fisun, Mykola
    Kulakovska, Inessa
    Horban, Hlib
    2015 IEEE 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS (IDAACS), VOLS 1-2, 2015, : 368 - 375
  • [43] Asymmetric Item-Item Similarity Measure for Linked Open Data Enabled Collaborative Filtering
    Mao, Chengwang
    Xu, Zhuoming
    Wang, Xiuli
    2017 14TH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE (WISA 2017), 2017, : 228 - 233
  • [44] Bayesian nonparametric clustering for large data sets
    Daiane Aparecida Zuanetti
    Peter Müller
    Yitan Zhu
    Shengjie Yang
    Yuan Ji
    Statistics and Computing, 2019, 29 : 203 - 215
  • [45] Batch Clustering Algorithm for Big Data Sets
    Alguliyev, Rasim
    Aliguliyev, Ramiz
    Bagirov, Adil
    Karimov, Rafael
    2016 IEEE 10TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2016, : 79 - 82
  • [46] Clustering Algorithms for Large Temporal Data Sets
    Scepi, Germana
    DATA ANALYSIS AND CLASSIFICATION, 2010, : 369 - 377
  • [47] A New Clustering Algorithm On Nominal Data Sets
    Wang, Bin
    INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 605 - 610
  • [48] Clustering Analysis for Large Scale Data Sets
    Singh, Sachin
    Mishra, Ashish
    2015 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION & AUTOMATION (ICCCA), 2015, : 1 - 4
  • [49] CLUSTERING OF LARGE DATA SETS - ZUPAN,J
    EVERITT, BS
    STATISTICIAN, 1983, 32 (03): : 355 - 355
  • [50] Bayesian nonparametric clustering for large data sets
    Zuanetti, Daiane Aparecida
    Mueller, Peter
    Zhu, Yitan
    Yang, Shengjie
    Ji, Yuan
    STATISTICS AND COMPUTING, 2019, 29 (02) : 203 - 215