Clustering Algorithms for Chains

Author
Ukkonen, Antti [1]
Affiliation
[1] Yahoo Res, Barcelona 08018, Spain
Funding
Academy of Finland
Keywords
Lloyd's algorithm; orders; preference statements; planted partition model; randomization testing
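DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract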
We consider the problem of clustering a set of chains into k clusters. A chain is a totally ordered subset of a finite set of items. Chains are an intuitive way to express preferences over a set of alternatives, as well as a useful representation of ratings in situations where the item-specific scores are difficult to obtain, too noisy due to measurement error, or simply less relevant than the order they induce over the items. First, we adapt the classical k-means algorithm to chains by proposing a suitable distance function and a centroid structure. We also present two different approaches for mapping chains to a vector space. The first is related to the planted partition model, while the second has an intuitive geometrical interpretation. Finally, we discuss a randomization test for assessing the significance of a clustering. To this end, we present an MCMC algorithm for sampling random sets of chains that share certain properties with the original data. The methods are studied in a series of experiments using real and artificial data. Results indicate that the methods produce interesting clusterings and, for certain types of inputs, improve upon previous work on clustering algorithms for orders.
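To make the definition of a chain concrete, here is a minimal Python sketch. It represents a chain as a tuple of distinct items listed from most to least preferred, and compares two chains by counting the item pairs they both contain but rank in opposite order. The function names `is_chain` and `pairwise_disagreements` are hypothetical, and the disagreement count is only an illustrative assumption: the abstract does not specify the paper's distance function or centroid structure, so this should not be read as the author's method.

```python
from itertools import combinations


def is_chain(chain, items):
    """Check that `chain` is a totally ordered subset of `items`:
    every element comes from `items` and no element is repeated."""
    return len(set(chain)) == len(chain) and set(chain) <= set(items)


def pairwise_disagreements(a, b):
    """Count pairs of items that appear in both chains but are
    ranked in opposite order (illustrative assumption, not the
    distance function proposed in the paper)."""
    pos_a = {x: i for i, x in enumerate(a)}
    pos_b = {x: i for i, x in enumerate(b)}
    # Only items present in both chains can be compared.
    shared = [x for x in a if x in pos_b]
    return sum(
        1
        for u, v in combinations(shared, 2)
        if (pos_a[u] - pos_a[v]) * (pos_b[u] - pos_b[v]) < 0
    )


# Hypothetical example data: three chains over a five-item set.
items = {"a", "b", "c", "d", "e"}
c1 = ("a", "b", "c")
c2 = ("c", "a", "d")
c3 = ("b", "a", "c")

for chain in (c1, c2, c3):
    assert is_chain(chain, items)

print(pairwise_disagreements(c1, c2))
print(pairwise_disagreements(c1, c3))
```

Because chains may cover different subsets of the items, the sketch compares only the items two chains share. Chains with fewer than two items in common therefore always score zero, which is one reason a dedicated distance function and centroid structure are needed for clustering.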
Pages: 1389-1423
Number of pages: 35