Adaptive Hierarchical Clustering Using Ordinal Queries

被引:0
|
作者
Emamjomeh-Zadeh, Ehsan [1 ]
Kempe, David [1 ]
机构
[1] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
基金
美国国家科学基金会;
关键词
BINARY SEARCH; APPROXIMATION; TREE;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In many applications of clustering (for example, ontologies or clusterings of animal or plant species), hierarchical clusterings are more descriptive than a flat clustering. A hierarchical clustering over n elements is represented by a rooted binary tree with n leaves, each corresponding to one element. The subtrees rooted at interior nodes capture the clusters. In this paper, we study active learning of a hierarchical clustering using only ordinal queries. An ordinal query consists of a set of three elements, and the response to a query reveals the two elements (among the three elements in the query) which are "closer" to each other than to the third one. We say that elements x and x' are closer to each other than x '' if there exists a cluster containing x and x', but not x ''. When all the query responses are correct, there is a deterministic algorithm that learns the underlying hierarchical clustering using at most n log(2) n adaptive ordinal queries. We generalize this algorithm to be robust in a model in which each query response is correct independently with probability p > 1/2, and adversarially incorrect with probability 1 - p. We show that in the presence of noise, our algorithm outputs the correct hierarchical clustering with probability at least 1 - delta, using O(n log n + n log(1/delta)) adaptive ordinal queries. For our results, adaptivity is crucial: we prove that even in the absence of noise, every non-adaptive algorithm requires Omega(n(3)) ordinal queries in the worst case.
引用
收藏
页码:415 / 429
页数:15
相关论文
共 50 条
  • [1] Optimal Clustering in Stable Instances Using Combinations of Exact and Noisy Ordinal Queries
    Bianchi, Enrico
    Penna, Paolo
    ALGORITHMS, 2021, 14 (02)
  • [2] Hierarchical clustering based on ordinal consistency
    Lee, JWT
    Yeung, DS
    Tsang, ECC
    PATTERN RECOGNITION, 2005, 38 (11) : 1913 - 1925
  • [3] Adaptive document clustering using incrementally expanded queries
    Eguchi, Koji
    Ito, Hidetaka
    Kumamoto, Akira
    Kanata, Yakichi
    Systems and Computers in Japan, 2001, 32 (02) : 64 - 74
  • [4] Hierarchical adaptive clustering
    Serban, Gabriela
    Campan, Alina
    INFORMATICA, 2008, 19 (01) : 101 - 112
  • [5] Correlation Clustering with Adaptive Similarity Queries
    Bressan, Marco
    Cesa-Bianchi, Nicolo
    Paudice, Andrea
    Vitale, Fabio
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [6] An agglomerative hierarchical clustering algorithm for linear ordinal rankings
    Liu, Nana
    Xu, Zeshui
    Zeng, Xiao-Jun
    Ren, Peijia
    INFORMATION SCIENCES, 2021, 557 : 170 - 193
  • [7] ADAPTIVE HIERARCHICAL CLUSTERING SCHEMES
    ROHLF, FJ
    SYSTEMATIC ZOOLOGY, 1970, 19 (01): : 58 - &
  • [8] Scalable adaptive hierarchical clustering
    Mathy, L
    Canonico, R
    Simpson, S
    Hutchison, D
    IEEE COMMUNICATIONS LETTERS, 2002, 6 (03) : 117 - 119
  • [9] Hierarchical co-clustering for web queries and selected URLs
    Hosseini, Mehdi
    Abolhassani, Hassan
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2007, PROCEEDINGS, 2007, 4831 : 653 - 662
  • [10] An adaptive parallel hierarchical clustering algorithm
    Li, Zhaopeng
    Li, Kenli
    Xiao, Degui
    Yang, Lei
    HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2007, 4782 : 97 - 107