Convex Hierarchical Clustering for Graph-Structured Data

被引:0
|
作者
Donnat, Claire [1 ]
Holmes, Susan [1 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
关键词
ALGORITHMS;
D O I
10.1109/ieeeconf44664.2019.9048653
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convex clustering [6] is a recent stable alternative to hierarchical clustering. It formulates the recovery of progressively coalescing clusters as a regularized convex problem. While convex clustering was originally designed for handling Euclidean distances between data points, in a growing number of applications, the data is directly characterized by a similarity matrix or weighted graph. In this paper, we extend the robust hierarchical clustering approach to these broader classes of similarities. Having defined an appropriate convex objective, the crux of this adaptation lies in our ability to provide: (a) an efficient recovery of the regularization path and (b) an empirical demonstration of the use of our method. We address the first challenge through a proximal dual algorithm, for which we characterize both the theoretical efficiency and the empirical performance on a set of experiments. Finally, we highlight the potential of our method by showing its application to several real-life datasets, thus providing a natural extension to the current scope of applications of convex clustering.
引用
收藏
页码:1999 / 2006
页数:8
相关论文
共 50 条
  • [21] Exploiting local similarity for indexing paths in graph-structured data
    Kaushik, R
    Shenoy, P
    Bohannon, P
    Gudes, E
    [J]. 18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 129 - 140
  • [22] Labeling scheme and structural joins for graph-structured XML data
    Wang, HZ
    Wang, W
    Lin, XM
    Li, JZ
    [J]. WEB TECHNOLOGIES RESEARCH AND DEVELOPMENT - APWEB 2005, 2005, 3399 : 277 - 289
  • [23] Expressive Languages for Path Queries over Graph-Structured Data
    Barcelo, Pablo
    Libkin, Leonid
    Lin, Anthony W.
    Wood, Peter T.
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2012, 37 (04):
  • [24] A New Reachability Query Method for Graph-structured XML Data
    Lu Yan
    Ma, Funing
    Chu, Shanzhong
    [J]. ADVANCES IN COMPUTING, CONTROL AND INDUSTRIAL ENGINEERING, 2012, 235 : 394 - +
  • [25] Managing Change in Graph-Structured Data Using Description Logics
    Ahmetaj, Shqiponja
    Calvanese, Diego
    Ortiz, Magdalena
    Simkus, Mantas
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 966 - 973
  • [26] Constructing a decision tree for graph-structured data and its applications
    Geamsakul, W
    Yoshida, T
    Ohara, K
    Motoda, H
    Yokoi, H
    Takabayashi, K
    [J]. FUNDAMENTA INFORMATICAE, 2005, 66 (1-2) : 131 - 160
  • [27] Managing Change in Graph-Structured Data Using Description Logics
    Ahmetaj, Shqiponja
    Calvanese, Diego
    Ortiz, Magdalena
    Simkus, Mantas
    [J]. ACM TRANSACTIONS ON COMPUTATIONAL LOGIC, 2017, 18 (04)
  • [28] Visualization and classification of graph-structured data: the case of the Enron dataset
    Bouveyron, Charles
    Chipman, Hugh
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1506 - 1517
  • [29] Expressive Languages for Path Queries over Graph-Structured Data
    Barcelo, Pablo
    Hurtado, Carlos
    Libkin, Leonid
    Wood, Peter
    [J]. PODS 2010: PROCEEDINGS OF THE TWENTY-NINTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2010, : 3 - 14
  • [30] What can we do with graph-structured data? A data mining perspective
    Motoda, Hiroshi
    [J]. AI 2006: Advances in Artificial Intelligence, Proceedings, 2006, 4304 : 1 - 2