A Dynamic Programming Framework for Large-Scale Online Clustering on Graphs

被引:0
|
作者
Yantao Li
Xiang Zhao
Zehui Qu
机构
[1] Chongqing University,College of Computer Science
[2] Southwest University,College of Computer and Information Sciences
来源
Neural Processing Letters | 2020年 / 52卷
关键词
Online graph clustering; Large-scale graphs; supernodes; Running time; Efficiency;
D O I
暂无
中图分类号
学科分类号
摘要
As a fundamental technique for data analysis, graph clustering grouping graph data into clusters has attracted great attentions in recent years. In this paper, we present DPOCG, a dynamic programming framework for large-scale online clustering on graphs, which improves the scalability of a wide range of graph clustering algorithms. Specifically, DPOCG first identifies the nodes whose states are unchanged compared with the states at the previous time on a large-scale graph, then constructs these unchanged nodes as supernodes, which greatly reduces the size of the graph at the current time, and collapses nodes whose degrees are less than a predefined threshold. Based on our density-based graph clustering algorithm (DGCM), DPOCG partitions the reduced graph into clusters. In addition, we theoretically analyze DPOCG in terms of supernode generation, clustering on reduced graph, and computational complexity. We evaluate DPOCG on a synthetic dataset and seven real-world datasets, respectively, and the experimental results show that DPOCG consumes less running time and improves the efficiency of clustering.
引用
收藏
页码:1613 / 1629
页数:16
相关论文
共 50 条
  • [1] A Dynamic Programming Framework for Large-Scale Online Clustering on Graphs
    Li, Yantao
    Zhao, Xiang
    Qu, Zehui
    [J]. NEURAL PROCESSING LETTERS, 2020, 52 (02) : 1613 - 1629
  • [2] Large-Scale Clustering Using Mathematical Programming
    Gnagi, Mario
    Baumann, Philipp
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2017, : 789 - 793
  • [3] Adaptive Partitioning of Large-Scale Dynamic Graphs
    Vaquero, Luis M.
    Cuadrado, Felix
    Logothetis, Dionysios
    Martella, Claudio
    [J]. 2014 IEEE 34TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2014), 2014, : 144 - 153
  • [4] Large-scale Nonlinear Programming: An Integrating Framework for Enterprise-Wide Dynamic Optimization
    Biegler, Lorenz T.
    [J]. 17TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING, 2007, 24 : 575 - 582
  • [5] A distributed clustering algorithm for large-scale dynamic networks
    Bernard, Thibault
    Bui, Alain
    Pilard, Laurence
    Sohier, Devan
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2012, 15 (04): : 335 - 350
  • [6] A distributed clustering algorithm for large-scale dynamic networks
    Thibault Bernard
    Alain Bui
    Laurence Pilard
    Devan Sohier
    [J]. Cluster Computing, 2012, 15 : 335 - 350
  • [7] Online Clustering Modeling of Large-Scale Photovoltaic Power Plants
    Ma, Zhimin
    Zheng, Jinghong
    Zhu, Shouzhen
    Shen, Xinwei
    Wei, Ling
    Wang, Xiaoyu
    Men, Kun
    [J]. 2015 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, 2015,
  • [8] Large-scale financial planning via a partially observable stochastic dual dynamic programming framework
    Lee, Jinkyu
    Kwon, Do-Gyun
    Lee, Yongjae
    Kim, Jang Ho
    Kim, Woo Chang
    [J]. QUANTITATIVE FINANCE, 2023, 23 (09) : 1341 - 1360
  • [9] Hub Labels on the database for large-scale graphs with the COLD framework
    Efentakis, Alexandros
    Efstathiades, Christodoulos
    Pfoser, Dieter
    [J]. GEOINFORMATICA, 2017, 21 (04) : 703 - 732
  • [10] Hub Labels on the database for large-scale graphs with the COLD framework
    Alexandros Efentakis
    Christodoulos Efstathiades
    Dieter Pfoser
    [J]. GeoInformatica, 2017, 21 : 703 - 732