The K-clique Densest Subgraph Problem

被引:139
|
作者
Tsourakakis, Charalampos E. [1 ]
机构
[1] Harvard Sch Engn & Appl Sci, Cambridge, MA 02138 USA
关键词
Densest subgraph problem; Graph algorithms; Graph Mining; Near-clique extraction; ALGORITHM; MAPREDUCE; MOTIFS;
D O I
10.1145/2736277.2741098
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Numerous graph mining applications rely on detecting subgraphs which are large near-cliques. Since formulations that are geared towards finding large near-cliques are NP-hard and frequently inapproximable due to connections with the Maximum Clique problem, the poly-time solvable densest subgraph problem which maximizes the average degree over all possible subgraphs "lies at the core of large scale data mining"[10]. However, frequently the densest subgraph problem fails in detecting large near-cliques in networks. In this work, we introduce the k-clique densest subgraph problem, k >= 2. This generalizes the well studied densest subgraph problem which is obtained as a special case for k = 2. For k = 3 we obtain a novel formulation which we refer to as the triangle densest subgraph problem : given a graph G (V, E), find a subset of vertices S* such that tau (S*) = (max)(S subset of V) t (S)/vertical bar S vertical bar, where t (S) is the number of triangles induced by the set S. On the theory side, we prove that for any k constant, there exist an exact polynomial time algorithm for the k-clique densest subgraph problem. Furthermore, we propose an efficient 1/k-approximation algorithm which generalizes the greedy peeling algorithm of Asahiro and Charikar [8, 18] for k = 2. Finally, we show how to implement efficiently this peeling framework on MapReduce for any k >= 3, generalizing the work of Bahmani, Kumar and Vassilvitskii for the case k = 2 [10]. On the empirical side, our two main findings are that (i) the triangle densest subgraph is consistently closer to being a large near-clique compared to the densest subgraph and (ii) the peeling approximation algorithms for both k = 2 and k = 3 achieve on real-world networks approximation ratios closer to 1 rather than the pessimistic 1 k guarantee. An interesting consequence of our work is that triangle counting, a well-studied computational problem in the context of social network analysis can be used to detect large near-cliques. Finally, we evaluate our proposed method on a popular graph mining application.
引用
收藏
页码:1122 / 1132
页数:11
相关论文
共 50 条
  • [1] The densest k-subgraph problem on clique graphs
    Maria Liazi
    Ioannis Milis
    Fanny Pascual
    Vassilis Zissimopoulos
    [J]. Journal of Combinatorial Optimization, 2007, 14 : 465 - 474
  • [2] The densest k-subgraph problem on clique graphs
    Liazi, Maria
    Milis, Ioannis
    Pascual, Fanny
    Zissimopoulos, Vassilis
    [J]. JOURNAL OF COMBINATORIAL OPTIMIZATION, 2007, 14 (04) : 465 - 474
  • [3] Approximation algorithms for the k-clique covering problem
    Goldschmidt, O
    Hochbaum, DS
    Hurkens, C
    Yu, G
    [J]. SIAM JOURNAL ON DISCRETE MATHEMATICS, 1996, 9 (03) : 492 - 509
  • [4] KClist++: A Simple Algorithm for Finding k-Clique Densest Subgraphs in Large Graphs
    Sun, Bintao
    Danisch, Maximilien
    Chan, T-H Hubert
    Sozio, Mauro
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (10): : 1628 - 1640
  • [5] The Binomial k-Clique
    Narayana, Nithya Sai
    Sane, Sharad
    [J]. JOURNAL OF COMBINATORIAL DESIGNS, 2013, 21 (01) : 36 - 45
  • [6] The Densest k Subgraph Problem in b-Outerplanar Graphs
    Gonzales, Sean
    Migler, Theresa
    [J]. COMPLEX NETWORKS AND THEIR APPLICATIONS VIII, VOL 1, 2020, 881 : 116 - 127
  • [7] On solving the densest k-subgraph problem on large graphs
    Sotirov, Renata
    [J]. OPTIMIZATION METHODS & SOFTWARE, 2020, 35 (06): : 1160 - 1178
  • [8] A Cohesive Subgraph Visualization-Based Approach to Efficiently Discover Large k-Clique Community
    Xu, Kaikuo
    He, Jia
    Zou, Surong
    Zhang, Hongwei
    Yan, Tianyun
    Wei, Xuzhong
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2012, 37 (07) : 1959 - 1968
  • [9] Faster Combinatorial k-Clique Algorithms
    Abboud, Amir
    Fischer, Nick
    Shechter, Yarin
    [J]. LATIN 2024: THEORETICAL INFORMATICS, PT I, 2024, 14578 : 193 - 206
  • [10] Doubly Nonnegative and Semidefinite Relaxations for the Densest k-Subgraph Problem
    Guo, Chuan-Hao
    Guo, Yuan
    Liu, Bei-Bei
    [J]. ENTROPY, 2019, 21 (02):