The K-clique Densest Subgraph Problem

被引:147
|
作者
Tsourakakis, Charalampos E. [1 ]
机构
[1] Harvard Sch Engn & Appl Sci, Cambridge, MA 02138 USA
关键词
Densest subgraph problem; Graph algorithms; Graph Mining; Near-clique extraction; ALGORITHM; MAPREDUCE; MOTIFS;
D O I
10.1145/2736277.2741098
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Numerous graph mining applications rely on detecting subgraphs which are large near-cliques. Since formulations that are geared towards finding large near-cliques are NP-hard and frequently inapproximable due to connections with the Maximum Clique problem, the poly-time solvable densest subgraph problem which maximizes the average degree over all possible subgraphs "lies at the core of large scale data mining"[10]. However, frequently the densest subgraph problem fails in detecting large near-cliques in networks. In this work, we introduce the k-clique densest subgraph problem, k >= 2. This generalizes the well studied densest subgraph problem which is obtained as a special case for k = 2. For k = 3 we obtain a novel formulation which we refer to as the triangle densest subgraph problem : given a graph G (V, E), find a subset of vertices S* such that tau (S*) = (max)(S subset of V) t (S)/vertical bar S vertical bar, where t (S) is the number of triangles induced by the set S. On the theory side, we prove that for any k constant, there exist an exact polynomial time algorithm for the k-clique densest subgraph problem. Furthermore, we propose an efficient 1/k-approximation algorithm which generalizes the greedy peeling algorithm of Asahiro and Charikar [8, 18] for k = 2. Finally, we show how to implement efficiently this peeling framework on MapReduce for any k >= 3, generalizing the work of Bahmani, Kumar and Vassilvitskii for the case k = 2 [10]. On the empirical side, our two main findings are that (i) the triangle densest subgraph is consistently closer to being a large near-clique compared to the densest subgraph and (ii) the peeling approximation algorithms for both k = 2 and k = 3 achieve on real-world networks approximation ratios closer to 1 rather than the pessimistic 1 k guarantee. An interesting consequence of our work is that triangle counting, a well-studied computational problem in the context of social network analysis can be used to detect large near-cliques. Finally, we evaluate our proposed method on a popular graph mining application.
引用
收藏
页码:1122 / 1132
页数:11
相关论文
共 50 条
  • [21] A constant approximation algorithm for the densest k-subgraph problem on chordal graphs
    Liazi, Maria
    Milis, Ioannis
    Zissimopoulos, Vassilis
    INFORMATION PROCESSING LETTERS, 2008, 108 (01) : 29 - 32
  • [22] A Survey on the Densest Subgraph Problem and Its Variants
    Lanciano, Tommaso
    Miyauchi, Atsushi
    Fazzone, Adriano
    Bonchi, Francesco
    ACM COMPUTING SURVEYS, 2024, 56 (08)
  • [23] THE DENSEST SUBGRAPH PROBLEM IN SPARSE RANDOM GRAPHS
    Anantharam, Venkat
    Salez, Justin
    ANNALS OF APPLIED PROBABILITY, 2016, 26 (01): : 305 - 327
  • [24] Heterogeneous quantum computing for satellite constellation optimization: solving the weighted k-clique problem
    Bass, Gideon
    Tomlin, Casey
    Kumar, Vaibhaw
    Rihaczek, Pete
    Dulny, Joseph, III
    QUANTUM SCIENCE AND TECHNOLOGY, 2018, 3 (02):
  • [25] THE MONOTONE COMPLEXITY OF k-CLIQUE ON RANDOM GRAPHS
    Rossman, Benjamin
    SIAM JOURNAL ON COMPUTING, 2014, 43 (01) : 256 - 279
  • [26] The Monotone Complexity of k-Clique on Random Graphs
    Rossman, Benjamin
    2010 IEEE 51ST ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, 2010, : 193 - 201
  • [27] An Algorithm to Discover the k-Clique Cover in Networks
    Cavique, Luis
    Mendes, Armando B.
    Santos, Jorge M. A.
    PROGRESS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5816 : 363 - +
  • [28] On the Constant-Depth Complexity of k-Clique
    Rossman, Benjamin
    STOC'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL SYMPOSIUM ON THEORY OF COMPUTING, 2008, : 721 - 730
  • [29] Lightning Fast and Space Efficient k-clique Counting
    Ye, Xiaowei
    Li, Rong-Hua
    Dai, Qiangqiang
    Chen, Hongzhi
    Wang, Guoren
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 1191 - 1202
  • [30] Automatic Generation of Opaque Constants Based on the K-Clique Problem for Resilient Data Obfuscation
    Tiella, Roberto
    Ceccato, Mariano
    2017 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), 2017, : 182 - 192