A Learned Sketch for Subgraph Counting

被引:22
|
作者
Zhao, Kangfei [1 ]
Yu, Jeffrey Xu [1 ]
Zhang, Hao [1 ]
Li, Qiyan [2 ]
Rong, Yu [3 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Wuhan Univ, Wuhan, Peoples R China
[3] Tecent AI Lab, Shenzhen, Peoples R China
关键词
Subgraph counting; Deep learning; CARDINALITY ESTIMATION; PREDICTION; ALGORITHM; GRAPHLETS; NETWORKS; QUERIES; BOUNDS; ORDER;
D O I
10.1145/3448016.3457289
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Subgraph counting, as a fundamental problem in network analysis, is to count the number of subgraphs in a data graph that match a given query graph by either homomorphism or subgraph isomorphism. The importance of subgraph counting derives from the fact that it provides insights of a large graph, in particular a labeled graph, when a collection of query graphs with different sizes and labels are issued. The problem of counting is challenging. On one hand, exact counting by enumerating subgraphs is NP-hard. On the other hand, approximate counting by subgraph isomorphism can only support 3/5-node query graphs over unlabeled graphs. Another way for subgraph counting is to specify it as an SQL query and estimate the cardinality of the query in RDBMS. Existing approaches for cardinality estimation can only support subgraph counting by homomorphism up to some extent, as it is difficult to deal with sampling failure when a query graph becomes large. A question that arises is if subgraph counting can be supported by machine learning (ML) and deep learning (DL). The existing DL approach for subgraph isomorphism can only support small data graphs. The ML/DL approaches proposed in RDBMS context for approximate query processing and cardinality estimation cannot be used, as subgraph counting is to do complex self-joins over one relation, whereas existing approaches focus on multiple relations. In this paper, we propose an Active Learned Sketch for Subgraph Counting (ALSS) with two main components: a sketch learned (LSS) and an active learner (AL). The sketch is learned by a neural network regression model, and the active learner is to perform model updates based on new arrival test query graphs. We conduct extensive experimental studies to confirm the effectiveness and efficiency of ALSS using large real labeled graphs. Moreover, we show that ALSS can assist query optimizers to find a better query plan for complex multi-way self-joins.
引用
收藏
页码:2142 / 2155
页数:14
相关论文
共 50 条
  • [31] Counting copies of a fixed subgraph in F-free graphs
    Gerbner, Daniel
    Palmer, Cory
    EUROPEAN JOURNAL OF COMBINATORICS, 2019, 82
  • [32] Some applications of Wagner's weighted subgraph counting polynomial
    Bencs, Ferenc
    Csikvari, Peter
    Regts, Guus
    ELECTRONIC JOURNAL OF COMBINATORICS, 2021, 28 (04):
  • [33] Robust subgraph counting with distribution-free random graph analysis
    van Leeuwaarden, Johan S. H.
    Stegehuis, Clara
    PHYSICAL REVIEW E, 2021, 104 (04)
  • [34] A Color-Avoiding Approach to Subgraph Counting in Bounded Expansion Classes
    Reidl, Felix
    Sullivan, Blair D. D.
    ALGORITHMICA, 2023, 85 (08) : 2318 - 2347
  • [35] An efficiently computable subgraph pattern support measure: counting independent observations
    Wang, Yuyi
    Ramon, Jan
    Fannes, Thomas
    DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 27 (03) : 444 - 477
  • [36] A Color-Avoiding Approach to Subgraph Counting in Bounded Expansion Classes
    Felix Reidl
    Blair D. Sullivan
    Algorithmica, 2023, 85 : 2318 - 2347
  • [37] Improving Graph Neural Network Expressivity via Subgraph Isomorphism Counting
    Bouritsas, Giorgos
    Frasca, Fabrizio
    Zafeiriou, Stefanos
    Bronstein, Michael M.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 657 - 668
  • [38] An efficiently computable subgraph pattern support measure: counting independent observations
    Yuyi Wang
    Jan Ramon
    Thomas Fannes
    Data Mining and Knowledge Discovery, 2013, 27 : 444 - 477
  • [39] Reconstructing subgraph-counting graph polynomials of increasing families of graphs
    Bresar, B
    Imrich, W
    Klavzar, S
    DISCRETE MATHEMATICS, 2005, 297 (1-3) : 159 - 166
  • [40] A Survey on Subgraph Counting: Concepts, Algorithms, and Applications to Network Motifs and Graphlets
    Ribeiro, Pedro
    Paredes, Pedro
    Silva, Miguel E. P.
    Aparicio, David
    Silva, Fernando
    ACM COMPUTING SURVEYS, 2021, 54 (02)