A Learned Sketch for Subgraph Counting

Cited by: 22
Authors
Zhao, Kangfei [1]
Yu, Jeffrey Xu [1]
Zhang, Hao [1]
Li, Qiyan [2]
Rong, Yu [3]
Affiliations
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Wuhan Univ, Wuhan, Peoples R China
[3] Tencent AI Lab, Shenzhen, Peoples R China
Keywords
Subgraph counting; Deep learning; CARDINALITY ESTIMATION; PREDICTION; ALGORITHM; GRAPHLETS; NETWORKS; QUERIES; BOUNDS; ORDER;
DOI
10.1145/3448016.3457289
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Subgraph counting, a fundamental problem in network analysis, is to count the number of subgraphs in a data graph that match a given query graph by either homomorphism or subgraph isomorphism. The importance of subgraph counting derives from the fact that it provides insights into a large graph, in particular a labeled graph, when a collection of query graphs with different sizes and labels is issued. The problem of counting is challenging. On one hand, exact counting by enumerating subgraphs is NP-hard. On the other hand, approximate counting by subgraph isomorphism can only support 3/5-node query graphs over unlabeled graphs. Another way to do subgraph counting is to specify it as an SQL query and estimate the cardinality of the query in an RDBMS. Existing approaches for cardinality estimation can only support subgraph counting by homomorphism to some extent, as it is difficult to deal with sampling failure when a query graph becomes large. A question that arises is whether subgraph counting can be supported by machine learning (ML) and deep learning (DL). The existing DL approach for subgraph isomorphism can only support small data graphs. The ML/DL approaches proposed in the RDBMS context for approximate query processing and cardinality estimation cannot be used, as subgraph counting requires complex self-joins over one relation, whereas existing approaches focus on multiple relations. In this paper, we propose an Active Learned Sketch for Subgraph Counting (ALSS) with two main components: a learned sketch (LSS) and an active learner (AL). The sketch is learned by a neural network regression model, and the active learner performs model updates based on newly arriving test query graphs. We conduct extensive experimental studies to confirm the effectiveness and efficiency of ALSS using large real labeled graphs. Moreover, we show that ALSS can assist query optimizers in finding a better query plan for complex multi-way self-joins.
Pages: 2142-2155
Page count: 14
Related papers
50 in total
  • [21] Parallel Clique-Like Subgraph Counting and Listing
    Yang, Yi
    Yan, Da
    Zhou, Shuigeng
    Guo, Guimu
    CONCEPTUAL MODELING, ER 2019, 2019, 11788 : 484 - 497
  • [22] SubGraph2Vec: Highly-Vectorized Tree-like Subgraph Counting
    Chen, Langshi
    Li, Jiayu
    Sahinalp, Cenk
    Marathe, Madhav
    Vullikanti, Anil
    Nikolaev, Andrey
    Smirnov, Egor
    Israfilov, Ruslan
    Qiu, Judy
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 483 - 492
  • [23] The Complexity of Average-Case Dynamic Subgraph Counting
    Henzinger, Monika
    Lincoln, Andrea
    Saha, Barna
    PROCEEDINGS OF THE 2022 ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, SODA, 2022, : 459 - 498
  • [24] DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting
    Fu, Tianyu
    Wei, Chiyue
    Wang, Yu
    Ying, Rex
    PROCEEDINGS OF THE 17TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2024, 2024, : 218 - 227
  • [25] Parameterised and Fine-Grained Subgraph Counting, Modulo 2
    Goldberg, Leslie Ann
    Roth, Marc
    Algorithmica, 2024, 86 : 944 - 1005
  • [26] Termination of Graph Transformation Systems Using Weighted Subgraph Counting
    Overbeek, Roy
    Endrullis, Jorg
    GRAPH TRANSFORMATION, ICGT 2023, 2023, 13961 : 81 - 101
  • [27] Fuzzy Differential Privacy Theory and Its Applications in Subgraph Counting
    Hou, Yongchao
    Xia, Xiaofang
    Li, Hui
    Cui, Jiangtao
    Mardani, Abbas
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2023, 31 (02) : 356 - 369
  • [28] MAGO: Maliciously Secure Subgraph Counting on Decentralized Social Graphs
    Wang, Songlei
    Zheng, Yifeng
    Jia, Xiaohua
    Wang, Qian
    Wang, Cong
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 2929 - 2944
  • [29] TERMINATION OF GRAPH TRANSFORMATION SYSTEMS USING WEIGHTED SUBGRAPH COUNTING
    Overbeek, Roy
    Endrullis, Jorg
    LOGICAL METHODS IN COMPUTER SCIENCE, 2024, 20 (04)
  • [30] Parameterised and Fine-Grained Subgraph Counting, Modulo 2
    Goldberg, Leslie Ann
    Roth, Marc
    ALGORITHMICA, 2024, 86 (04) : 944 - 1005