Fast Top-K Graph Similarity Search Via Representative Matrices

Cited by: 2
Authors
Sun, Zhigang [1]
Huo, Hongwei [1]
Chen, Xiaoyang [1]
Affiliations
[1] Xidian Univ, Dept Comp Sci, Xian 710071, Shaanxi, Peoples R China
Source
IEEE ACCESS | 2018 / Volume 6
Funding
US National Science Foundation;
Keywords
Graph similarity search; top-k; representative matrix;
DOI
10.1109/ACCESS.2018.2819426
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Graph similarity search is a crucial problem in many applications, such as cheminformatics, data mining, and pattern recognition. Top-k graph similarity search aims to find the k graphs in a graph database that are most similar to a query graph. In this paper, we present a fast top-k graph similarity search algorithm with high classification accuracy. We introduce a new graph similarity measure based upon the number of occurrences of subtree patterns in graphs. To accelerate search, we also construct hierarchical representative matrices for graph databases, where each row of a matrix represents a graph set. Using representative matrices, we can derive an upper bound on the similarity between a query graph and a graph set, and use it to reduce the search space. Comprehensive experiments on real data sets demonstrate that our algorithm outperforms the compared methods in classification accuracy and query time, and that it can also scale to large data sets including 15 million chemical structure graphs.
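The pruning idea in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: the abstract does not give the similarity formula or the matrix construction, so the sketch makes three assumptions. Each graph is already reduced to a vector of subtree-pattern counts, similarity is histogram intersection (the sum of element-wise minima), and a set's representative row is the element-wise maximum of its members' vectors. The sketch also uses a single level of sets where the paper uses a hierarchy. The names `similarity`, `representative`, and `top_k` are hypothetical.

```python
from collections import Counter
import heapq


def similarity(a, b):
    """Assumed measure: histogram intersection of two pattern-count vectors."""
    return sum(min(count, b[pattern]) for pattern, count in a.items() if pattern in b)


def representative(vectors):
    """Assumed representative row: element-wise maximum over a set of vectors."""
    rep = Counter()
    for vec in vectors:
        for pattern, count in vec.items():
            rep[pattern] = max(rep[pattern], count)
    return rep


def top_k(query, groups, k):
    """Return the k best (score, graph_id) pairs and the number of sets scanned.

    groups is a list of graph sets; each set is a list of (graph_id, Counter).
    Because min(q, max_i v_i) >= min(q, v_i), similarity(query, representative)
    is an upper bound on the similarity of the query to any member of the set.
    """
    bounds = sorted(
        ((similarity(query, representative([v for _, v in g])), i)
         for i, g in enumerate(groups)),
        reverse=True,
    )
    heap = []  # min-heap holding the current k best (score, graph_id) pairs
    scanned = 0
    for bound, i in bounds:
        # No remaining set can beat the current k-th best score: stop early.
        if len(heap) == k and bound <= heap[0][0]:
            break
        scanned += 1
        for graph_id, vec in groups[i]:
            score = similarity(query, vec)
            if len(heap) < k:
                heapq.heappush(heap, (score, graph_id))
            elif score > heap[0][0]:
                heapq.heapreplace(heap, (score, graph_id))
    return sorted(heap, reverse=True), scanned
```

In a real index the representative rows would be computed once, offline, rather than per query as in this sketch.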
Pages: 21408-21417
Page count: 10
Related papers
  • [1] Fast top-k search with relaxed graph simulation
    Habi, Abdelmalek
    Effantin, Brice
    Kheddouci, Hamamache
    [J]. 2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2018, : 495 - 502
  • [2] Efficient Top-k Graph Similarity Search With GED Constraints
    Kim, Jongik
    [J]. IEEE ACCESS, 2022, 10 : 79180 - 79191
  • [3] Panther: Fast Top-k Similarity Search on Large Networks
    Zhang, Jing
    Tang, Jie
    Ma, Cong
    Tong, Hanghang
    Jing, Yu
    Li, Juanzi
    [J]. KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 1445 - 1454
  • [4] Fast and Flexible Top-k Similarity Search on Large Networks
    Zhang, Jing
    Tang, Jie
    Ma, Cong
    Tong, Hanghang
    Jing, Yu
    Li, Juanzi
    Luyten, Walter
    Moens, Marie-Francine
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2017, 36 (02)
  • [5] On Top-k Structural Similarity Search
    Lee, Pei
    Lakshmanan, Laks V. S.
    Yu, Jeffrey Xu
    [J]. 2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 774 - 785
  • [6] Fast top-k similarity search in large dynamic attributed networks
    Meng, Zaiqiao
    Shen, Hong
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (06)
  • [7] Fast top-k similarity join for SimRank
    Li, Ruiqi
    Zhao, Xiang
    Shang, Haichuan
    Chen, Yifan
    Xiao, Weidong
    [J]. INFORMATION SCIENCES, 2017, 381 : 1 - 19
  • [8] Answering Top-k Graph Similarity Queries in Graph Databases
    Zhu, Yuanyuan
    Qin, Lu
    Yu, Jeffrey Xu
    Cheng, Hong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (08) : 1459 - 1474
  • [9] Answering Top-k Representative Queries on Graph Databases
    Ranu, Sayan
    Minh Hoang
    Singh, Ambuj
    [J]. SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 1163 - 1174
  • [10] Fast Top-K Search in Knowledge Graphs
    Yang, Shengqi
    Han, Fangqiu
    Wu, Yinghui
    Yan, Xifeng
    [J]. 2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 990 - 1001