Fast Top-K Graph Similarity Search Via Representative Matrices

Cited by: 2
Authors
Sun, Zhigang [1]
Huo, Hongwei [1]
Chen, Xiaoyang [1]
Affiliation
[1] Xidian Univ, Dept Comp Sci, Xian 710071, Shaanxi, Peoples R China
Source
IEEE ACCESS | 2018 / Vol. 6
Funding
National Science Foundation (United States);
Keywords
Graph similarity search; top-k; representative matrix;
DOI
10.1109/ACCESS.2018.2819426
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Graph similarity search is a crucial problem in many applications, such as cheminformatics, data mining, and pattern recognition. Top-k graph similarity search aims to find the most similar k graphs to a query graph in graph databases. In this paper, we present a fast top-k graph similarity search algorithm with high classification accuracy. We introduce a new graph similarity measure based upon the number of occurrences of subtree patterns in graphs. In order to accelerate search, we also construct hierarchical representative matrices for graph databases, where each row of the matrices represents a graph set. Using representative matrices, we can derive a similarity upper bound of a query graph and the graph set so as to reduce search space. Comprehensive experiments on real data sets demonstrate that our algorithm has a better performance than compared methods on classification accuracy and query time, and it also can scale to large data sets including 15 million chemical structure graphs.
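The pruning idea in the abstract can be illustrated with a minimal sketch. This record does not give the paper's actual similarity measure or matrix construction, so everything below is an assumption made for illustration: each graph is reduced to a vector of subtree-pattern occurrence counts, similarity is taken to be the sum of per-pattern minimum counts, and each representative row is the element-wise maximum over one graph set. Under those assumptions the score of the query against a representative row can never be lower than its score against any graph in that set, so a set whose bound does not beat the current k-th best score can be skipped. The names `similarity`, `representative`, and `top_k` are hypothetical, and the sketch uses a single level of graph sets rather than a hierarchy.

```python
import heapq
from typing import List, Sequence, Tuple

Vector = Sequence[int]  # assumed encoding: one occurrence count per subtree pattern


def similarity(q: Vector, g: Vector) -> int:
    """Assumed measure (not taken from the paper): sum of per-pattern minimum counts."""
    return sum(min(a, b) for a, b in zip(q, g))


def representative(vectors: List[Vector]) -> List[int]:
    """Assumed representative row for a graph set: element-wise maximum of its vectors."""
    return [max(column) for column in zip(*vectors)]


def top_k(query: Vector, groups: List[List[Tuple[str, Vector]]], k: int):
    """Return (results, pruned): the k best (score, graph_id) pairs and the number of skipped sets."""
    # Upper bound for each set: similarity of the query to the set's representative row.
    bounded = sorted(
        ((similarity(query, representative([v for _, v in grp])), grp) for grp in groups),
        key=lambda pair: pair[0],
        reverse=True,
    )
    heap: List[Tuple[int, str]] = []  # min-heap holding the current best k results
    pruned = 0
    for bound, grp in bounded:
        # No graph in this set can beat the current k-th best score, so skip the whole set.
        if len(heap) == k and bound <= heap[0][0]:
            pruned += 1
            continue
        for graph_id, vec in grp:
            score = similarity(query, vec)
            if len(heap) < k:
                heapq.heappush(heap, (score, graph_id))
            elif score > heap[0][0]:
                heapq.heapreplace(heap, (score, graph_id))
    return sorted(heap, reverse=True), pruned


# Toy data: two graph sets, three subtree patterns.
groups = [
    [("g1", [3, 0, 1]), ("g2", [2, 1, 0])],
    [("g3", [0, 0, 1]), ("g4", [0, 1, 0])],
]
results, pruned = top_k([2, 1, 1], groups, k=2)
```

In this toy run the second set is skipped without scoring its graphs, because its bound is below the scores already found in the first set. Taking the element-wise maximum keeps the bound valid for the min-sum measure assumed here; a different measure would need its own bound.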
Pages: 21408 - 21417
Page count: 10
Related papers
50 in total
  • [21] Fast and Exact Top-k Search for Random Walk with Restart
    Fujiwara, Yasuhiro
    Nakatsuji, Makoto
    Onizuka, Makoto
    Kitsuregawa, Masaru
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (05) : 442 - 453
  • [22] Approximate top-k structural similarity search over XML documents
    Xie, T
    Sha, CF
    Wang, XL
    Zhou, AY
    [J]. FRONTIERS OF WWW RESEARCH AND DEVELOPMENT - APWEB 2006, PROCEEDINGS, 2006, 3841 : 319 - 330
  • [23] Semantic Enhanced Top-k Similarity Search on Heterogeneous Information Networks
    Yu, Minghe
    Zhang, Yun
    Zhang, Tiancheng
    Yu, Ge
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT III, 2020, 12114 : 104 - 119
  • [24] Subspace Similarity Search Using the Ideas of Ranking and Top-k Retrieval
    Bernecker, Thomas
    Emrich, Tobias
    Graf, Franz
    Kriegel, Hans-Peter
    Kroeger, Peer
    Renz, Matthias
    Schubert, Erich
    Zimek, Arthur
    [J]. 2010 IEEE 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDE 2010), 2010, : 4 - 9
  • [25] Fast Top-k Similar Sequence Search on DNA Databases
    Yagi, Ryuichi
    Shiokawa, Hiroaki
    [J]. INFORMATION INTEGRATION AND WEB INTELLIGENCE, IIWAS 2022, 2022, 13635 : 145 - 150
  • [26] Top-k String Similarity Search with Edit-Distance Constraints
    Deng, Dong
    Li, Guoliang
    Feng, Jianhua
    Li, Wen-Syan
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 925 - 936
  • [27] Fast Action Detection via Discriminative Random Forest Voting and Top-K Subvolume Search
    Yu, Gang
    Goussies, Norberto A.
    Yuan, Junsong
    Liu, Zicheng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (03) : 507 - 517
  • [28] Graph Encryption for Top-K Nearest Keyword Search Queries on Cloud
    Liu, Chang
    Zhu, Liehuang
    Chen, Jinjun
    [J]. IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2017, 2 (04) : 371 - 381
  • [29] Top-k String Similarity Joins
    Qi, Shuyao
    Bouros, Panagiotis
    Mamoulis, Nikos
    [J]. PROCEEDINGS OF THE 32TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2020, 2020,
  • [30] Top-k Tree Similarity Join
    Wang, Jianhua
    Yang, Jianye
    Zhang, Wenjie
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 1939 - 1948