Scalable Graph Similarity Search in Large Graph Databases

被引:0
|
作者
Kiran, P. [1 ]
Sivadasan, Naveen [2 ]
机构
[1] NSS Coll Engn, Palakkad, India
[2] TCS Innovat Labs, Hyderabad, Andhra Pradesh, India
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider the problem of searching a collection of graphs D to find graphs that are most similar to a query graph Q. This has several applications in areas like computational biology, drug design, computational chemistry, collaborative networks, social networks etc. We use graphlet kernel to define similarity between graphs. In order to make the similarity search faster, we build an efficient nearest neighbor data structure on the graph collection using locality sensitive hashing technique. The graphs are embedded into a vector space and these vectors are used to build the nearest neighbor data structure. Computing the vector space embedding is the most compute intensive part in our algorithm. To scale our algorithm to large graph collections, we give an efficient Map-Reduce implementation for vector space embedding. We perform experiments on real world datasets (AIDS dataset) and synthetic datasets to show the effectiveness of our algorithm.
引用
收藏
页码:207 / 211
页数:5
相关论文
共 50 条
  • [1] Graph similarity search on large uncertain graph databases
    Yuan, Ye
    Wang, Guoren
    Chen, Lei
    Wang, Haixun
    [J]. VLDB JOURNAL, 2015, 24 (02): : 271 - 296
  • [2] Graph similarity search on large uncertain graph databases
    Ye Yuan
    Guoren Wang
    Lei Chen
    Haixun Wang
    [J]. The VLDB Journal, 2015, 24 : 271 - 296
  • [3] Efficient Graph Similarity Search Over Large Graph Databases
    Zheng, Weiguo
    Zou, Lei
    Lian, Xiang
    Wang, Dong
    Zhao, Dongyan
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (04) : 964 - 978
  • [4] Scalable Supergraph Search in Large Graph Databases
    Lyu, Bingqing
    Qin, Lu
    Lin, Xuemin
    Chang, Lijun
    Yu, Jeffrey Xu
    [J]. 2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 157 - 168
  • [5] Graph Similarity Search with Edit Distance Constraint in Large Graph Databases
    Zheng, Weiguo
    Zou, Lei
    Lian, Xiang
    Wang, Dong
    Zhao, Dongyan
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1595 - 1600
  • [6] Efficient Subgraph Similarity Search on Large Probabilistic Graph Databases
    Yuan, Ye
    Wang, Guoren
    Chent, Lei
    Wang, Haixun
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (09): : 800 - 811
  • [7] EmbAssi: embedding assignment costs for similarity search in large graph databases
    Bause, Franka
    Schubert, Erich
    Kriege, Nils M.
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (05) : 1728 - 1755
  • [8] EmbAssi: embedding assignment costs for similarity search in large graph databases
    Franka Bause
    Erich Schubert
    Nils M. Kriege
    [J]. Data Mining and Knowledge Discovery, 2022, 36 : 1728 - 1755
  • [9] GHashing: Semantic Graph Hashing for Approximate Similarity Search in Graph Databases
    Qin, Zongyue
    Bai, Yunsheng
    Sun, Yizhou
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 2062 - 2072
  • [10] Geometric Graph Indexing for Similarity Search in Scientific Databases
    Armiti, Ayser
    Gertz, Michael
    [J]. 28TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM) 2016), 2016,