Efficient Keyword Search on Uncertain Graph Data

被引:38
|
作者
Yuan, Ye [1 ]
Wang, Guoren [1 ]
Chen, Lei [2 ]
Wang, Haixun [3 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
[3] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
Database; algorithm; uncertain data; graph data;
D O I
10.1109/TKDE.2012.222
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a popular search mechanism, keyword search has been applied to retrieve useful data in documents, texts, graphs, and even relational databases. However, so far, there is no work on keyword search over uncertain graph data even though the uncertain graphs have been widely used in many real applications, such as modeling road networks, influential detection in social networks, and data analysis on PPI networks. Therefore, in this paper, we study the problem of top-k keyword search over uncertain graph data. Following the similar answer definition for keyword search over deterministic graphs, we consider a subtree in the uncertain graph as an answer to a keyword query if 1) it contains all the keywords; 2) it has a high score (defined by users or applications) based on keyword matching; and 3) it has low uncertainty. Keyword search over deterministic graphs is already a hard problem as stated in [1], [2], [3]. Due to the existence of uncertainty, keyword search over uncertain graphs is much harder. Therefore, to improve the search efficiency, we employ a filtering-and-verification strategy based on a probabilistic keyword index, PKIndex. For each keyword, we offline compute path-based top-k probabilities, and attach these values to PKIndex in an optimal, compressed way. In the filtering phase, we perform existence, path-based and tree-based probabilistic pruning phases, which filter out most false subtrees. In the verification, we propose a sampling algorithm to verify the candidates. Extensive experimental results demonstrate the effectiveness of the proposed algorithms.
引用
收藏
页码:2767 / 2779
页数:13
相关论文
共 50 条
  • [1] Probabilistic Query Rewriting for Efficient and Effective Keyword Search on Graph Data
    Lei Zhang
    Tran, Thanh
    Rettinger, Achim
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (14): : 1642 - 1653
  • [2] Efficient keyword search on graph data for finding diverse and relevant answers
    Park, Chang-Sup
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2023, 19 (01) : 19 - 41
  • [3] Review on Keyword Searching Techniques in Uncertain Graph Data
    Zambare, Nikita B.
    Manekar, Swati
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 1088 - 1090
  • [4] Effective Fuzzy Keyword Search over Uncertain Data
    Song, Xiaoming
    Li, Guoliang
    Feng, Jianhua
    Zhou, Lizhu
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2009, 5463 : 66 - 70
  • [5] ANSWER GRAPH CONSTRUCTION FOR KEYWORD SEARCH ON GRAPH STRUCTURED(RDF) DATA
    Parthasarathy, K.
    Kumar, P. Sreenivasa
    Damien, Dominic
    [J]. KDIR 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2010, : 162 - 167
  • [6] Efficient Data Structure for XML Keyword Search
    Choi, Ryan H.
    Wong, Raymond K.
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2009, 5463 : 549 - 554
  • [7] An efficient SLCA-based keyword search algorithm on uncertain XML
    Zhang, Xiaolin
    Hao, Kun
    Liu, Lixin
    Zhang, Huanxiang
    [J]. Journal of Computational Information Systems, 2015, 11 (21): : 7721 - 7729
  • [8] An Improved Keyword Search on Big Data Graph with Graphics Processors
    He, Xiu
    Yang, Bo
    [J]. COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, (ISICA 2015), 2016, 575 : 390 - 397
  • [9] Efficient Keyword Search over Encrypted Cloud Data
    Meharwade, Anuradha
    Patil, G. A.
    [J]. 1ST INTERNATIONAL CONFERENCE ON INFORMATION SECURITY & PRIVACY 2015, 2016, 78 : 139 - 145
  • [10] SEPM: Efficient Partial Keyword Search on Encrypted Data
    Kawai, Yutaka
    Hirano, Takato
    Koseki, Yoshihiro
    Munaka, Tatsuji
    [J]. CRYPTOLOGY AND NETWORK SECURITY, CANS 2015, 2015, 9476 : 75 - 91