Finding top-k min-cost connected trees in databases

被引:0
|
作者
Ding, Bolin [1 ]
Yu, Jeffrey Xu [1 ]
Wang, Shan [2 ]
Qin, Lu [1 ]
Zhang, Xiao [2 ]
Lin, Xuemin [3 ]
机构
[1] Chinese Univ Hong Kong, Sha Tin 100083, Hong Kong, Peoples R China
[2] Renmin Univ China, MOE China, Key Lab Data Engn &Knowledge Engn, Beijing, Peoples R China
[3] Univ New South Wales, NICTA, Sydney, NSW 2052, Australia
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
It is widely realized that the integration of database and information retrieval techniques will provide users with a wide range of high quality services. In this paper we study processing an l-keyword query, p(1), p(2),...,p(l), against a relational database which can be modeled as a weighted graph, G(V, E). Here V is a set of nodes (tuples) and E is a set of edges representing foreign key references between tuples. Let V-i subset of V be a set of nodes that contain the keyword p(i). We study finding top-k minimum cost connected trees that contain at least one node in every subset V-i, and denote our problem as GST-k- When k = 1, it is known as a minimum cost group Steiner tree problem which is NP-Complete. We observe that the number of keywords, l, is small, and propose a novel parameterized solution, with l as a parameter to find the optimal GST-1, in time complexity O(3(l)n + 2(l)((l + log n)n + m)), where n and m are the numbers of nodes and edges in graph G. Our solution can handle graphs with a large number of nodes. Our GST-1 solution can be easily extended to support GST-k, which outperforms the existing GST-k solutions over both weighted undirected/directed graphs. We conducted extensive experimental studies, and report our finding.
引用
收藏
页码:811 / +
页数:2
相关论文
共 50 条
  • [1] Finding Top-k Fuzzy Frequent Itemsets from Databases
    Li, Haifeng
    Wang, Yue
    Zhang, Ning
    Zhang, Yuejin
    [J]. DATA MINING AND BIG DATA, DMBD 2017, 2017, 10387 : 22 - 30
  • [2] Finding Top-k Profitable Products
    Wan, Qian
    Wong, Raymond Chi-Wing
    Peng, Yu
    [J]. IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 1055 - 1066
  • [3] Finding Top-k Preferable Products
    Peng, Yu
    Wong, Raymond Chi-Wing
    Wan, Qian
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (10) : 1774 - 1788
  • [4] Top-k Differential Queries in Graph Databases
    Vasilyeva, Elena
    Thiele, Maik
    Bornhoevd, Christof
    Lehner, Wolfgang
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS (ADBIS 2014), 2014, 8716 : 112 - 125
  • [5] Shooting top-k stars in uncertain databases
    Xiang Lian
    Lei Chen
    [J]. The VLDB Journal, 2011, 20 : 819 - 840
  • [6] On Pruning for Top-K Ranking in Uncertain Databases
    Wang, Chonghai
    Yuan, Li Yan
    You, Jia-Huai
    Zaiane, Osmar R.
    Pei, Jian
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (10): : 598 - 609
  • [7] Top-k query processing in uncertain Databases
    Soliman, Mohamed A.
    Ilyas, Ihab F.
    Chang, Kevin Chen-Chuan
    [J]. 2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 871 - +
  • [8] Shooting top-k stars in uncertain databases
    Lian, Xiang
    Chen, Lei
    [J]. VLDB JOURNAL, 2011, 20 (06): : 819 - 840
  • [9] Finding Top-k Shortest Paths with Diversity
    Liu, Huiping
    Jin, Cheqing
    Yang, Bin
    Zhou, Aoying
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (03) : 488 - 502
  • [10] Finding Top-k Answers in Keyword Search over Relational Databases Using Tuple Units
    Feng, Jianhua
    Li, Guoliang
    Wang, Jianyong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (12) : 1781 - 1794