Reporting l Most Favorite Objects in Uncertain Databases with Probabilistic Reverse Top-k Queries

被引：8

作者：

Xiao, Guoqing ^{[1
]}

Li, Kenli ^{[1
]}

Li, Keqin ^{[2
]}

机构：

[1] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China

[2] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA

来源：

2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW) | 2015年

关键词：

EFFICIENT;

D O I：

10.1109/ICDMW.2015.47

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Top-k queries are widely studied for identifying a ranked set of the k most interesting objects based on the individual user preference. Reverse top-k queries are proposed from the perspective of the product manufacturer, which are essential for manufacturers to assess the potential market and impacts of their products. However, the existing approaches for reverse top-k queries are all based on the assumption that the underlying data are exact. Due to the intrinsic differences between uncertain and certain data, these methods are designed only in certain databases and cannot be applied to uncertain case directly. Motivated by this, in this paper, we firstly model the probabilistic reverse top-k queries in the context of uncertain data. Moreover, we formulate the challenging problem of processing queries that report l most favorite objects to users, where impact factor of an object is defined as the cardinality of the probabilistic reverse top-k query result set. For speeding up the query, we exploit several properties of probabilistic threshold top-k queries and probabilistic skyline queries to reduce the solution space of this problem. In addition, an upper bound of the potential users is estimated to reduce the cost of computing the probabilistic reverse top-k queries for the candidate objects. Furthermore, effective pruning heuristics are presented to further reduce the search space of query processing. Finally, efficient query algorithms are presented seamlessly with integration of the proposed pruning strategies. Extensive experiments demonstrate the efficiency and effectiveness of our proposed algorithms with various experimental settings.

引用

页码：1592 / 1599

页数：8

共 50 条

[1] Reporting l most influential objects in uncertain databases based on probabilistic reverse top-k queries
Xiao, Guoqing
Li, Kenli
Li, Keqin
[J]. INFORMATION SCIENCES, 2017, 405 : 207 - 226
[2] Probabilistic top-k dominating queries in uncertain databases
Lian, Xiang
Chen, Lei
[J]. INFORMATION SCIENCES, 2013, 226 : 23 - 46
[3] Distributed probabilistic top-k dominating queries over uncertain databases
Niranjan Rai
Xiang Lian
[J]. Knowledge and Information Systems, 2023, 65 : 4939 - 4965
[4] Distributed probabilistic top-k dominating queries over uncertain databases
Rai, Niranjan
Lian, Xiang
[J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (11) : 4939 - 4965
[5] Probabilistic Reverse Top-k Queries
Jin, Cheqing
Zhang, Rong
Kang, Qiangqiang
Zhang, Zhao
Zhou, Aoying
[J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, PT I, 2014, 8421 : 406 - 419
[6] Identifying the Most Influential Data Objects with Reverse Top-k Queries
Vlachou, Akrivi
Doulkeridis, Christos
Norvag, Kjetil
Kotidis, Yannis
[J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01): : 364 - 372
[7] Efficient processing of top-k queries in uncertain databases
Yi, Ke
Li, Feifei
Kollios, George
Srivastava, Divesh
[J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1406 - +
[8] Semantics and evaluation of top-k queries in probabilistic databases
Zhang, Xi
Chomicki, Jan
[J]. DISTRIBUTED AND PARALLEL DATABASES, 2009, 26 (01) : 67 - 126
[9] On the semantics and evaluation of top-k queries in probabilistic databases
Zhang, Xi
Chomicki, Jan
[J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1 AND 2, 2008, : 231 - 238
[10] Semantics and evaluation of top-k queries in probabilistic databases
Xi Zhang
Jan Chomicki
[J]. Distributed and Parallel Databases, 2009, 26 : 67 - 126

← 1 2 3 4 5 →