Reporting l Most Favorite Objects in Uncertain Databases with Probabilistic Reverse Top-k Queries

Cited by: 8
Authors
Xiao, Guoqing [1]
Li, Kenli [1]
Li, Keqin [2]
Affiliations
[1] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
[2] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA
Keywords
EFFICIENT;
DOI
10.1109/ICDMW.2015.47
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Top-k queries are widely studied for identifying a ranked set of the k most interesting objects based on an individual user's preferences. Reverse top-k queries take the perspective of the product manufacturer and are essential for manufacturers to assess the potential market and impact of their products. However, existing approaches to reverse top-k queries all assume that the underlying data are exact. Because of the intrinsic differences between uncertain and certain data, these methods are designed only for certain databases and cannot be applied directly to the uncertain case. Motivated by this, in this paper we first model probabilistic reverse top-k queries in the context of uncertain data. Moreover, we formulate the challenging problem of processing queries that report the l most favorite objects to users, where the impact factor of an object is defined as the cardinality of its probabilistic reverse top-k query result set. To speed up the query, we exploit several properties of probabilistic threshold top-k queries and probabilistic skyline queries to reduce the solution space of this problem. In addition, an upper bound on the number of potential users is estimated to reduce the cost of computing the probabilistic reverse top-k queries for the candidate objects. Furthermore, effective pruning heuristics are presented to further reduce the search space of query processing. Finally, efficient query algorithms that seamlessly integrate the proposed pruning strategies are presented. Extensive experiments demonstrate the efficiency and effectiveness of the proposed algorithms under various experimental settings.
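As an illustration of the quantities named in the abstract, the following is a minimal brute-force sketch, not the paper's algorithm (which relies on pruning and upper bounds not detailed in this record). It assumes a model the abstract does not spell out: each uncertain object is a set of mutually exclusive instances with probabilities summing to 1, objects are independent, each user is a weight vector, a lower weighted sum is better, and a user belongs to an object's probabilistic reverse top-k result when the object's top-k probability reaches a threshold. All function and parameter names are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

# One instance of an uncertain object: (attribute vector, probability).
Instance = Tuple[Sequence[float], float]


def score(w: Sequence[float], x: Sequence[float]) -> float:
    """Weighted sum of attributes; lower is better (an assumption)."""
    return sum(wi * xi for wi, xi in zip(w, x))


def prob_in_top_k(target: List[Instance], others: List[List[Instance]],
                  w: Sequence[float], k: int) -> float:
    """Probability that `target` ranks among the k best objects for user w."""
    total = 0.0
    for x, p in target:
        s = score(w, x)
        # dist[c] = probability that exactly c other objects beat score s;
        # the last slot (index k) absorbs "k or more".
        dist = [1.0] + [0.0] * k
        for obj in others:
            # Probability that this other object scores strictly better.
            q = sum(pj for xj, pj in obj if score(w, xj) < s)
            new = [0.0] * (k + 1)
            for c, pc in enumerate(dist):
                new[c] += pc * (1 - q)
                new[min(c + 1, k)] += pc * q
            dist = new
        # The instance is in the top-k when fewer than k objects beat it.
        total += p * sum(dist[:k])
    return total


def impact_factor(name: str, objects: Dict[str, List[Instance]],
                  users: List[Sequence[float]], k: int,
                  threshold: float) -> int:
    """Cardinality of the (assumed) probabilistic reverse top-k result set."""
    target = objects[name]
    others = [inst for n, inst in objects.items() if n != name]
    return sum(1 for w in users
               if prob_in_top_k(target, others, w, k) >= threshold)


def l_most_favorite(objects: Dict[str, List[Instance]],
                    users: List[Sequence[float]], k: int,
                    threshold: float, l: int) -> List[str]:
    """Names of the l objects with the largest impact factor (ties by name)."""
    ranked = sorted(
        objects,
        key=lambda n: (-impact_factor(n, objects, users, k, threshold), n))
    return ranked[:l]
```

This sketch evaluates every object against every user, which is exactly the cost that the pruning strategies and the upper bound mentioned in the abstract are meant to avoid.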
Pages: 1592 - 1599
Page count: 8
Related papers
50 in total
  • [1] Reporting l most influential objects in uncertain databases based on probabilistic reverse top-k queries
    Xiao, Guoqing
    Li, Kenli
    Li, Keqin
    [J]. INFORMATION SCIENCES, 2017, 405 : 207 - 226
  • [2] Probabilistic top-k dominating queries in uncertain databases
    Lian, Xiang
    Chen, Lei
    [J]. INFORMATION SCIENCES, 2013, 226 : 23 - 46
  • [3] Distributed probabilistic top-k dominating queries over uncertain databases
    Niranjan Rai
    Xiang Lian
    [J]. Knowledge and Information Systems, 2023, 65 : 4939 - 4965
  • [4] Distributed probabilistic top-k dominating queries over uncertain databases
    Rai, Niranjan
    Lian, Xiang
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (11) : 4939 - 4965
  • [5] Probabilistic Reverse Top-k Queries
    Jin, Cheqing
    Zhang, Rong
    Kang, Qiangqiang
    Zhang, Zhao
    Zhou, Aoying
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, PT I, 2014, 8421 : 406 - 419
  • [6] Identifying the Most Influential Data Objects with Reverse Top-k Queries
    Vlachou, Akrivi
    Doulkeridis, Christos
    Norvag, Kjetil
    Kotidis, Yannis
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01) : 364 - 372
  • [7] Efficient processing of top-k queries in uncertain databases
    Yi, Ke
    Li, Feifei
    Kollios, George
    Srivastava, Divesh
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1406 - +
  • [8] Semantics and evaluation of top-k queries in probabilistic databases
    Zhang, Xi
    Chomicki, Jan
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2009, 26 (01) : 67 - 126
  • [9] On the semantics and evaluation of top-k queries in probabilistic databases
    Zhang, Xi
    Chomicki, Jan
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1 AND 2, 2008, : 231 - 238
  • [10] Semantics and evaluation of top-k queries in probabilistic databases
    Xi Zhang
    Jan Chomicki
    [J]. Distributed and Parallel Databases, 2009, 26 : 67 - 126