Efficient pruning for top-K ranking queries on attribute-wise uncertain datasets

被引:0
|
作者
Chen, Jianwen [1 ]
Feng, Ling [2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
关键词
Pruning; Top-K ranking query; Uncertain dataset;
D O I
10.1007/s10844-016-0403-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Top-K ranking queries in uncertain databases aim to find the top-K tuples according to a ranking function. The interplay between score and uncertainty makes top-K ranking in uncertain databases an intriguing issue, leading to rich query semantics. Recently, a unified ranking framework based on parameterized ranking functions (PRFs) has been formulated, which generalizes many previously proposed ranking semantics. Under the PRFs based ranking framework, efficient pruning approach for Top-K ranking on datasets with tuple-wise uncertainty has been well studied in the literature. However, this cannot be applied to top-K ranking on datasets with attribute-wise uncertainty, which are often natural and useful in analyzing uncertain data in many applications. This paper aims to develop efficient pruning techniques for top-K ranking on datasets with attribute-wise uncertainty under the PRFs based ranking framework, which has not been well studied in the literature. We first develop a Tuple Insertion Based Algorithm for computing each tuple's PRF value, which reduce the time cost from the state of the art cubic order of magnitude to quadratic order of magnitude. Based on the Tuple Insertion Based Algorithm, three pruning strategies are developed to further reduce the time cost. The mathematics of deriving the Tuple Insertion Based Algorithm and corresponding pruning strategies are also presented. At last, we show that our pruning algorithms can also be applied to the computation of the top-k aggregate queries. The experimental results on both real and synthetic data demonstrate the effectiveness and efficiency of the proposed pruning techniques.
引用
收藏
页码:215 / 242
页数:28
相关论文
共 50 条
  • [1] Efficient pruning for top-K ranking queries on attribute-wise uncertain datasets
    Jianwen Chen
    Ling Feng
    [J]. Journal of Intelligent Information Systems, 2017, 48 : 215 - 242
  • [2] On Pruning for Top-K Ranking in Uncertain Databases
    Wang, Chonghai
    Yuan, Li Yan
    You, Jia-Huai
    Zaiane, Osmar R.
    Pei, Jian
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (10): : 598 - 609
  • [3] Supporting Various Top-k Queries over Uncertain Datasets
    LI Wenfeng
    FU Zufa
    WANG Liwei
    LI Deyi
    PENG Zhiyong
    [J]. Wuhan University Journal of Natural Sciences, 2014, 19 (01) : 84 - 92
  • [4] Efficient processing of top-k queries in uncertain databases
    Yi, Ke
    Li, Feifei
    Kollios, George
    Srivastava, Divesh
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1406 - +
  • [5] An efficient algorithm for top-k queries on uncertain data streams
    Dai, Caiyan
    Chen, Ling
    Chen, Yixin
    Tang, Keming
    [J]. 2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 294 - 299
  • [6] An Efficient Optimization Approach for Top-k Queries on Uncertain Data
    Zhang, Zhiqiang
    Wei, Xiaoyan
    Xie, Xiaoqin
    Pan, Haiwei
    Miao, Yu
    [J]. INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2018, 27 (01)
  • [7] Efficient Pruning Algorithm for Top-K Ranking on Dataset with Value Uncertainty
    Chen, Jianwen
    Feng, Ling
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2231 - 2236
  • [8] Top-k Dominating Queries on Incremental Datasets
    Wu, Jimmy Ming-Tai
    Wang, Ke
    Lin, Jerry Chun-Wei
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS. DASFAA 2022 INTERNATIONAL WORKSHOPS, 2022, 13248 : 79 - 88
  • [9] Solution for Queries for Top-K Relevant Attribute
    Debbarma, Anamika
    Saravanan, P.
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2015, : 1520 - 1524
  • [10] Cleaning Uncertain Data for Top-k Queries
    Mo, Luyi
    Cheng, Reynold
    Li, Xiang
    Cheung, David W.
    Yang, Xuan S.
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 134 - 145