On the semantics of top-k ranking for objects with uncertain data

被引:3
|
作者
Wang, Chonghai [1 ]
Yuan, Li Yan [1 ]
You, Jia-Huai [1 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
关键词
Top-k ranking; Uncertain data; Probabilistic database; Constraint; High-dimensional space; Possible world; ALGORITHMS;
D O I
10.1016/j.camwa.2011.07.047
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The goal of top-k ranking for objects is to rank the objects so that the best k of them can be determined. In this paper we consider an object to be an entity which consists of a number of attributes whose roles in the object are determined by an aggregation function. The problem of top-ranking in this case is conceptually simple for data that are complete and certain the aggregation value of an object represents its strength and therefore its rank. For uncertain data, the semantic basis of top-k objects becomes unclear. In this paper, we formulate a semantics of top-k ranking for objects modeled by uncertain data, where the values of an object's attributes are expressed by probability distributions and constrained by some stated conditions. Under this setting, we present a theory of top-k ranking for objects so that their strengths can be determined in the presence of uncertain data. We present our theory in three stages. The first deals with discrete domains, which is extended to include continuous domains. We show that top-k ranking for objects in this context is closely related to high-dimensional space studied in mathematics. In particular, the computation of the volumes of a high-dimensional polyhedron represented by a system of linear inequations is a special case of top-k ranking under our theory. We further extend this theory to add weights to objects' positions and aggregation values in determining ranking results. We show that a number of previous proposals for top-k ranking are special cases of our theory. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2812 / 2823
页数:12
相关论文
共 50 条
  • [1] On Pruning for Top-K Ranking in Uncertain Databases
    Wang, Chonghai
    Yuan, Li Yan
    You, Jia-Huai
    Zaiane, Osmar R.
    Pei, Jian
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (10): : 598 - 609
  • [2] Efficient Computation of Top-K Skyline Objects in Data Set With Uncertain Preferences
    Sukhwani, Nitesh
    Kagita, Venkateswara Rao
    Kumar, Vikas
    Panda, Sanjaya Kumar
    [J]. INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2021, 17 (03) : 68 - 80
  • [3] Ranking uncertain sky: The probabilistic top-k skyline operator
    Zhang, Ying
    Zhang, Wenjie
    Lin, Xuemin
    Jiang, Bin
    Pei, Jian
    [J]. INFORMATION SYSTEMS, 2011, 36 (05) : 898 - 915
  • [4] Top-K Oracle: A New Way to Present Top-K Tuples for Uncertain Data
    Song, Chunyao
    Li, Zheng
    Ge, Tingjian
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 146 - 157
  • [5] Cleaning Uncertain Data for Top-k Queries
    Mo, Luyi
    Cheng, Reynold
    Li, Xiang
    Cheung, David W.
    Yang, Xuan S.
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 134 - 145
  • [6] Is Top-k Sufficient for Ranking?
    Lan, Yanyan
    Niu, Shuzi
    Guo, Jiafeng
    Cheng, Xueqi
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1261 - 1270
  • [7] Adversarial Top-K Ranking
    Suh, Changho
    Tan, Vincent Y. F.
    Zhao, Renbo
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2017, 63 (04) : 2201 - 2225
  • [8] Top-k Outlier Detection from Uncertain Data
    Salman Ahmed Shaikh
    Hiroyuki Kitagawa
    [J]. International Journal of Automation and Computing, 2014, (02) : 128 - 142
  • [9] A dynamic data structure for top-k queries on uncertain data
    Chen, Jiang
    Yi, Ke
    [J]. THEORETICAL COMPUTER SCIENCE, 2008, 407 (1-3) : 310 - 317
  • [10] Dynamic structures for top-k queries on uncertain data
    Chen, Jiang
    Yi, Ke
    [J]. ALGORITHMS AND COMPUTATION, 2007, 4835 : 427 - +