A unified approach to ranking in probabilistic databases

被引:0
|
作者
Jian Li
Barna Saha
Amol Deshpande
机构
[1] University of Maryland,Computer Science Department
来源
The VLDB Journal | 2011年 / 20卷
关键词
Probabilistic databases; Ranking; Learning to rank; Approximation techniques; Graphical models;
D O I
暂无
中图分类号
学科分类号
摘要
Ranking is a fundamental operation in data analysis and decision support and plays an even more crucial role if the dataset being explored exhibits uncertainty. This has led to much work in understanding how to rank the tuples in a probabilistic dataset in recent years. In this article, we present a unified approach to ranking and top-k query processing in probabilistic databases by viewing it as a multi-criterion optimization problem and by deriving a set of features that capture the key properties of a probabilistic dataset that dictate the ranked result. We contend that a single, specific ranking function may not suffice for probabilistic databases, and we instead propose two parameterized ranking functions, called PRFω and PRFe, that generalize or can approximate many of the previously proposed ranking functions. We present novel generating functions-based algorithms for efficiently ranking large datasets according to these ranking functions, even if the datasets exhibit complex correlations modeled using probabilistic and/xor trees or Markov networks. We further propose that the parameters of the ranking function be learned from user preferences, and we develop an approach to learn those parameters. Finally, we present a comprehensive experimental study that illustrates the effectiveness of our parameterized ranking functions, especially PRFe, at approximating other ranking functions and the scalability of our proposed algorithms for exact or approximate ranking.
引用
收藏
页码:249 / 275
页数:26
相关论文
共 50 条
  • [1] A unified approach to ranking in probabilistic databases
    Li, Jian
    Saha, Barna
    Deshpande, Amol
    [J]. VLDB JOURNAL, 2011, 20 (02): : 249 - 275
  • [2] A Unified Approach to Ranking in Probabilistic Databases
    Li, Jian
    Saha, Barna
    Deshpande, Amol
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2009, 2 (01): : 502 - 513
  • [3] Generator-Recognizer Networks: A Unified Approach to Probabilistic Databases
    Chen, Ruiwen
    Mao, Yongyi
    Kiringa, Iluju
    [J]. 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 169 - 172
  • [4] ProUD: Probabilistic ranking in uncertain Databases
    Bernecker, Thomas
    Kriegel, Hans-Peter
    Renz, Matthias
    [J]. SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2008, 5069 : 558 - 565
  • [5] Probabilistic inverse ranking queries in uncertain databases
    Xiang Lian
    Lei Chen
    [J]. The VLDB Journal, 2011, 20 : 107 - 127
  • [6] Scalable Probabilistic Similarity Ranking in Uncertain Databases
    Bernecker, Thomas
    Kriegel, Hans-Peter
    Mamoulis, Nikos
    Renz, Matthias
    Zuefle, Andreas
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (09) : 1234 - 1246
  • [7] Efficient Probabilistic Query Ranking in Uncertain Databases
    Katukoori, Divya
    Bhima, K.
    Sri, T. Aruna
    Chowdary, S. Hemanth
    Bhattacharya, Sujoy
    [J]. GLOBAL TRENDS IN COMPUTING AND COMMUNICATION SYSTEMS, PT 1, 2012, 269 : 169 - 177
  • [8] Probabilistic inverse ranking queries in uncertain databases
    Lian, Xiang
    Chen, Lei
    [J]. VLDB JOURNAL, 2011, 20 (01): : 107 - 127
  • [9] Ranking Query Answers in Probabilistic Databases: Complexity and Efficient Algorithms
    Olteanu, Dan
    Wen, Hongkai
    [J]. 2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 282 - 293
  • [10] A Probabilistic Approach for Distillation and Ranking of Web Pages
    Greco G.
    Greco S.
    Zumpano E.
    [J]. World Wide Web, 2001, 4 (3) : 189 - 207