k-Selection Query over Uncertain Data

Cited by: 0
Authors
Liu, Xingjie [1]
Ye, Mao [1]
Xu, Jianliang [2]
Tian, Yuan [1]
Lee, Wang-Chien [1]
Affiliations
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16801 USA
[2] Hong Kong Baptist Coll, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
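Funding
National Science Foundation (USA)
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract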
This paper studies a new query on uncertain data, called the k-selection query. Given an uncertain dataset of N objects, where each object is associated with a preference score and a presence probability, a k-selection query returns k objects such that the expected score of the "best available" object is maximized. This query is useful in many application domains, such as entity web search and decision making. In evaluating k-selection queries, we need to compute the expected best score (EBS) for candidate k-selection sets and search for the optimal selection set with the highest EBS. Those operations are costly due to the extremely large search space. In this paper, we identify several important properties of k-selection queries, including EBS decomposition, query recursion, and EBS bounding. Based upon these properties, we first present a dynamic programming (DP) algorithm that answers the query in O(k · N) time. Further, we propose a Bounding-and-Pruning (BP) algorithm that exploits effective search space pruning strategies to find the optimal selection without accessing all objects. We evaluate the DP and BP algorithms using both synthetic and real data. The results show that the proposed algorithms outperform the baseline approach by several orders of magnitude.
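The abstract names the EBS and a DP algorithm but does not spell either out, so the following is only a minimal sketch of one way to read them, not the authors' implementation. It assumes that objects are present independently of each other, that scores are non-negative, and that an empty outcome scores 0. The function names `expected_best_score` and `k_selection_dp` are hypothetical. The recursion relies on one observation: if the highest-scored selected object has score s and presence probability p, the EBS is p·s plus (1 − p) times the EBS of the remaining selection.

```python
def expected_best_score(objects):
    """EBS of a selection given as (score, presence_probability) pairs.

    Assumes independent presence and non-negative scores.
    """
    ebs = 0.0
    none_better = 1.0  # probability that every higher-scored object is absent
    for score, prob in sorted(objects, reverse=True):
        ebs += none_better * prob * score
        none_better *= 1.0 - prob
    return ebs


def k_selection_dp(objects, k):
    """Return (best EBS, chosen objects) using O(k * N) table updates.

    Objects are sorted by descending score first; best[i][t] is the
    highest EBS reachable with at most t objects taken from objs[i:].
    """
    objs = sorted(objects, reverse=True)
    n = len(objs)
    k = min(k, n)
    best = [[0.0] * (k + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        score, prob = objs[i]
        for t in range(1, k + 1):
            # Taking objs[i]: it is the top-scored object of the sub-selection.
            take = prob * score + (1.0 - prob) * best[i + 1][t - 1]
            best[i][t] = max(best[i + 1][t], take)

    # Walk the table forward to recover one optimal selection.
    chosen, t = [], k
    for i in range(n):
        if t == 0:
            break
        score, prob = objs[i]
        take = prob * score + (1.0 - prob) * best[i + 1][t - 1]
        if take >= best[i + 1][t]:
            chosen.append(objs[i])
            t -= 1
    return best[0][k], chosen
```

For instance, `expected_best_score([(10, 0.5), (4, 1.0)])` evaluates to 0.5·10 + 0.5·1.0·4 = 7.0. The sort adds an O(N log N) step on top of the table fill, and the sketch makes no attempt at the bounding and pruning that the BP algorithm is described as using.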
Pages: 444 / +
Page count: 2